quinn's picture

quinn

jwhe

·

AI & ML interests

None yet

Recent Activity

new activity about 3 hours ago

harborframework/parity-experiments:[Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)

new activity 10 days ago

harborframework/parity-experiments:[Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)

authored a paper about 2 months ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

View all activity

Organizations

New activity in harborframework/parity-experiments about 3 hours ago

[Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)

#230 opened about 3 hours ago by

New activity in harborframework/parity-experiments 10 days ago

[Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)

#210 opened 10 days ago by