Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning
Paper
•
2601.20829
•
Published
•
6
Collection for the paper: Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning