Delta Belief RL Collection • Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction". • 6 items • Updated 13 days ago • 1
Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision Paper • 2509.14234 • Published Sep 17, 2025 • 6
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs Paper • 2509.09677 • Published Sep 11, 2025 • 35 • 4