Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding • Paper • 2605.29707 • Published 7 days ago • 135
gradients-io-tournaments/tournament-exp-qwen-1.5b-test-ed5f2570-6088-4b63-8edd-7e797eddbb3c-5Exp355e 7B • Updated 4 days ago • 31 • 1
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information • Paper • 2605.11609 • Published 23 days ago • 195
ReactiveGWM: Steering NPC in Reactive Game World Models • Paper • 2605.15256 • Published 21 days ago • 28
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents • Paper • 2605.05185 • Published 29 days ago • 102
From Context to Skills: Can Language Models Learn from Context Skillfully? • Paper • 2604.27660 • Published May 3 • 166