Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR Paper • 2605.15726 • Published 4 days ago • 25
Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding Paper • 2605.02290 • Published 15 days ago • 32
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation Paper • 2605.11739 • Published 6 days ago • 49
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper • 2605.14386 • Published 5 days ago • 54
Long Context Pre-Training with Lighthouse Attention Paper • 2605.06554 • Published 12 days ago • 23
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 6 days ago • 149
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 5 days ago • 45
PreScam: A Benchmark for Predicting Scam Progression from Early Conversations Paper • 2605.12243 • Published 7 days ago • 2
LLM-based Detection of Manipulative Political Narratives Paper • 2605.14354 • Published 5 days ago • 3
PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution Paper • 2605.13027 • Published 6 days ago • 6
Adaptive Teacher Exposure for Self-Distillation in LLM Reasoning Paper • 2605.11458 • Published 7 days ago • 5
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? Paper • 2605.06527 • Published 12 days ago • 42
From Pixels to Concepts: Do Segmentation Models Understand What They Segment? Paper • 2605.09591 • Published 9 days ago • 2
The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs Paper • 2605.08737 • Published 10 days ago • 3
FeatCal: Feature Calibration for Post-Merging Models Paper • 2605.13030 • Published 6 days ago • 7
Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion Paper • 2605.12825 • Published 7 days ago • 11
Many-Shot CoT-ICL: Making In-Context Learning Truly Learn Paper • 2605.13511 • Published 6 days ago • 30