Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 7 days ago • 152 • 4
Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion Paper • 2605.12825 • Published 8 days ago • 11 • 2
Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs Paper • 2605.12460 • Published 8 days ago • 17 • 2
PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks Paper • 2605.10977 • Published 11 days ago • 10 • 2
LoopUS: Recasting Pretrained LLMs into Looped Latent Refinement Models Paper • 2605.11011 • Published 10 days ago • 9 • 2
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States Paper • 2605.07579 • Published 12 days ago • 16 • 3
$δ$-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 8 days ago • 118 • 3
SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting Paper • 2605.07243 • Published 12 days ago • 4 • 3
Large Language Models Explore by Latent Distilling Paper • 2604.24927 • Published 23 days ago • 74 • 7
SWE-chat: Coding Agent Interactions From Real Users in the Wild Paper • 2604.20779 • Published 28 days ago • 15 • 5
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published about 1 month ago • 94 • 4
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published Apr 1 • 50 • 8