Macaron-A2UI: A Model for Generative UI in Personal Agents Paper • 2605.24830 • Published 8 days ago • 79
SOD: Step-wise On-policy Distillation for Small Language Model Agents Paper • 2605.07725 • Published 24 days ago • 25
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 12 days ago • 204
Babsie/Qwen3.6-27B-Heretic2-Uncensored-Finetune-Thinking Image-Text-to-Text • 27B • Updated 10 days ago • 38 • 2
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 19 days ago • 269
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents Paper • 2604.04979 • Published Apr 4 • 10
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 115