S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models Paper • 2604.01168 • Published 23 days ago • 7
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published 23 days ago • 41
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published 19 days ago • 41
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 14 days ago • 75
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 11 days ago • 98
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper • 2604.06132 • Published 18 days ago • 117
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 16 days ago • 284
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 9 days ago • 63
view article Article PangolinGuard: Fine-Tuning ModernBERT as a Lightweight Approach to AI Guardrails Mar 23, 2025 • 13
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 23 days ago • 877
DFlash Collection Block Diffusion for Flash Speculative Decoding • 14 items • Updated 8 days ago • 75