HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing • Paper • 2602.03560 • Published 15 days ago • 44
Training Data Efficiency in Multimodal Process Reward Models • Paper • 2602.04145 • Published 15 days ago • 76
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science • Paper • 2510.16872 • Published Oct 19, 2025 • 109