BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning Paper ⢠2603.04918 ⢠Published 7 days ago ⢠54
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Paper ⢠2602.10934 ⢠Published 29 days ago ⢠49
Prism: Spectral-Aware Block-Sparse Attention Paper ⢠2602.08426 ⢠Published about 1 month ago ⢠36
MOVA: Towards Scalable and Synchronized Video-Audio Generation Paper ⢠2602.08794 ⢠Published about 1 month ago ⢠156
Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars Paper ⢠2602.01538 ⢠Published Feb 2 ⢠15
Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models Paper ⢠2602.02185 ⢠Published Feb 2 ⢠115
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing Paper ⢠2602.02437 ⢠Published Feb 2 ⢠77
AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts Paper ⢠2601.20730 ⢠Published Jan 28 ⢠19
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization Paper ⢠2601.16480 ⢠Published Jan 23 ⢠51
view reply @sysia48 , I think the comment is random (or at least pseudo random š ). Yes I also received this harassment with no reason, really frustrating š
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper ⢠2512.04677 ⢠Published Dec 4, 2025 ⢠174
Patient-Similarity Cohort Reasoning in Clinical Text-to-SQL Paper ⢠2601.09876 ⢠Published Jan 14 ⢠7
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper ⢠2601.15876 ⢠Published Jan 22 ⢠91