World Action Models: The Next Frontier in Embodied AI Paper • 2605.12090 • Published 10 days ago • 64
MOVA: Towards Scalable and Synchronized Video-Audio Generation Paper • 2602.08794 • Published Feb 9 • 159
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems Paper • 2506.16381 • Published Jun 19, 2025 • 4
view article Article The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs SII-xrliu • Nov 15, 2025 • 15