Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking Paper • 2606.03985 • Published 1 day ago • 28
VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion Paper • 2605.30351 • Published 7 days ago • 24
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 11 days ago • 27
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published 6 days ago • 51
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation Paper • 2605.31264 • Published 6 days ago • 100
AdaState: Self-Evolving Anchors for Streaming Video Generation Paper • 2605.30349 • Published 7 days ago • 11
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 7 days ago • 132
GEM: Generative Supervision Helps Embodied Intelligence Paper • 2605.28548 • Published 8 days ago • 40
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 8 days ago • 58
Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration Paper • 2605.17423 • Published 18 days ago • 33
On-Policy Adversarial Flow Distillation for Autoregressive Video Generation Paper • 2605.26105 • Published 10 days ago • 18
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published 10 days ago • 101
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 13 days ago • 45
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 13 days ago • 219
One Sentence, One Drama: Personalized Short-Form Drama Generation via Multi-Agent Systems Paper • 2605.22144 • Published 14 days ago • 10
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 16 days ago • 185
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 17 days ago • 78
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation Paper • 2605.11739 • Published 22 days ago • 59