VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published 4 days ago • 17
AgentOCR: Reimagining Agent History via Optical Self-Compression Paper • 2601.04786 • Published 5 days ago • 25
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 4 days ago • 45
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 5 days ago • 144
GARDO: Reinforcing Diffusion Models without Reward Hacking Paper • 2512.24138 • Published 14 days ago • 28
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published 14 days ago • 57
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published 27 days ago • 30
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing Paper • 2512.17909 • Published 25 days ago • 36
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation Paper • 2512.16913 • Published 26 days ago • 33
StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors Paper • 2512.16915 • Published 26 days ago • 37
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning Paper • 2512.13874 • Published 29 days ago • 16
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling Paper • 2512.15702 • Published 27 days ago • 14
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 27 days ago • 42
A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning Paper • 2512.14442 • Published 28 days ago • 10
OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation Paper • 2512.08294 • Published Dec 9, 2025 • 17