Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process Paper • 2512.23988 • Published Dec 30, 2025 • 17
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time Paper • 2512.25075 • Published Dec 31, 2025 • 15
Guiding a Diffusion Transformer with the Internal Dynamics of Itself Paper • 2512.24176 • Published Dec 30, 2025 • 8
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published Dec 30, 2025 • 51
AdaGaR: Adaptive Gabor Representation for Dynamic Scene Reconstruction Paper • 2601.00796 • Published Jan 2 • 31
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning Paper • 2512.24146 • Published Dec 30, 2025 • 14
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published Jan 6 • 28
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 225
The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models Paper • 2601.03425 • Published Jan 6 • 16
AgentOCR: Reimagining Agent History via Optical Self-Compression Paper • 2601.04786 • Published Jan 8 • 29
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published Jan 12 • 32
Beyond Hard Masks: Progressive Token Evolution for Diffusion Language Models Paper • 2601.07351 • Published Jan 12 • 26
Dr. Zero: Self-Evolving Search Agents without Training Data Paper • 2601.07055 • Published Jan 11 • 20
User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale Paper • 2601.08225 • Published Jan 13 • 52
The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents Paper • 2601.07264 • Published Jan 12 • 24
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices Paper • 2601.08303 • Published Jan 13 • 17
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning Paper • 2601.09708 • Published Jan 14 • 53
Flow Equivariant World Models: Memory for Partially Observed Dynamic Environments Paper • 2601.01075 • Published Jan 3 • 6
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 147
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper • 2601.10714 • Published Jan 15 • 31
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published Jan 14 • 33
Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text Paper • 2601.10355 • Published Jan 15 • 39
Language of Thought Shapes Output Diversity in Large Language Models Paper • 2601.11227 • Published about 1 month ago • 9
More Images, More Problems? A Controlled Analysis of VLM Failure Modes Paper • 2601.07812 • Published Jan 12 • 6
Toward Efficient Agents: Memory, Tool learning, and Planning Paper • 2601.14192 • Published 26 days ago • 54
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published 24 days ago • 51
PROGRESSLM: Towards Progress Reasoning in Vision-Language Models Paper • 2601.15224 • Published 25 days ago • 12
360Anything: Geometry-Free Lifting of Images and Videos to 360° Paper • 2601.16192 • Published 24 days ago • 8
MeepleLM: A Virtual Playtester Simulating Diverse Subjective Experiences Paper • 2601.07251 • Published Jan 12 • 11
KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices Paper • 2601.21579 • Published 18 days ago • 6
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 17 days ago • 99
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning Paper • 2601.21468 • Published 18 days ago • 21
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis Paper • 2602.03139 • Published 13 days ago • 41
Rethinking the Trust Region in LLM Reinforcement Learning Paper • 2602.04879 • Published 11 days ago • 31
Protein Autoregressive Modeling via Multiscale Structure Generation Paper • 2602.04883 • Published 11 days ago • 3
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 14 days ago • 32
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published 7 days ago • 252
Reliable and Responsible Foundation Models: A Comprehensive Survey Paper • 2602.08145 • Published 12 days ago • 8
Col-Bandit: Zero-Shot Query-Time Pruning for Late-Interaction Retrieval Paper • 2602.02827 • Published 13 days ago • 2
Stable Velocity: A Variance Perspective on Flow Matching Paper • 2602.05435 • Published 11 days ago • 3
The Pensieve Paradigm: Stateful Language Models Mastering Their Own Context Paper • 2602.12108 • Published 4 days ago • 13
Free(): Learning to Forget in Malloc-Only Reasoning Models Paper • 2602.08030 • Published 8 days ago • 5
When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models Paper • 2602.10179 • Published 5 days ago • 6