- Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
  Paper • 2602.01058 • Published • 39
- PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss
  Paper • 2602.02493 • Published • 42
- Scaling Embeddings Outperforms Scaling Experts in Language Models
  Paper • 2601.21204 • Published • 99
- OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
  Paper • 2602.04804 • Published • 46

Collections including paper arxiv:2602.02493
- OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
  Paper • 2601.15369 • Published • 20
- Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
  Paper • 2601.15892 • Published • 53
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
  Paper • 2601.16208 • Published • 51
- NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
  Paper • 2601.11004 • Published • 30

- MMGR: Multi-Modal Generative Reasoning
  Paper • 2512.14691 • Published • 119
- KlingAvatar 2.0 Technical Report
  Paper • 2512.13313 • Published • 43
- SemanticGen: Video Generation in Semantic Space
  Paper • 2512.20619 • Published • 93
- DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
  Paper • 2512.16676 • Published • 218

- yandex/stable-diffusion-3.5-medium-alchemist
  Text-to-Image • Updated • 13 • 6
- Ovis-U1 Technical Report
  Paper • 2506.23044 • Published • 61
- FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
  Paper • 2507.01953 • Published • 18
- LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
  Paper • 2507.01945 • Published • 76

- BitNet: Scaling 1-bit Transformers for Large Language Models
  Paper • 2310.11453 • Published • 106
- Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
  Paper • 2310.11511 • Published • 79
- In-Context Learning Creates Task Vectors
  Paper • 2310.15916 • Published • 44
- Matryoshka Diffusion Models
  Paper • 2310.15111 • Published • 45

- ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
  Paper • 2512.13586 • Published • 93
- TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows
  Paper • 2512.05150 • Published • 75
- TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
  Paper • 2512.16093 • Published • 95
- PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss
  Paper • 2602.02493 • Published • 42

- Test-Time Scaling with Reflective Generative Model
  Paper • 2507.01951 • Published • 108
- Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
  Paper • 2502.05171 • Published • 152
- Autoregressive Diffusion Models
  Paper • 2110.02037 • Published
- EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
  Paper • 2502.09509 • Published • 8

- Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
  Paper • 2501.18585 • Published • 61
- LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!
  Paper • 2502.07374 • Published • 40
- Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
  Paper • 2502.06703 • Published • 152
- S*: Test Time Scaling for Code Generation
  Paper • 2502.14382 • Published • 63