-
Structured Denoising Diffusion Models in Discrete State-Spaces
Paper • 2107.03006 • Published • 1 -
Simplified and Generalized Masked Diffusion for Discrete Data
Paper • 2406.04329 • Published • 8 -
Simple and Effective Masked Diffusion Language Models
Paper • 2406.07524 • Published • 12 -
Large Language Diffusion Models
Paper • 2502.09992 • Published • 123
Collections
Discover the best community collections!
Collections including paper arxiv:2505.15809
-
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Paper • 2505.24864 • Published • 142 -
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
Paper • 2506.05010 • Published • 79 -
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Paper • 2506.05301 • Published • 56 -
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
Paper • 2505.16933 • Published • 34
-
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Paper • 2505.15045 • Published • 54 -
MMaDA: Multimodal Large Diffusion Language Models
Paper • 2505.15809 • Published • 97 -
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
Paper • 2505.21600 • Published • 71 -
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO
Paper • 2505.22453 • Published • 46
-
Test-Time Scaling with Reflective Generative Model
Paper • 2507.01951 • Published • 107 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 151 -
Autoregressive Diffusion Models
Paper • 2110.02037 • Published -
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 8
-
Discrete Diffusion in Large Language and Multimodal Models: A Survey
Paper • 2506.13759 • Published • 43 -
GSAI-ML/LLaDA-8B-Instruct
Text Generation • 8B • Updated • 239k • 337 -
Dream-org/Dream-v0-Base-7B
Text Generation • 8B • Updated • 332k • 51 -
Dream-org/Dream-v0-Instruct-7B
Text Generation • 8B • Updated • 92.5k • 144
-
MMaDA: Multimodal Large Diffusion Language Models
Paper • 2505.15809 • Published • 97 -
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Paper • 2505.15045 • Published • 54 -
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation
Paper • 2506.18095 • Published • 66 -
Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models
Paper • 2506.19103 • Published • 42
-
Structured Denoising Diffusion Models in Discrete State-Spaces
Paper • 2107.03006 • Published • 1 -
Simplified and Generalized Masked Diffusion for Discrete Data
Paper • 2406.04329 • Published • 8 -
Simple and Effective Masked Diffusion Language Models
Paper • 2406.07524 • Published • 12 -
Large Language Diffusion Models
Paper • 2502.09992 • Published • 123
-
Test-Time Scaling with Reflective Generative Model
Paper • 2507.01951 • Published • 107 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 151 -
Autoregressive Diffusion Models
Paper • 2110.02037 • Published -
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 8
-
Discrete Diffusion in Large Language and Multimodal Models: A Survey
Paper • 2506.13759 • Published • 43 -
GSAI-ML/LLaDA-8B-Instruct
Text Generation • 8B • Updated • 239k • 337 -
Dream-org/Dream-v0-Base-7B
Text Generation • 8B • Updated • 332k • 51 -
Dream-org/Dream-v0-Instruct-7B
Text Generation • 8B • Updated • 92.5k • 144
-
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Paper • 2505.24864 • Published • 142 -
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
Paper • 2506.05010 • Published • 79 -
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Paper • 2506.05301 • Published • 56 -
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
Paper • 2505.16933 • Published • 34
-
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Paper • 2505.15045 • Published • 54 -
MMaDA: Multimodal Large Diffusion Language Models
Paper • 2505.15809 • Published • 97 -
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
Paper • 2505.21600 • Published • 71 -
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO
Paper • 2505.22453 • Published • 46
-
MMaDA: Multimodal Large Diffusion Language Models
Paper • 2505.15809 • Published • 97 -
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Paper • 2505.15045 • Published • 54 -
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation
Paper • 2506.18095 • Published • 66 -
Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models
Paper • 2506.19103 • Published • 42