Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2602.02380

UnifiedReward Flex

Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published 14 days ago • 20
CodeGoat24/FLUX.2-klein-base-9B-UnifiedReward-Flex-lora

Text-to-Image • Updated 9 days ago • 321 • 15
CodeGoat24/Wan2.2-T2V-A14B-UnifiedReward-Flex-lora

Text-to-Video • Updated 7 days ago • 112 • 8
CodeGoat24/Wan2.1-T2V-14B-UnifiedReward-Flex-lora

Text-to-Video • Updated 9 days ago • 96 • 6

E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models

Paper • 2601.00423 • Published Jan 1 • 10
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 226
FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning

Paper • 2601.18150 • Published 22 days ago • 7
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

Paper • 2601.20218 • Published 20 days ago • 15

Reinforcement learning

Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning

Paper • 2407.20798 • Published Jul 30, 2024 • 24
Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Paper • 2601.14250 • Published 27 days ago • 47
Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published 14 days ago • 20

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning

Paper • 2506.22434 • Published Jun 27, 2025 • 10
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79
RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10, 2025 • 73
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

UnifiedReward Flex

Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published 14 days ago • 20
CodeGoat24/FLUX.2-klein-base-9B-UnifiedReward-Flex-lora

Text-to-Image • Updated 9 days ago • 321 • 15
CodeGoat24/Wan2.2-T2V-A14B-UnifiedReward-Flex-lora

Text-to-Video • Updated 7 days ago • 112 • 8
CodeGoat24/Wan2.1-T2V-14B-UnifiedReward-Flex-lora

Text-to-Video • Updated 9 days ago • 96 • 6

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer

Paper • 2601.14250 • Published 27 days ago • 47
Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published 14 days ago • 20

E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models

Paper • 2601.00423 • Published Jan 1 • 10
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 226
FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning

Paper • 2601.18150 • Published 22 days ago • 7
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

Paper • 2601.20218 • Published 20 days ago • 15

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning

Paper • 2506.22434 • Published Jun 27, 2025 • 10
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79
RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10, 2025 • 73
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

Reinforcement learning

Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning

Paper • 2407.20798 • Published Jul 30, 2024 • 24
Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs