CodeGoat24's Collections
UnifiedReward Training Data
Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • 122
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Paper • 2505.03318 • 92
CodeGoat24/UnifiedReward-2.0-T2X-score-data
Viewer • 337k • 207
CodeGoat24/ImageGen-CoT-Reward-5K
Viewer • 5.54k • 124 • 1
CodeGoat24/LLaVA-Critic-113k
Preview • 200
Viewer • 21.4k • 95
CodeGoat24/ShareGPTVideo-DPO
Viewer • 101k • 103
Viewer • 29k • 197
Preview • 165
Viewer • 73.2k • 86
Viewer • 72.7k • 104
Viewer • 19k • 75