Update README.md
Browse files
README.md
CHANGED
|
@@ -21,7 +21,7 @@ base_model:
|
|
| 21 |
`Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
|
| 22 |
|
| 23 |
For further details, please refer to the following resources:
|
| 24 |
-
- 📰 Paper:
|
| 25 |
- 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/
|
| 26 |
- 🤗 Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
|
| 27 |
- 🤗 Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
|
|
@@ -95,5 +95,10 @@ print(text_outputs[0])
|
|
| 95 |
## Citation
|
| 96 |
|
| 97 |
```
|
| 98 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 99 |
```
|
|
|
|
| 21 |
`Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
|
| 22 |
|
| 23 |
For further details, please refer to the following resources:
|
| 24 |
+
- 📰 Paper: https://arxiv.org/pdf/2503.05236
|
| 25 |
- 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/
|
| 26 |
- 🤗 Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
|
| 27 |
- 🤗 Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
|
|
|
|
| 95 |
## Citation
|
| 96 |
|
| 97 |
```
|
| 98 |
+
@article{UnifiedReward,
|
| 99 |
+
title={Unified Reward Model for Multimodal Understanding and Generation.},
|
| 100 |
+
author={Wang, Yibin and Zang, Yuhang, and Li, Hao and Jin, Cheng and Wang Jiaqi},
|
| 101 |
+
journal={arXiv preprint arXiv:2503.05236},
|
| 102 |
+
year={2025}
|
| 103 |
+
}
|
| 104 |
```
|