metadata
library_name: diffusers
license: mit
pipeline_tag: text-to-image
base_model:
  - black-forest-labs/FLUX.1-dev

Model Summary

This model is FLUX.1-dev fine-tuned with GRPO on the UniGenBench training set, using UnifiedReward-Flex as the reward model.
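For readers unfamiliar with GRPO, the sketch below illustrates the group-relative advantage step that gives the method its name: several images are sampled for one prompt, each is scored by a reward model, and the scores are normalized within the group. This is a generic illustration, not the training code for this model. The function name, the `eps` value, the use of the population standard deviation, and the reward numbers are all assumptions made for the example.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize reward scores within one group of samples for the same prompt.

    Hypothetical helper for illustration only; the actual training recipe
    for this model is not described in this card.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    # eps keeps the division finite when every sample gets the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up reward scores for four images sampled from a single prompt.
rewards = [0.2, 0.5, 0.9, 0.4]
advantages = group_relative_advantages(rewards)
```

Samples scored above the group mean receive a positive advantage and are reinforced; samples below the mean receive a negative one.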

🚀 The inference code is available on GitHub.
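As a rough starting point, the sketch below shows how a FLUX.1-dev based checkpoint is typically loaded with `diffusers`. It assumes this repository loads as a standard `FluxPipeline`, which this card does not confirm; the official inference code may differ. `MODEL_ID` is a placeholder to replace with this model's repository id, and the sampling settings are common FLUX.1-dev defaults rather than values recommended by the authors.

```python
MODEL_ID = "<this-model-repo-id>"  # placeholder: replace with the actual repository id


def generation_settings(prompt, steps=50, guidance_scale=3.5, size=1024):
    """Collect sampling arguments; the defaults are assumptions, not author settings."""
    return {
        "prompt": prompt,
        "height": size,
        "width": size,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


def generate(prompt, model_id=MODEL_ID, seed=0):
    """Load the pipeline and render one image (needs torch, diffusers, accelerate)."""
    # Heavy imports stay inside the function so the module imports cheaply.
    import torch
    from diffusers import FluxPipeline

    # Assumption: the checkpoint is stored in the standard diffusers pipeline layout.
    pipe = FluxPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    pipe.enable_model_cpu_offload()  # lowers peak GPU memory at some speed cost

    generator = torch.Generator("cpu").manual_seed(seed)  # reproducible sampling
    result = pipe(**generation_settings(prompt), generator=generator)
    return result.images[0]
```

Calling `generate("a red fox reading a book in a library").save("sample.png")` would download the weights on first use and write a single image to disk.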

For further details, please refer to the following resources:

Qualitative Results

image

image

Quantitative Results

image

Citation

@article{unifiedreward-flex,
  title={Unified Personalized Reward Model for Vision Generation},
  author={Wang, Yibin and Zang, Yuhang and Han, Feng and Bu, Jiazi and Zhou, Yujie and Jin, Cheng and Wang, Jiaqi},
  journal={arXiv preprint arXiv:2602.02380},
  year={2026}
}