Extending Reinforcement Learning for LLMs with Flow Environment
SII-Jhao Zhang
JingHaoZ
AI & ML interests
Large Reasoning Model, Unified Understanding and Generation in MLLM
Recent Activity
updated
a dataset
24 days ago
JingHaoZ/RLFR-Dataset-LM
upvoted
a
paper
25 days ago
TiDAR: Think in Diffusion, Talk in Autoregression
upvoted
a
paper
about 2 months ago
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image
Generation