R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests
Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization
Recent Activity
liked
a dataset 17 days ago
BAAI/Chinese-LiPS liked
a dataset 17 days ago
PleIAs/YouTube-Commons new activity
about 1 month ago
mispeech/GLAP:Model Weight