RLRM

community

AI & ML interests

None defined yet.

Recent Activity

DongfuJiang authored a paper about 20 hours ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

DongfuJiang authored a paper about 20 hours ago

Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

DongfuJiang authored a paper about 1 month ago

RewardHarness: Self-Evolving Agentic Post-Training

View all activity

models 1

RLRM/big_math_rl_pair_ct_7B

8B • Updated Mar 26, 2025

datasets 2

RLRM/Big-Math-RL-Verified-CT-7B

Viewer • Updated Mar 14, 2025 • 251k • 126

RLRM/Big-Math-RL-Verified-CT

Viewer • Updated Mar 14, 2025 • 251k • 6