RLRM

community

Recent Activity

DongfuJiang authored a paper 1 day ago

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

DongfuJiang authored a paper 1 day ago

Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

DongfuJiang authored a paper about 1 month ago

RewardHarness: Self-Evolving Agentic Post-Training

RLRM's models (1)

RLRM/big_math_rl_pair_ct_7B

8B • Updated Mar 26, 2025