Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
55
9
24
Banghua Zhu
banghua
Follow
tuantm's profile picture
GEMCorp's profile picture
seroe's profile picture
57 followers
Β·
18 following
https://banghua.me
BanghuaZ
BHZ-BER
AI & ML interests
Foundation models, reinforcement learning, statistics, information theory
Recent Activity
liked
a dataset
4 days ago
nvidia/Nemotron-RL-math-OpenMathReasoning
updated
a dataset
20 days ago
nvidia/Nemotron-RL-math-OpenMathReasoning
published
a dataset
26 days ago
nvidia/Nemotron-RL-instruction_following-structured_outputs
View all activity
Organizations
banghua
's models
33
Sort:Β Recently updated
banghua/Qwen2.5-0.5B-DPO
Text Generation
β’
0.5B
β’
Updated
Jun 12
β’
20
banghua/Qwen2.5-0.5B-GRPO
Text Generation
β’
0.5B
β’
Updated
Jun 9
β’
16
banghua/Qwen3-0.6B-SFT
Text Generation
β’
0.6B
β’
Updated
May 6
β’
19
β’
1
banghua/yi-34b-rm-400
Updated
Jan 22, 2024
banghua/openchat-3.5-ppo-nn-ckpt5k
Text Generation
β’
9B
β’
Updated
Dec 29, 2023
β’
5
banghua/openchat-3.5-ppo-n-ckpt7k5
Text Generation
β’
9B
β’
Updated
Dec 29, 2023
β’
5
banghua/openchat-3.5-ppo-n-ckpt6k5
Text Generation
β’
9B
β’
Updated
Dec 26, 2023
β’
6
banghua/openchat-3.5-ppo-n-ckpt6k
Text Generation
β’
9B
β’
Updated
Dec 26, 2023
β’
8
banghua/openchat-3.5-1210-bin
Text Generation
β’
Updated
Dec 16, 2023
β’
16
banghua/openchat3.5_apa2_log_ckpt4k
Text Generation
β’
9B
β’
Updated
Dec 14, 2023
β’
6
banghua/openchat3.5_apa2_log_ckpt4k5
Text Generation
β’
9B
β’
Updated
Dec 13, 2023
β’
7
banghua/openchat-3.5-p3o-tog-rf-ckpt6k
Text Generation
β’
7B
β’
Updated
Dec 9, 2023
β’
4
banghua/openchat-3.5-p3o-tog-rf-ckpt4k
Text Generation
β’
7B
β’
Updated
Dec 9, 2023
β’
5
banghua/openchat-3.5-p3o-tog-rf-ckpt2k
Text Generation
β’
7B
β’
Updated
Dec 9, 2023
β’
4
banghua/openchat-3.5-p3o-tog-rf-ckpt3k
Text Generation
β’
7B
β’
Updated
Dec 9, 2023
β’
5
banghua/openchat-3.5-p3o-tog-rf-ckpt5k
Text Generation
β’
7B
β’
Updated
Dec 8, 2023
β’
5
banghua/openchat-3.5-p3o-tog-rf-ckpt1k
Text Generation
β’
7B
β’
Updated
Dec 8, 2023
β’
5
banghua/openhermes-dpo-ckpt20k
Updated
Dec 8, 2023
β’
2
banghua/openhermes-dpo-ckpt9k5
Updated
Dec 6, 2023
β’
4
banghua/openchat-3.5-apa-tog-ckpt4k5
Text Generation
β’
7B
β’
Updated
Dec 5, 2023
β’
14
banghua/openchat3.5_apa_log_ckpt4k5
Text Generation
β’
7B
β’
Updated
Dec 5, 2023
β’
6
banghua/openchat-3.5-apa-tog-ckpt4k
Text Generation
β’
7B
β’
Updated
Dec 4, 2023
β’
6
banghua/openchat-3.5-apa-tog-ckpt3k5
Text Generation
β’
7B
β’
Updated
Dec 4, 2023
β’
4
banghua/openchat3.5_apa_log_ckpt3k5
Text Generation
β’
7B
β’
Updated
Dec 4, 2023
β’
5
banghua/openchat3.5_apa_ckpt5k
Text Generation
β’
7B
β’
Updated
Dec 2, 2023
β’
6
banghua/openchat-3.5-dpo-ckpt4k
Updated
Dec 2, 2023
β’
3
banghua/openchat-3.5-apa-ckpt4k
Text Generation
β’
7B
β’
Updated
Dec 2, 2023
β’
6
banghua/n_rm
Updated
Nov 28, 2023
banghua/pairwise_rm_epoch1
Updated
Nov 28, 2023
banghua/refine_rm
Updated
Nov 16, 2023
Previous
1
2
Next