Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
30
Jin Zhu
mamba413
Follow
Kyleyee's profile picture
Eehan's profile picture
callmespring's profile picture
3 followers
·
5 following
https://mamba413.github.io/
Mamba413
AI & ML interests
None yet
Recent Activity
updated
a Space
2 days ago
stats-powered-ai/StatDetectLLM
liked
a dataset
18 days ago
fancyzhx/ag_news
authored
a paper
about 1 month ago
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
View all activity
Organizations
mamba413
's models
10
Sort:Â Recently updated
mamba413/Qwen2.5-1.5B-PPO-DR-HH-Seed1
2B
•
Updated
Mar 21, 2025
•
2
mamba413/Qwen2.5-1.5B-PPO-BENCH-HH-Seed1
2B
•
Updated
Mar 21, 2025
•
3
mamba413/Qwen2.5-1.5B-Instruct-Reward-BENCH-HH-Seed1
2B
•
Updated
Mar 21, 2025
•
1
mamba413/Qwen2.5-1.5B-Instruct-Reward-BENCH-HH-Seed0
Updated
Mar 20, 2025
mamba413/Qwen2.5-1.5B-Instruct-Reward-DR-HH-Seed0
Updated
Mar 20, 2025
mamba413/Qwen2-0.5B-Reward-DR-HH-Seed0
Text Classification
•
0.5B
•
Updated
Mar 19, 2025
•
6
mamba413/Qwen2.5-1.5B-Reward-DR-IMDB-Seed0
Updated
Mar 18, 2025
mamba413/Qwen2.5-1.5B-Reward-DR-SIMU-Seed0
Updated
Mar 18, 2025
mamba413/Qwen2-0.5B-Reward-DR-SIMU-Seed0
Text Classification
•
0.5B
•
Updated
Mar 16, 2025
•
4
mamba413/Qwen2-0.5B-Reward-DR-SIMU
Text Classification
•
0.5B
•
Updated
Mar 15, 2025
•
7