Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RLLab
's Collections
Gemma-3-Text
DPO
Secure-Code-Generation
RL-Dataset
DPO
updated
11 days ago
Upvote
-
allenai/Olmo-3-7B-Instruct-SFT
Text Generation
•
7B
•
Updated
Jan 5
•
119k
•
4
RLLab/olmo-3-7b-it-sft
Text Generation
•
7B
•
Updated
Dec 18, 2025
•
16
RLLab/allenai-Dolci-Instruct-DPO-Filtered
Viewer
•
Updated
26 days ago
•
125k
•
95
RLLab/OpenR1-Math-220K-Filtered-DPO
Viewer
•
Updated
18 days ago
•
79.3k
•
45
allenai/Dolci-Instruct-SFT-No-Tools
Viewer
•
Updated
Jan 5
•
1.92M
•
264
•
4
RLLab/Dolci-Instruct-SFT-No-Tools-Filtered
Viewer
•
Updated
11 days ago
•
1.92M
•
13
Upvote
-
Share collection
View history
Collection guide
Browse collections