Hugging Face
Saksham Loonker
SLoonker
saksham-loonker
AI & ML interests
I am very interested in RL and other post-training methods, as well as building efficient LLMs for resource-constrained settings.
Organizations
None yet
SLoonker's activity
updated a dataset about 17 hours ago
SLoonker/Grok-Code-Fast-1-Distillation-Done-By-GPT5.4 • Viewer • Updated about 16 hours ago • 500
published a dataset about 17 hours ago
SLoonker/Grok-Code-Fast-1-Distillation-Done-By-GPT5.4 • Viewer • Updated about 16 hours ago • 500
updated 2 datasets about 1 month ago
SLoonker/Mistral-Small-4-Reasoning-315x • Viewer • Updated Mar 23 • 315 • 4
SLoonker/Kimi-K2.5-Reasoning-300x • Viewer • Updated Mar 18 • 300 • 15
published 2 datasets about 1 month ago
SLoonker/Kimi-K2.5-Reasoning-300x • Viewer • Updated Mar 18 • 300 • 15
SLoonker/Mistral-Small-4-Reasoning-315x • Viewer • Updated Mar 23 • 315 • 4
New activity in SLoonker/RL-Claude-Reasoning-SFT about 1 month ago
which claude model? • 1 • #2 opened about 2 months ago by Roman1111111
updated a collection about 2 months ago
Small RL Datasets For Training • Collection • 6 items • Updated Mar 5
New activity in SLoonker/RL-OpenCodeReasoning-DPO about 2 months ago
Added License Of Source • #1 opened about 2 months ago by SmartAnon
updated a dataset about 2 months ago
SLoonker/RL-Claude-Creative-Writing-DPO • Viewer • Updated Mar 5 • 818 • 19
published a dataset about 2 months ago
SLoonker/RL-Claude-Creative-Writing-DPO • Viewer • Updated Mar 5 • 818 • 19
updated a dataset about 2 months ago
SLoonker/RL-STEM-DPO • Viewer • Updated Mar 5 • 1.51k • 14
published a dataset about 2 months ago
SLoonker/RL-STEM-DPO • Viewer • Updated Mar 5 • 1.51k • 14
updated a dataset about 2 months ago
SLoonker/RL-OpenCodeReasoning-DPO • Viewer • Updated Mar 5 • 2.75k • 22
published a dataset about 2 months ago
SLoonker/RL-OpenCodeReasoning-DPO • Viewer • Updated Mar 5 • 2.75k • 22