Hugging Face
Mark
Makrrr
0 followers · 2 following
AI & ML interests
NLP, RLHF, IR
Recent Activity
Upvoted a paper 13 days ago: Adaptation of Agentic AI
Upvoted a paper 2 months ago: DeepSeek-OCR: Contexts Optical Compression
New activity 2 months ago in Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl: Can we have the training setting?
Makrrr's models (13)
Makrrr/qwen3-8B-reasonmed-finetune-extreme • Text Generation • 8B • Updated Jul 24, 2025 • 9
Makrrr/qwen2.5-7B-reasonmed-finetune-extreme • Text Generation • 8B • Updated Jul 23, 2025 • 8
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl • Reinforcement Learning • 2B • Updated Jul 5, 2025 • 85 • 2
Makrrr/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated May 31, 2025 • 2
Makrrr/Pyramids • Reinforcement Learning • Updated May 30, 2025 • 13
Makrrr/ppo-SnowballTarget • Reinforcement Learning • Updated May 30, 2025 • 13
Makrrr/Pixelcopter-PLE-v0 • Reinforcement Learning • Updated May 29, 2025
Makrrr/Cartpole-v1 • Reinforcement Learning • Updated May 29, 2025
Makrrr/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated May 28, 2025 • 7
Makrrr/QTable-Taxi-V3 • Reinforcement Learning • Updated May 28, 2025
Makrrr/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated May 28, 2025
Makrrr/ppo-Huggy • Reinforcement Learning • Updated May 27, 2025 • 27
Makrrr/ppo-LunarLander-v2 • Reinforcement Learning • Updated May 27, 2025 • 2