Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
Amir Hossein Yari
AmirHossein2002
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
6 days ago
AmirHossein2002/gemma-3-4b-it-grpo-high
published
a model
6 days ago
AmirHossein2002/gemma-3-4b-it-grpo-high
updated
a model
6 days ago
AmirHossein2002/gemma-3-4b-it-grpo
View all activity
Organizations
None yet
AmirHossein2002
's models
29
Sort: Recently updated
AmirHossein2002/gemma-3-4b-it-grpo-high
Updated
6 days ago
AmirHossein2002/gemma-3-4b-it-grpo
Updated
6 days ago
AmirHossein2002/gemma-3-4b-it-grpo-low
Updated
6 days ago
•
5
AmirHossein2002/gemma-3-4b-it-grpo-dpo
Updated
7 days ago
AmirHossein2002/Qwen2.5-7B-Instruct-gspo-low
Text Generation
•
Updated
14 days ago
•
6
AmirHossein2002/Qwen2.5-7B-Instruct-gspo-high
Text Generation
•
Updated
14 days ago
•
34
AmirHossein2002/Qwen2.5-7B-Instruct-gspo-dpo
Text Generation
•
Updated
15 days ago
•
28
AmirHossein2002/Qwen2.5-7B-Instruct-gspo
Text Generation
•
Updated
15 days ago
•
41
AmirHossein2002/Qwen2.5-3B-Instruct-calib-grpo-low
Text Generation
•
Updated
21 days ago
•
20
AmirHossein2002/Qwen2.5-3B-Instruct-calib-grpo-dpo
Text Generation
•
Updated
22 days ago
•
34
AmirHossein2002/Qwen2.5-3B-Instruct-calib-grpo
Text Generation
•
Updated
23 days ago
•
73
AmirHossein2002/Llama-3.2-3B-Instruct-calib-grpo-dpo-high
Text Generation
•
Updated
23 days ago
•
25
AmirHossein2002/Llama-3.2-3B-Instruct-calib-grpo-dpo
Text Generation
•
Updated
23 days ago
•
35
AmirHossein2002/Qwen2.5-7B-Instruct-calib-grpo-dpo
Text Generation
•
Updated
25 days ago
•
51
AmirHossein2002/Llama-3.2-3B-Instruct-calib-grpo
Text Generation
•
Updated
25 days ago
•
26
AmirHossein2002/Qwen2.5-7B-Instruct-calib-grpo-dpo-high
Text Generation
•
Updated
26 days ago
•
67
AmirHossein2002/Qwen2.5-7B-Instruct-calib-grpo-dpo-low
Text Generation
•
Updated
26 days ago
•
62
AmirHossein2002/Qwen2.5-7B-Instruct-calib-grpo
Text Generation
•
Updated
27 days ago
•
51
AmirHossein2002/Qwen2.5-Math-1.5B-Instruct-grpo
Text Generation
•
Updated
27 days ago
•
13
AmirHossein2002/Qwen2.5-7B-Instruct-grpo-dpo-low
Text Generation
•
Updated
28 days ago
•
47
AmirHossein2002/Llama-3.2-3B-Instruct-grpo-dpo-high
Text Generation
•
Updated
28 days ago
•
52
AmirHossein2002/Llama-3.2-3B-Instruct-grpo-dpo-low
Text Generation
•
Updated
about 1 month ago
•
35
AmirHossein2002/Llama-3.2-3B-Instruct-grpo-dpo
Text Generation
•
Updated
about 1 month ago
•
33
AmirHossein2002/Qwen2.5-7B-Instruct-grpo-dpo
Text Generation
•
Updated
about 1 month ago
•
38
AmirHossein2002/Llama-3.2-3B-Instruct-grpo
Text Generation
•
Updated
about 1 month ago
•
36
AmirHossein2002/Qwen2.5-7B-Instruct-grpo-dpo-high
Text Generation
•
Updated
Nov 28
•
4
AmirHossein2002/Qwen2.5-7B-Instruct-drgrpo
Text Generation
•
Updated
Nov 27
•
7
AmirHossein2002/Qwen2.5-7B-Instruct-grpo
Text Generation
•
Updated
Nov 27
•
4
AmirHossein2002/Qwen2.5-7B-Instruct-dapo
Text Generation
•
Updated
Nov 27
•
12