Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
sirynoma
uavleeva
Follow
0 followers
·
1 following
Suchotin
AI & ML interests
None yet
Recent Activity
updated
a collection
13 days ago
Multitask RLVR using GRPO (HSE Project)
updated
a collection
13 days ago
Multitask RLVR using GRPO (HSE Project)
updated
a collection
13 days ago
Multitask RLVR using GRPO (HSE Project)
View all activity
Organizations
uavleeva
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a collection
13 days ago
Multitask RLVR using GRPO (HSE Project)
Collection
15 items
•
Updated
13 days ago
updated
2 models
13 days ago
uavleeva/grpo_math_run_level3_all_rewards_001
Updated
13 days ago
uavleeva/grpo_mixed_run_004
Updated
13 days ago
published
a model
13 days ago
uavleeva/grpo_mixed_run_004
Updated
13 days ago
updated
2 models
13 days ago
uavleeva/grpo_merged_math_sql_code_ties_001
Text Generation
•
Updated
13 days ago
•
7
uavleeva/grpo_merged_math_sql_code_linear_001
Text Generation
•
Updated
13 days ago
published
2 models
13 days ago
uavleeva/grpo_merged_math_sql_code_ties_001
Text Generation
•
Updated
13 days ago
•
7
uavleeva/grpo_merged_math_sql_code_linear_001
Text Generation
•
Updated
13 days ago
updated
a model
13 days ago
uavleeva/grpo_code_run_002
Updated
13 days ago
published
a model
13 days ago
uavleeva/grpo_code_run_002
Updated
13 days ago
updated
2 models
13 days ago
uavleeva/grpo_sql_run_002
Updated
13 days ago
uavleeva/grpo_sql_run_005
Updated
13 days ago
published
a model
13 days ago
uavleeva/grpo_sql_run_005
Updated
13 days ago
Load more