In a Training Loop 🔄

sirynoma

uavleeva

·

Suchotin

AI & ML interests

None yet

Recent Activity

updated a collection 13 days ago

Multitask RLVR using GRPO (HSE Project)

Organizations

updated a collection 13 days ago

Multitask RLVR using GRPO (HSE Project)

15 items • Updated 13 days ago

updated 2 models 13 days ago

uavleeva/grpo_math_run_level3_all_rewards_001

Updated 13 days ago

uavleeva/grpo_mixed_run_004

Updated 13 days ago

published a model 13 days ago

uavleeva/grpo_mixed_run_004

Updated 13 days ago

updated 2 models 13 days ago

uavleeva/grpo_merged_math_sql_code_ties_001

Text Generation • Updated 13 days ago • 7

uavleeva/grpo_merged_math_sql_code_linear_001

Text Generation • Updated 13 days ago

published 2 models 13 days ago

uavleeva/grpo_merged_math_sql_code_ties_001

Text Generation • Updated 13 days ago • 7

uavleeva/grpo_merged_math_sql_code_linear_001

Text Generation • Updated 13 days ago

updated a model 13 days ago

uavleeva/grpo_code_run_002

Updated 13 days ago

published a model 13 days ago

uavleeva/grpo_code_run_002

Updated 13 days ago

updated 2 models 13 days ago

uavleeva/grpo_sql_run_002

Updated 13 days ago

uavleeva/grpo_sql_run_005

Updated 13 days ago

published a model 13 days ago

uavleeva/grpo_sql_run_005

Updated 13 days ago