arxiv:2307.15343
Logesh Kumar umapathi
infinitylogesh
AI & ML interests
NLP - Healthcare , Information retrieval , Open domain question answering.
Organizations
models 19
infinitylogesh/svg_rl_grpo_qwen3-coder-30b-a3b-instruct_ckpt_71
31B • Updated
infinitylogesh/svg_rl_grpo_qwen3-coder-30b-a3b-instruct
31B • Updated
infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline_50
2B • Updated
• 1
infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline_100
2B • Updated
• 1
infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline
2B • Updated
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt50
2B • Updated
• 2
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt100
2B • Updated
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_rollout_16_fullfinetuning_merged
2B • Updated
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Single-Stage-Rollout-16-Full-Finetuning
2B • Updated
infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-2
Text Generation • 2B • Updated
datasets 10
infinitylogesh/svg-rl-dataset
Viewer
• Updated
• 2.53k • 31
infinitylogesh/math_12k_gt
Viewer
• Updated
• 12.5k • 10
infinitylogesh/math_12k
Viewer
• Updated
• 12.5k • 19
infinitylogesh/yupp-svg-20251204_rendered
Viewer
• Updated
• 3.53k • 7
infinitylogesh/book_dataset_no_mem_token_gte_largev1_5_M512_C1024_1B
Viewer
• Updated
• 606k • 75
infinitylogesh/math_12k_srt_single_stage
Viewer
• Updated
• 12.5k • 8
infinitylogesh/math_12k_srt_splits
Viewer
• Updated
• 13.5k • 18
infinitylogesh/eval_grab_toy_car_smolvla
Viewer
• Updated
• 3.23k • 19
infinitylogesh/eval_grab_toy_car_act
Viewer
• Updated
• 1.2k • 6
infinitylogesh/eval_desk_setup_record_1_cam_smolvla_2
Viewer
• Updated
• 497 • 7