Phi PRO
Xalphinions
AI & ML interests
None yet
Recent Activity
updated a collection 6 days ago
MARS
updated a collection 6 days ago
MARS
upvoted a paper 8 days ago
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping
Organizations
models 18
Xalphinions/MARS-Qwen2.5-7B-blk16
8B • Updated • 10
Xalphinions/MARS-Qwen2.5-7B-blk8
8B • Updated • 12
Xalphinions/MARS-Qwen2.5-0.5B-blk8
0.6B • Updated • 25
Xalphinions/MARS-Qwen2.5-0.5B-AR-SFT
0.6B • Updated • 41
Xalphinions/MARS-Qwen2.5-0.5B-blk16
0.6B • Updated • 23
Xalphinions/MARS-Qwen2.5-0.5B-blk8-no-sft
0.6B • Updated • 25
Xalphinions/MARS-Qwen2.5-0.5B-blk4-no-sft
0.6B • Updated • 28
Xalphinions/MARS-Qwen2.5-0.5B-blk4
0.6B • Updated • 34
Xalphinions/MARS-Qwen2.5-0.5B-blk16-no-sft
0.6B • Updated • 28
Xalphinions/MARS-Qwen2.5-7B-blk4
8B • Updated • 37
datasets 10
Xalphinions/olmo-verified-data
Viewer • Updated • 21.1k • 12
Xalphinions/transformed_long_story_short_no_strip
Viewer • Updated • 17.6k • 4 • 1
Xalphinions/UltraFeedback_with_tie_armorm_1e-2_non_zero_2
Viewer • Updated • 22k • 4
Xalphinions/UltraFeedback_with_tie_armorm_1e-2_non_zero
Viewer • Updated • 22k • 7
Xalphinions/UltraFeedback_with_tie_armorm_1e-4
Viewer • Updated • 20.3k • 7
Xalphinions/UltraFeedback_with_tie_armorm
Viewer • Updated • 26.6k • 7
Xalphinions/UltraFeedback_with_tie_strict
Viewer • Updated • 65.1k • 9
Xalphinions/llama3_ultrafeedback_with_tie_armorm
Viewer • Updated • 61.8k • 23
Xalphinions/UltraFeedback_with_tie
Viewer • Updated • 133k • 9
Xalphinions/llama3_ultrafeedback_with_tie
Viewer • Updated • 61.8k • 5