This collection contains curriculum-RLed Olmo models.
SeanWang0027 PRO
SeanWang0027
AI & ML interests
Continual Learning
Recent Activity
updated a dataset about 10 hours ago
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b published a dataset about 10 hours ago
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b published a model about 10 hours ago
SeanWang0027/mixed_sdft_solution_sudoku_qwen3_4b_thinking_1_epoch_8192_32_batch_2e-5_lr_qwen3_1_7b