thwannbe/Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill-Persona-Mixed Text Generation • 8B • Updated 24 days ago • 192
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Distill-Persona-Mixed Text Generation • 8B • Updated 24 days ago • 221
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated 26 days ago • 261
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Persona-Mixed Text Generation • 8B • Updated 26 days ago • 224
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Sft-Persona-Mixed Text Generation • 8B • Updated about 1 month ago • 190
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill Text Generation • 8B • Updated about 1 month ago • 191