thwannbe/Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill-Persona-Mixed Text Generation • 8B • Updated 27 days ago • 192
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Distill-Persona-Mixed Text Generation • 8B • Updated 27 days ago • 225
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated 29 days ago • 265
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Persona-Mixed Text Generation • 8B • Updated 29 days ago • 226
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill Text Generation • 8B • Updated Feb 5 • 188