Qwen3.5-9B LoRA SFT distillation: R7 (86.8% eval) + R8 calibration. Datasets, FP16 checkpoints, and pipeline docs.
lee
cudabenchmarktest
AI & ML interests
Finetuning small language models, maintaining quality chain of thought, refusal and abliteration, along with novel reasoning distillation techniques. When you are wrestling for possession of a sword, the man with the handle always wins.
Recent Activity
liked a dataset 3 days ago
caiovicentino1/Qwen3.6-35B-A3B-mcr-stage-b liked a model 3 days ago
Jackrong/Qwopus-GLM-18B-Merged-GGUF liked a model 4 days ago
unsloth/GLM-5.1-GGUFOrganizations
None yet