Date: 2025-10-14

Single-process model-parallel SFT with LoRA FP16 (T4×2). Answer-only loss. Time-capped.
See usage snippet in repo.
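For readers unfamiliar with the term, "answer-only loss" usually means that prompt tokens are excluded from the loss so that only answer tokens contribute. The sketch below shows one common way to do this by masking labels. It is an illustration under stated assumptions, not the training script used for this model: the function name `answer_only_labels`, the `prompt_len` argument, and the token ids are hypothetical. The value `-100` is the default `ignore_index` of PyTorch's `CrossEntropyLoss`.

```python
IGNORE_INDEX = -100  # default ignore_index of torch.nn.CrossEntropyLoss


def answer_only_labels(input_ids, prompt_len, ignore_index=IGNORE_INDEX):
    """Build labels from input_ids, masking the first `prompt_len` tokens.

    Hypothetical helper: positions set to `ignore_index` are skipped by the
    loss, so only the answer tokens are trained on.
    """
    if not 0 <= prompt_len <= len(input_ids):
        raise ValueError("prompt_len must be between 0 and len(input_ids)")
    return [ignore_index] * prompt_len + list(input_ids[prompt_len:])


# Made-up token ids: 4 prompt tokens followed by 3 answer tokens.
labels = answer_only_labels([101, 7, 8, 9, 42, 43, 2], prompt_len=4)
print(labels)
```

In a real pipeline, `prompt_len` would typically come from tokenizing the chat-templated prompt on its own and measuring its length.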