Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview
Text Generation • 31B • Updated • 135 • 55
None defined yet.
On the Step Length Confounding in LLM Reasoning Data Selection
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning