z-lab/gpt-oss-20b-DFlash
Text Generation
•
0.8B
•
Updated
•
37
•
1
Efficient AI
DFlash: Block Diffusion for Flash Speculative Decoding
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference