Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model 24 minutes ago: inference-optimization/Qwen3.6-35B-A3B-NVFP4
published a model 25 minutes ago: inference-optimization/Qwen3.6-35B-A3B-NVFP4
updated a collection about 17 hours ago: HIGGS-per-tensor