arxiv:2510.01582
๐๏ธ Building on HF
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
published a model 3 days ago
RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-FP8-Dflash updated a model 3 days ago
RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-FP8-Dflash updated a model 4 days ago
inference-optimization/Ornith-1.0-9B-FP8-Dynamic