INT4 LLMs for vLLM • Collection • Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 16 items • Updated 6 days ago • 12
meta-llama/Meta-Llama-3-8B-Instruct • Text Generation • 8B • Updated Jun 18, 2025 • 1.4M • 4.4k
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-FP16-BinaryClass-WeightedLoss • Token Classification • 0.3B • Updated Jun 1, 2024
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-BinaryClass-WeightedLoss • Token Classification • 0.3B • Updated Jun 1, 2024