Magic Quant Collection Hybrid GGUF quants created via an evolutionary quant algorithm. Want the best TPS? Lowest precision loss? Smallest file size? Welcome to MagicQuant! • 8 items • Updated Dec 16, 2025 • 26
magiccodingman/Qwen3-30B-A3B-Thinking-2507-unsloth-MagicQuant-Hybrid-GGUF Text Generation • 31B • Updated Dec 5, 2025 • 420 • 5
google/gemma-3-12b-it-qat-q4_0-gguf Image-Text-to-Text • 12B • Updated Apr 11, 2025 • 9.46k • 242
Ovis2.5 Collection Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19, 2025 • 57
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Dec 31, 2025 • 376
astronomer/Llama-3-8B-Instruct-GPTQ-4-Bit Text Generation • 8B • Updated Apr 22, 2024 • 96 • 25
google-bert/bert-base-multilingual-cased Fill-Mask • 0.2B • Updated Feb 19, 2024 • 3.49M • • 571
cmarkea/distilcamembert-base-ner Token Classification • 67.5M • Updated Oct 26, 2024 • 7.37k • • 24