Active filters: quant
Text Generation
• 9B • Updated • 5.05k
• 3
2B • Updated • 14
• 56
AngelSlim/HY-1.8B-2Bit-GGUF
2B • Updated • 540
• 40
digitous/13B-HyperMantis_GPTQ_4bit-128g
Text Generation
• Updated • 53
• 12
pszemraj/nougat-small-onnx-quant_avx2
Image-Text-to-Text
• Updated • 10
pszemraj/nougat-base-onnx-quant_avx2
Image-Text-to-Text
• Updated • 12
fhai50032/RolePlayLake-7B-GGUF
7B • Updated • 27
• 3
oldbridge/latxa-7b-instruct-q8
Text Generation
• 7B • Updated • 2
pszemraj/nougat-small-onnx-quant_avx512_vnni
Image-Text-to-Text
• Updated • 5
RDson/Llama-3-Magenta-Instruct-4x8B-MoE-GGUF
25B • Updated • 101
• 1
TroyDoesAI/Codestral-21B-Pruned
Text Generation
• 21B • Updated • 15
• 2
mradermacher/Codestral-21B-Pruned-GGUF
21B • Updated • 183
mradermacher/Codestral-21B-Pruned-i1-GGUF
21B • Updated • 398
pszemraj/candle-flanUL2-quantized
Text Generation
• 19B • Updated • 24
byroneverson/gemma-2-27b-it-abliterated-gguf
Text Generation
• 27B • Updated • 135
• 12
QuantFactory/gemma-2-27b-it-abliterated-GGUF
Text Generation
• 27B • Updated • 594
• 7
EmperorKronos/gemma-2-27b-it-abliterated-exl2
Text Generation
• Updated • 4
byroneverson/LongWriter-glm4-9b-abliterated-gguf
Text Generation
• 9B • Updated • 16
• 3
Question Answering
• 8B • Updated • 13
• 4
mradermacher/FinShibainu-GGUF
8B • Updated • 118
• 1
eaddario/Hammer2.1-7b-GGUF
Text Generation
• 8B • Updated • 5.56k
• 2
eaddario/DeepSeek-R1-Distill-Qwen-7B-GGUF
Text Generation
• 8B • Updated • 1.15k
• 3
eaddario/Watt-Tool-8B-GGUF
Text Generation
• 8B • Updated • 646
• 5
eaddario/DeepSeek-R1-Distill-Llama-8B-GGUF
Text Generation
• 8B • Updated • 498
• 1
shisa-ai/Llama-3.1-Tulu-3-405B-FP8-Dynamic
Text Generation
• 406B • Updated • 5
eaddario/Dolphin3.0-R1-Mistral-24B-GGUF
Text Generation
• 24B • Updated • 398
• 1
eaddario/Llama-Guard-3-8B-GGUF
Text Generation
• 8B • Updated • 704
eaddario/Dolphin3.0-Mistral-24B-GGUF
Text Generation
• 24B • Updated • 607
• 2