Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

One-click Deployment

Inference Endpoints

Microsoft Foundry

Amazon SageMaker AI

Misc

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,376

Base only

Active filters: nvfp4

nota-ai/Solar-Open2-250B-Nota-NVFP4

Text Generation • 145B • Updated 10 days ago • 22.4k • 152

nota-ai/Solar-Open2-250B-Nota-NVFP4-GlobalPruned

Text Generation • 117B • Updated 2 days ago • 296 • 34

drowzeys/keys-DeepSeekV4-Flash-GA-0731-Dspark-Abliterated-32-32

Text Generation • 304B • Updated about 11 hours ago • 23

DreamFast/Qwen3-VL-4b-Heretic-ComfyUI

Image-Text-to-Text • Updated 16 days ago • 51

nota-ai/Solar-Open-100B-NotaMoEQuant-NVFP4

Text Generation • 59B • Updated Mar 11 • 207 • 22

nvidia/DeepSeek-V4-Flash-NVFP4

Text Generation • 167B • Updated Jun 15 • 1.25M • 84

PassingByPixels/Qwen3.6-27B-Architect-Polaris2-Fable-B-F451-NVFP4

Image-Text-to-Text • 15B • Updated 4 days ago • 2.48k • 12

jarrelscy/GLM-5.2-NVFP4-AQLM-hybrid

Image-Text-to-Text • 208B • Updated 3 days ago • 3.09k • 14

DreamFast/gemma-3-12b-it-heretic-v2

Text Generation • 12B • Updated 16 days ago • 11.1k • • 77

protoLabsAI/ThinkingCap-Qwen3.6-27B-MTP-GGUF

0.5B • Updated 23 days ago • 48.1k • 59

sakamakismile/Qwen3.6-27B-Fable-Fusion-MTP-NVFP4

Image-Text-to-Text • 17B • Updated 6 days ago • 477 • 7

PassingByPixels/Qwen3.6-27B-Architect-Polaris2-Fable-B-F451-NVFP4-MTP

Image-Text-to-Text • 15B • Updated 4 days ago • 672 • 6

bottlecapai/ThinkingCap-Qwen3.6-27B-NVFP4

Image-Text-to-Text • 17B • Updated 4 days ago • 3.06k • 6

michaelw9999/Qwen3.6-27B-NVFP4-MTP-GGUF

27B • Updated Jun 6 • 40.4k • 55

nvidia/MiniMax-M3-NVFP4

Text Generation • 247B • Updated Jun 26 • 576k • 72

drowzeys/DeepSeek-V4-Flash-DSpark-Abliterated-Uncensored

Text Generation • 165B • Updated 21 days ago • 13.3k • 18

sakamakismile/Huihui-ThinkingCap-Qwen3.6-27B-abliterated-NVFP4

Image-Text-to-Text • 17B • Updated 20 days ago • 9.78k • 33

sakamakismile/KAT-Coder-V2.5-Dev-NVFP4

20B • Updated 9 days ago • 1.78k • 11

protoLabsAI/ThinkingCap-Qwen3.6-27B-heretic-MTP-GGUF

Text Generation • 27B • Updated 5 days ago • 2.35k • 5

rdtand/Qwen3.6-27B-PrismaAURA-5.5bit-vllm

20B • Updated Jun 25 • 42.7k • 26

s-batman/Ornith-1.0-35B-NVFP4-MTP-GGUF

Text Generation • 36B • Updated Jun 29 • 37.7k • 38

0xSero/Laguna-S-2.1-Hybrid-3.25bpw

Text Generation • Updated 7 days ago • 212 • 6

jcbtc/Laguna-S-2.1-NVFP4-GGUF

Text Generation • 118B • Updated 9 days ago • 1.06k • 5

MJPansa/DeepSeek-V4-Flash-0731-NVFP4

Text Generation • 304B • Updated 1 day ago • 99 • 4

scottgl/MiniMax-M2.7-REAP-172B-A10B-NVFP4-GB10

Text Generation • 98B • Updated Apr 16 • 3.26k • 6

RedHatAI/Qwen3.6-35B-A3B-NVFP4

20B • Updated 20 days ago • 1.85M • 166

nvidia/Gemma-4-26B-A4B-NVFP4

Text Generation • 14B • Updated May 11 • 1.26M • 124

nvidia/Qwen3.5-122B-A10B-NVFP4

Text Generation • 65B • Updated Jun 2 • 249k • 47

nvidia/diffusiongemma-26B-A4B-it-NVFP4

Text Generation • 14B • Updated 29 days ago • 1.68M • 113

s-batman/Ornith-1.0-9B-NVFP4-MTP-GGUF

Text Generation • 9B • Updated Jun 29 • 6.83k • 7