Active filters: modelopt
Text Generation • Updated • 38.2k • 46
lukealonso/MiniMax-M2.5-NVFP4 • 130B • Updated • 3.34k • 16
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4 • Text Generation • Updated • 59.5k • 44
nvidia/Kimi-K2-Thinking-NVFP4 • Text Generation • Updated • 113k • 26
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 • Text Generation • Updated • 19.6k • 25
Text Generation • 8B • Updated • 3 • 3
425B • Updated • 3
lukealonso/MiniMax-M2.1-NVFP4 • 115B • Updated • 69.3k • 25
BenChaliah/Gemma3-27B-it-NVFP4 • 15B • Updated • 5 • 2
vincentzed-hf/Qwen3-Coder-Next-NVFP4 • Text Generation • Updated • 1.8k • 4
Text Generation • 5B • Updated • 15.2k • 14
Text Generation • 15B • Updated • 3.23k • 3
shanjiaz/gpt-oss-120b-nvfp4-modelopt • 59B • Updated • 9.11k • 2
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16 • Image-Text-to-Text • 3B • Updated • 8 • 1
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16 • Image-Text-to-Text • 5B • Updated • 254 • 2
Text Generation • 177B • Updated • 4.82k • 15
nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4 • Text Generation • 120B • Updated • 1.42k • 2
nvidia/Qwen3-Coder-480B-A35B-Instruct-NVFP4 • Text Generation • 241B • Updated • 299 • 1
vincentzed-hf/Kimi-K2.5-MXFP8 • Image-Text-to-Text • 1T • Updated • 22 • 1
Cirrascale/Qwen3-Coder-Next-NVFP4 • Text Generation • Updated • 108 • 1
txn545/Qwen3-Coder-Next-NVFP4 • Updated • 87 • 1
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4 • 56B • Updated • 16.2k • 20
nvidia/Llama-4-Maverick-17B-128E-Instruct-FP8 • 402B • Updated • 477 • 12
nvidia/Llama-4-Scout-17B-16E-Instruct-FP8 • 109B • Updated • 59.4k • 11
ishan24/test_modelopt_quant
nvidia/Llama-4-Maverick-17B-128E-Eagle3 • Updated • 13 • 9
nvidia/Qwen3-30B-A3B-NVFP4 • Text Generation • 16B • Updated • 37.5k • 23
jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8 • Text Generation • 71B • Updated • 6
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4 • Text Generation • 16B • Updated • 1.4k • 11
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4 • Text Generation • 16B • Updated • 21.7k • 7