Active filters: FP4
Text Generation • 37.8k • 45
nvidia/Kimi-K2-Thinking-NVFP4 • Text Generation • 112k • 26
nvidia/DeepSeek-R1-0528-NVFP4-v2 • Text Generation • 394B • 121k • 14
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-NVFP4-QAD • Image-Text-to-Text • 3.55k • 19
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4 • 56B • 18k • 20
nvidia/DeepSeek-R1-0528-NVFP4 • Text Generation • 397B • 15k • 41
nvidia/Qwen3-235B-A22B-NVFP4 • Text Generation • 133B • 5.08k • 14
nvidia/Phi-4-multimodal-instruct-NVFP4 • 4B • 3.19k • 7
Text Generation • 5B • 14.7k • 14
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD • Image-Text-to-Text • 185 • 12
nvidia/Qwen3-Coder-480B-A35B-Instruct-NVFP4 • Text Generation • 241B • 277 • 1
nvidia/DeepSeek-V3-0324-NVFP4 • Text Generation • 397B • 84k • 14
NVFP4/DeepSeek-Prover-V2-7B-FP4 • 4B • 74 • 1
NVFP4/DeepSeek-R1-0528-Qwen3-8B-FP4 • 5B • 157 • 1
Text Generation • 19B • 341 • 4
NVFP4/Polaris-4B-Preview-FP4 • 2B • 4
NVFP4/Polaris-7B-Preview-FP4 • 5B • 3 • 1
nvidia/Qwen3-30B-A3B-NVFP4 • Text Generation • 16B • 34.8k • 23
apolloparty/LFM2-350M-NVFP4A16 • Text Generation • 0.2B • 5
apolloparty/LFM2-700M-NVFP4A16 • Text Generation • 0.5B • 3
apolloparty/LFM2-1.2B-NVFP4A16 • Text Generation • 0.7B • 9 • 1
tachyphylaxis/DeepSeek-R1-0528-FP4 • Text Generation • 397B • 4
nvidia/DeepSeek-R1-NVFP4-v2 • Text Generation • 394B • 3.06k • 5
NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4 • Text Generation • 118B • 1.57k • 3
NVFP4/Qwen3-Coder-480B-A35B-Instruct-FP4 • Text Generation • 241B • 600 • 2
NVFP4/Qwen3-235B-A22B-Thinking-2507-FP4 • Text Generation • 118B • 165 • 2
BitPhinix/DeepSeek-V3-0324-FP4 • Text Generation • 397B • 3
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4 • Text Generation • 16B • 1.42k • 11
NVFP4/Qwen3-30B-A3B-Thinking-2507-FP4 • Text Generation • 16B • 2.5k • 4
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4 • Text Generation • 16B • 22.6k • 7