-
-
-
-
-
-
Inference Providers
Active filters:
awq
QuantTrio/DeepSeek-V3.2-Speciale-AWQ
Text Generation
•
685B
•
Updated
•
41
•
4
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
2B
•
Updated
•
224k
•
80
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
126k
•
116
QuantTrio/Qwen3-VL-32B-Instruct-AWQ
Image-Text-to-Text
•
33B
•
Updated
•
11.2k
•
8
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation
•
685B
•
Updated
•
395
•
2
TheBloke/dolphin-2.2.1-mistral-7B-AWQ
Text Generation
•
1B
•
Updated
•
88
•
16
TheBloke/deepseek-coder-1.3b-instruct-AWQ
Text Generation
•
0.3B
•
Updated
•
97
•
4
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation
•
6B
•
Updated
•
1.25M
•
89
Qwen/Qwen2.5-Coder-7B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
507k
•
18
hugging-quants/gemma-2-9b-it-AWQ-INT4
Text Generation
•
2B
•
Updated
•
8.41k
•
7
Orion-zhen/aya-expanse-8b-AWQ
Text Generation
•
3B
•
Updated
•
64
•
1
Qwen/Qwen2.5-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
13B
•
Updated
•
49k
•
69
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
3B
•
Updated
•
161k
•
94
gaunernst/gemma-3-27b-it-qat-autoawq
Image-Text-to-Text
•
6B
•
Updated
•
11.4k
•
12
Qwen/Qwen3-14B-AWQ
Text Generation
•
3B
•
Updated
•
212k
•
46
Qwen/Qwen2.5-Omni-7B-AWQ
Any-to-Any
•
5B
•
Updated
•
29.3k
•
14
TechxGenus/Qwen3-Coder-480B-A35B-Instruct-AWQ
Text Generation
•
70B
•
Updated
•
31
•
1
twhitworth/gpt-oss-120b-awq-w4a16
117B
•
Updated
•
6.25k
•
16
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
•
Updated
•
40
•
2
QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
Text Generation
•
Updated
•
68
•
3
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
•
31B
•
Updated
•
128k
•
33
QuantTrio/MiniMax-M2-AWQ
Text Generation
•
229B
•
Updated
•
8.52k
•
6
QuantTrio/MiniMax-M2-REAP-162B-A10B-AWQ
Text Generation
•
162B
•
Updated
•
157
•
2
TheHouseOfTheDude/INTELLECT-3_Compressed-Tensors
Text Generation
•
Updated
•
14
•
1
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
16
•
3
casperhansen/falcon-7b-awq
Text Generation
•
Updated
•
9
•
1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
•
Updated
•
72
•
3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
•
Updated
•
9
•
1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation
•
Updated
•
6
casperhansen/opt-125m-awq
Text Generation
•
90.3M
•
Updated
•
782
•
3