roshniramesh
's Collections
int4 llm
updated
Text Generation
•
Updated
•
20
•
1
nvidia/Gemma-2b-it-ONNX-INT4
nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4
Updated
•
21
•
6
nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4
nvidia/Phi-3.5-mini-Instruct-ONNX-INT4
nvidia/Mistral-Nemo-12B-Instruct-ONNX-INT4
nvidia/Nemotron-Mini-4B-Instruct-ONNX-INT4
meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8
Text Generation
•
Updated
•
84
•
38
hugging-quants/gemma-2-9b-it-AWQ-INT4
Text Generation
•
9B
•
Updated
•
2.01k
•
8
Qwen/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
473
•
29
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
Updated
•
400k
•
87
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16
Text Generation
•
8B
•
Updated
•
26.2k
•
30
ModelCloud/Meta-Llama-3.1-8B-gptq-4bit
Text Generation
•
8B
•
Updated
•
114
hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF
Text Generation
•
3B
•
Updated
•
21.6k
•
25
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation
•
Updated
•
79.2k
•
107
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
•
1B
•
Updated
•
34.8k
•
19
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4
Text Generation
•
71B
•
Updated
•
5.41k
•
23
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4
Text Generation
•
8B
•
Updated
•
24.3k
•
41
meta-llama/Llama-Guard-3-1B-INT4
Text Generation
•
Updated
•
16
•
27
meta-llama/Llama-3.2-3B-Instruct-QLORA_INT4_EO8
Text Generation
•
Updated
•
50
•
71
meta-llama/Llama-3.2-3B-Instruct-SpinQuant_INT4_EO8
Text Generation
•
Updated
•
47
•
37
meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8
Text Generation
•
Updated
•
130
•
48
RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit
Text Generation
•
7B
•
Updated
•
1.04k
•
23
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w4a16
Text Generation
•
7B
•
Updated
•
124
•
2
RedHatAI/Llama-2-7b-chat-quantized.w4a16
Text Generation
•
7B
•
Updated
•
19
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w4a16
Text Generation
•
8B
•
Updated
•
26
•
2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16
Text Generation
•
71B
•
Updated
•
195
•
2
RedHatAI/gemma-2-2b-it-quantized.w4a16
Text Generation
•
1B
•
Updated
•
19
•
1
RedHatAI/gemma-2-9b-it-quantized.w4a16
Text Generation
•
3B
•
Updated
•
37
•
2
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w4a16
Text Generation
•
3B
•
Updated
•
470
•
4
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Text Generation
•
71B
•
Updated
•
3.78k
•
32
nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4
OpenVINO/mistral-7b-instruct-v0.1-int4-ov
Text Generation
•
Updated
•
13
OpenVINO/Mistral-7B-Instruct-v0.2-int4-ov
Text Generation
•
Updated
•
561
•
1
Text Generation
•
72B
•
Updated
•
223
•
47
Text Generation
•
14B
•
Updated
•
45.6k
•
100
Text Generation
•
8B
•
Updated
•
609
•
75
Text Generation
•
Updated
•
310
•
36
Qwen/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
111B
•
Updated
•
128
•
18
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
131
•
7
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
Text Generation
•
14B
•
Updated
•
1.03k
•
50
Qwen/Qwen1.5-4B-Chat-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
100
•
6
Qwen/Qwen1.5-72B-Chat-GPTQ-Int4
Text Generation
•
72B
•
Updated
•
1.38k
•
37
Qwen/Qwen1.5-4B-Chat-GGUF
Text Generation
•
4B
•
Updated
•
613
•
16
Qwen/Qwen1.5-0.5B-Chat-GGUF
Text Generation
•
0.6B
•
Updated
•
3.95k
•
35
Qwen/Qwen1.5-7B-Chat-GGUF
Text Generation
•
8B
•
Updated
•
2.57k
•
70
Qwen/CodeQwen1.5-7B-Chat-GGUF
Text Generation
•
7B
•
Updated
•
913
•
110
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
1.39k
•
3
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4
Text Generation
•
0.5B
•
Updated
•
791
•
9
Qwen/Qwen2.5-0.5B-Instruct-GGUF
Text Generation
•
0.6B
•
Updated
•
51.7k
•
74
Qwen/Qwen2-1.5B-Instruct-GGUF
Text Generation
•
2B
•
Updated
•
7.51k
•
27
Qwen/Qwen2-0.5B-Instruct-GGUF
Text Generation
•
0.5B
•
Updated
•
20.7k
•
71
Qwen/Qwen2-7B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
4.41k
•
178
Qwen/Qwen2-0.5B-Instruct-GPTQ-Int4
Text Generation
•
0.6B
•
Updated
•
142
•
15
Qwen/Qwen2-1.5B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
19.4k
•
5
Qwen/Qwen2-72B-Instruct-GPTQ-Int4
Text Generation
•
73B
•
Updated
•
81
•
33
Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4
Text Generation
•
57B
•
Updated
•
98
•
23