roshniramesh's Collections
int4 llm
Text Generation • 21 • 1
nvidia/Gemma-2b-it-ONNX-INT4
nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4 • 38 • 6
nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4
nvidia/Phi-3.5-mini-Instruct-ONNX-INT4
nvidia/Mistral-Nemo-12B-Instruct-ONNX-INT4
nvidia/Nemotron-Mini-4B-Instruct-ONNX-INT4
meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8 • Text Generation • 77 • 37
hugging-quants/gemma-2-9b-it-AWQ-INT4 • Text Generation • 9B • 3.58k • 7
Qwen/Qwen2-7B-Instruct-GPTQ-Int4 • Text Generation • 8B • 571 • 29
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 • Text Generation • 8B • 161k • 82
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 • Text Generation • 8B • 19.6k • 30
ModelCloud/Meta-Llama-3.1-8B-gptq-4bit • Text Generation • 8B • 157
hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF • Text Generation • 3B • 20.6k • 26
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 • Text Generation • 71B • 137k • 107
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF • Text Generation • 1B • 30.2k • 18
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4 • Text Generation • 71B • 871 • 23
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4 • Text Generation • 8B • 6.72k • 40
meta-llama/Llama-Guard-3-1B-INT4 • Text Generation • 13 • 27
meta-llama/Llama-3.2-3B-Instruct-QLORA_INT4_EO8 • Text Generation • 93 • 70
meta-llama/Llama-3.2-3B-Instruct-SpinQuant_INT4_EO8 • Text Generation • 117 • 37
meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8 • Text Generation • 96 • 46
RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit • Text Generation • 7B • 59.5k • 23
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w4a16 • Text Generation • 7B • 100 • 2
RedHatAI/Llama-2-7b-chat-quantized.w4a16 • Text Generation • 7B • 54
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w4a16 • Text Generation • 8B • 24 • 2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16 • Text Generation • 71B • 116 • 2
RedHatAI/gemma-2-2b-it-quantized.w4a16 • Text Generation • 1B • 117 • 1
RedHatAI/gemma-2-9b-it-quantized.w4a16 • Text Generation • 3B • 66 • 2
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w4a16 • Text Generation • 3B • 123 • 4
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 • Text Generation • 71B • 1.34k • 32
nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4
OpenVINO/mistral-7b-instruct-v0.1-int4-ov • Text Generation • 28
OpenVINO/Mistral-7B-Instruct-v0.2-int4-ov • Text Generation • 1.73k • 1
Text Generation • 72B • 71 • 46
Text Generation • 14B • 89 • 100
Text Generation • 8B • 575 • 75
Text Generation • 2B • 436 • 36
Qwen/Qwen1.5-110B-Chat-GPTQ-Int4 • Text Generation • 111B • 3.74k • 18
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int4 • Text Generation • 2B • 542 • 7
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4 • Text Generation • 14B • 1.15k • 49
Qwen/Qwen1.5-4B-Chat-GPTQ-Int4 • Text Generation • 4B • 2.97k • 6
Qwen/Qwen1.5-72B-Chat-GPTQ-Int4 • Text Generation • 72B • 2.33k • 37
Qwen/Qwen1.5-4B-Chat-GGUF • Text Generation • 4B • 1.93k • 16
Qwen/Qwen1.5-0.5B-Chat-GGUF • Text Generation • 0.6B • 5.54k • 35
Qwen/Qwen1.5-7B-Chat-GGUF • Text Generation • 8B • 4.83k • 70
Qwen/CodeQwen1.5-7B-Chat-GGUF • Text Generation • 7B • 588 • 109
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int4 • Text Generation • 2B • 5.4k • 2
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4 • Text Generation • 0.5B • 812 • 8
Qwen/Qwen2.5-0.5B-Instruct-GGUF • Text Generation • 0.6B • 36.6k • 61
Qwen/Qwen2-1.5B-Instruct-GGUF • Text Generation • 2B • 6.87k • 27
Qwen/Qwen2-0.5B-Instruct-GGUF • Text Generation • 0.5B • 21k • 69
Qwen/Qwen2-7B-Instruct-GGUF • Text Generation • 8B • 3.2k • 177
Qwen/Qwen2-0.5B-Instruct-GPTQ-Int4 • Text Generation • 0.6B • 58 • 15
Qwen/Qwen2-1.5B-Instruct-GPTQ-Int4 • Text Generation • 2B • 7.56k • 5
Qwen/Qwen2-72B-Instruct-GPTQ-Int4 • Text Generation • 73B • 94 • 33
Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4 • Text Generation • 57B • 492 • 23