Volko
Volko76
AI & ML interests
Quantization, Fine-tune, Agentic Frameworks
Recent Activity
updated
a dataset
5 days ago
Volko76/french-classic-conversations-v2
published
a dataset
5 days ago
Volko76/french-classic-conversations-v2
updated
a dataset
5 days ago
Volko76/french-classic-books-v2
Organizations
Qwen2.5 Coder Base GGUF
A list of Qwen2.5 Coder base quantized in GGUF
GGUF Quantizations
A CPU + GPU support type of quantization. It's currently the most used quantization method. Read more here : https://github.com/ggerganov/llama.cpp
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 48 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 107 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 54 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 100
Qwen2.5 Coder Instruct GGUF
A list of Qwen2.5 Coder quantized in GGUF
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 48 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 107 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 54 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 100
OpenCoder GGUF
A complete open source small coding model quantized in GGUF
EXL2 Quantizations
A collection of models quantized for EXL2, one of the fastest quantisation method. https://github.com/turboderp/exllamav2
EXL3 Quantizations
Qwen2.5 Coder Instruct GGUF
A list of Qwen2.5 Coder quantized in GGUF
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 48 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 107 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 54 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 100
Qwen2.5 Coder Base GGUF
A list of Qwen2.5 Coder base quantized in GGUF
OpenCoder GGUF
A complete open source small coding model quantized in GGUF
GGUF Quantizations
A CPU + GPU support type of quantization. It's currently the most used quantization method. Read more here : https://github.com/ggerganov/llama.cpp
-
Volko76/Qwen2.5-Coder-0.5B-Instruct-GGUF
Text Generation • 0.5B • Updated • 48 -
Volko76/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation • 2B • Updated • 107 -
Volko76/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation • 3B • Updated • 54 -
Volko76/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation • 8B • Updated • 100
EXL2 Quantizations
A collection of models quantized for EXL2, one of the fastest quantisation method. https://github.com/turboderp/exllamav2