Why is the safetensors model size 5.23B parameters?
#6 opened 5 months ago by oilbread
Can you share the GPTQ quantization code?
#5 opened 7 months ago by qwertist
Produces gibberish with dtype=auto
#4 opened 7 months ago by divisingh
QAT version
#3 opened 7 months ago by Delnith
vLLM on a 24 GB GPU
#2 opened 9 months ago by roadtoagi