TheBloke/StableBeluga2-70B-GPTQ

Text Generation
Transformers
Safetensors
English
llama
text-generation-inference
4-bit precision
gptq
Community
14

Why so few 8 bit capable models?

1
#13 opened over 2 years ago by
ibivibiv

Can Run "gptq_model-4bit--1g" but not "gptq-4bit-32g-actorder_True"

#12 opened over 2 years ago by
0-hero

comparison with bitsandbytes nf4, hope to increase GPTQ accuracy

12
#11 opened over 2 years ago by
AIReach

Minimum VRAM?

7
#9 opened over 2 years ago by
hierholzer

GGML version possible/coming?

2
#8 opened over 2 years ago by
Thireus

vram requirements

1
#5 opened over 2 years ago by
joujiboi