QuantTrio/Qwen3-Coder-480B-A35B-Instruct-AWQ

Text Generation
Transformers
Safetensors
qwen3_moe
Qwen3
AWQ
Quantization fix
vLLM
conversational
4-bit precision
awq

How to use CPU Offload for this model? I keep getting OOM

#4 opened 3 days ago by crystech

How did you do it?

#1 opened 4 months ago by ehartford