opter
AI & ML interests
None yet
Organizations
None yet
W8A8 Quantization leads to wrong token
#31 opened 7 months ago
by
opter
You give the wrong transpose format of model linear weights
#3 opened 7 months ago
by
opter
int4 quantization destroys function_call accuracy
#99 opened 9 months ago
by
opter