File size: 1,412 Bytes
a0879b1 9905486 a0879b1 9905486 a0879b1 8182b82 a0879b1 9905486 e309371 9905486 9c500e6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 |
---
license: mit
base_model: MiniMaxAI/MiniMax-M2
base_model_relation: quantized
quantized_by: turboderp
tags:
- exl3
---
EXL3 quants of [MiniMax-M2](https://huggingface.co/MiniMaxAI/MiniMax-M2)
⚠️ Requires ExLlamaV3 v0.0.12 (or v0.0.11 `dev` branch)
Base bitrates:
[2.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.0bpw)
[3.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.0bpw)
[4.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.0bpw)
Optimized:
[2.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.04bpw)
[2.27 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.27bpw)
[3.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.04bpw)
[3.50 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.5bpw)
[4.03 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.03bpw)
. | KL-div | ppl | HumanEval@1
---------|--------|-------|-------------
2.00 bpw | 0.400 | 10.92 | 80.5%
2.04 bpw | 0.297 | 10.23 | 87.1%
2.27 bpw | 0.252 | 9.78 | 88.4%
3.00 bpw | 0.141 | 8.99 | 87.8%
3.04 bpw | 0.117 | 8.73 | 87.2%
3.50 bpw | 0.094 | 8.78 | 88.4%
4.00 bpw | 0.087 | 8.58 | 89.6%
4.03 bpw | 0.077 | 8.61 | 87.8%
original | - | 8.51 | 87.2%¹
¹ Unconfirmed
|