File size: 1,412 Bytes
a0879b1
 
9905486
 
 
 
 
a0879b1
 
 
 
 
 
 
 
9905486
a0879b1
8182b82
a0879b1
 
 
9905486
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e309371
9905486
 
9c500e6
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
---
license: mit
base_model: MiniMaxAI/MiniMax-M2
base_model_relation: quantized
quantized_by: turboderp
tags:
- exl3
---

EXL3 quants of [MiniMax-M2](https://huggingface.co/MiniMaxAI/MiniMax-M2)

⚠️ Requires ExLlamaV3 v0.0.12 (or v0.0.11 `dev` branch)

Base bitrates:

[2.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.0bpw)    
[3.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.0bpw)    
[4.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.0bpw)    

Optimized:

[2.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.04bpw)    
[2.27 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.27bpw)    
[3.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.04bpw)    
[3.50 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.5bpw)    
[4.03 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.03bpw)    


.        | KL-div |  ppl  | HumanEval@1
---------|--------|-------|-------------
2.00 bpw | 0.400  | 10.92 | 80.5%
2.04 bpw | 0.297  | 10.23 | 87.1%
2.27 bpw | 0.252  |  9.78 | 88.4%
3.00 bpw | 0.141  |  8.99 | 87.8%
3.04 bpw | 0.117  |  8.73 | 87.2% 
3.50 bpw | 0.094  |  8.78 | 88.4%
4.00 bpw | 0.087  |  8.58 | 89.6%
4.03 bpw | 0.077  |  8.61 | 87.8%
original |     -  |  8.51 | 87.2%¹

¹ Unconfirmed