mlx-community/Kimi-K2-Instruct-0905-mlx-DQ3_K_M

Text Generation

4-bit precision


bibproj committed on Sep 6, 2025

Commit 07be91a · verified · 1 Parent(s): bc0af79

Update README.md

Files changed (1)

README.md +4 -4

@@ -90,8 +90,8 @@ In the `convert.py` file of mlx-lm on your system ( [you can see the original co
            q_bits = 3
         if "switch_mlp.down_proj" in path:
            q_bits = 3
-           # Blocks 3 and 4 are higher quality
-           if (index == 3) or (index == 4):
               q_bits = 6
            # Every 5th block is "medium" quality
            if (index % 5) == 0:
@@ -105,8 +105,8 @@ Should you wish to squeeze more out of your quant, and you do not need to use a
 ```python
         if "switch_mlp.down_proj" in path:
            q_bits = 4
-           # Blocks 3 and 4 are higher quality
-           if (index == 3) or (index == 4):
               q_bits = 6
         #print("path:", path, "index:", index, "q_bits:", q_bits)
         return {"group_size": group_size, "bits": q_bits}

            q_bits = 3
         if "switch_mlp.down_proj" in path:
            q_bits = 3
+           # Blocks up to 5 are higher quality
+           if index < 5:
               q_bits = 6
            # Every 5th block is "medium" quality
            if (index % 5) == 0:
 ```python
         if "switch_mlp.down_proj" in path:
            q_bits = 4
+           # Blocks up to 5 are higher quality
+           if index < 5:
               q_bits = 6
         #print("path:", path, "index:", index, "q_bits:", q_bits)
         return {"group_size": group_size, "bits": q_bits}
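
The effect of this commit on the `switch_mlp.down_proj` bit width can be checked with a small sketch. It mirrors only the second hunk (4-bit base, 6-bit override) and assumes `index` is the integer block number. The helper names `down_proj_bits_old` and `down_proj_bits_new` are hypothetical and are not part of mlx-lm or the README.

```python
# Illustrative sketch only: these helpers are hypothetical and mirror the
# condition shown in the second hunk of the diff, not the full convert.py logic.

def down_proj_bits_old(index: int) -> int:
    """Bit width before the commit: only blocks 3 and 4 get 6 bits."""
    q_bits = 4
    if (index == 3) or (index == 4):
        q_bits = 6
    return q_bits


def down_proj_bits_new(index: int) -> int:
    """Bit width after the commit: blocks 0 through 4 get 6 bits."""
    q_bits = 4
    if index < 5:
        q_bits = 6
    return q_bits


# Blocks whose bit width differs between the two versions (first 10 blocks).
changed = [i for i in range(10) if down_proj_bits_old(i) != down_proj_bits_new(i)]
print(changed)  # [0, 1, 2]
```

Under these assumptions, the only behavioural difference is that blocks 0, 1 and 2 move from 4 bits to 6 bits; blocks 3 and 4 were already at 6 bits, and blocks from 5 onward are unchanged.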