
Port of bitnet1.58 with custom metal kernel #331

Merged
davidkoski merged 13 commits into ml-explore:main from johnmai-dev:20250613-port-bitnet1.58
Jul 3, 2025

Conversation

@johnmai-dev (Contributor) commented Jun 12, 2025

This PR ports bitnet1.58, contributed by @Blaizzy. Thanks to my idol @Blaizzy!
Source: ml-explore/mlx-lm#219


@Blaizzy commented Jun 12, 2025

Amazing job with the swift port @johnmai-dev! 🔥🚀

The quantization here is done normally like any other model. So you can consider it done ✅

@johnmai-dev (Contributor, Author)

> Amazing job with the swift port @johnmai-dev! 🔥🚀
>
> The quantization here is done normally like any other model. So you can consider it done ✅

Thank you! ❤️
Yes, you are right. I still need to adjust some details and expect to finish later today.

@johnmai-dev (Contributor, Author)

Hello @Blaizzy,

What is the difference between quantization and quantization_config in config.json? microsoft/bitnet-b1.58-2B-4T contains only quantization_config.

I see that apply_hf_quantization only uses quantization_config:
https://github.com/ml-explore/mlx-lm/blob/4fab6fcbc9dd63dea229692f91028d33f7532fd6/mlx_lm/quant/utils.py#L56-L72

    "quantization": {
        "group_size": 64,
        "bits": 4,
        "quant_method": "bitnet",
        "linear_class": "autobitlinear",
        "quantization_mode": "offline"
    },
    "quantization_config": {
        "group_size": 64,
        "bits": 4,
        "quant_method": "bitnet",
        "linear_class": "autobitlinear",
        "quantization_mode": "offline"
    },

Looking forward to your reply, thanks ❤️

@johnmai-dev (Contributor, Author)

> What is the difference between quantization and quantization_config in config.json?

Currently, the Quantization struct does not support decoding quant_method, linear_class, or quantization_mode:

    public struct Quantization: Codable, Sendable, Equatable {
        public init(groupSize: Int, bits: Int) {
            self.groupSize = groupSize
            self.bits = bits
        }

        public let groupSize: Int
        public let bits: Int

        public var asTuple: (Int, Int) { (groupSize, bits) }

        enum CodingKeys: String, CodingKey {
            case groupSize = "group_size"
            case bits = "bits"
        }
    }

I am considering whether to adjust Quantization or add a new QuantizationConfig struct.

    Failed: configurationDecodingError("config.json", "mlx-community/bitnet-b1.58-2B-4T-4bit", Swift.DecodingError.typeMismatch(Swift.Dictionary<Swift.String, Any>, Swift.DecodingError.Context(codingPath: [CodingKeys(stringValue: "quantization", intValue: nil), _DictionaryCodingKey(stringValue: "quant_method", intValue: nil)], debugDescription: "Expected to decode Dictionary<String, Any> but found a string instead.", underlyingError: nil)))
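One possible direction, sketched here purely as an illustration (the optional-field approach is my assumption, not necessarily what the PR ultimately shipped), is to decode the extra bitnet fields as optional strings. Swift's synthesized Decodable uses decodeIfPresent for optional properties, so configs without these keys would still parse and the typeMismatch above would not occur:

    // Hedged sketch: extend Quantization so the extra bitnet fields in
    // config.json ("quant_method", "linear_class", "quantization_mode")
    // decode as optional strings instead of failing.
    public struct Quantization: Codable, Sendable, Equatable {
        public let groupSize: Int
        public let bits: Int

        // Optional metadata present in bitnet-style configs; nil elsewhere.
        public let quantMethod: String?
        public let linearClass: String?
        public let quantizationMode: String?

        public var asTuple: (Int, Int) { (groupSize, bits) }

        enum CodingKeys: String, CodingKey {
            case groupSize = "group_size"
            case bits
            case quantMethod = "quant_method"
            case linearClass = "linear_class"
            case quantizationMode = "quantization_mode"
        }
    }

Existing callers that only use groupSize/bits would be unaffected, since the new properties are optional and the memberwise tuple stays the same.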

@johnmai-dev johnmai-dev marked this pull request as ready for review June 14, 2025 15:58
@johnmai-dev (Contributor, Author)

#295

@davidkoski (Collaborator)

Cut a tag on mlx-swift for the relu squared: 0.25.5

@awni (Member) commented Jun 25, 2025

> What is the difference between quantization and quantization_config in config.json?

I can say a little about that. MLX originally added the quantization field to config.json. But Hugging Face uses a field called quantization_config for metadata about the model (e.g. whether it's a quant of another model, to display in the UI). So we now add both, to maintain backward compatibility and to set the right field for Hugging Face. For any MLX model the two should be identical.

Non-MLX models will probably use only quantization_config, and that may or may not be compatible with MLX depending on the quant format.

@johnmai-dev (Contributor, Author)

> MLX originally added the quantization field to the config.json. […] So for any MLX model they should be the same.

Thank you for your answer! ♥️

@johnmai-dev (Contributor, Author)

Thank you very much, @angeloskath!!!
Speed increased 2x!!! 🚀🚀🚀


@johnmai-dev johnmai-dev marked this pull request as ready for review June 27, 2025 14:45

@johnmai-dev (Contributor, Author)

I think it's ready to merge.
cc @davidkoski @awni

@davidkoski (Collaborator) left a comment

This looks great! I like the use of the custom kernel -- this will make a good example.

@davidkoski davidkoski merged commit 2a14634 into ml-explore:main Jul 3, 2025
3 checks passed