store ascii start/continue bitmaps in u128 instead of slices of bool#40
Merged
dtolnay merged 1 commit intodtolnay:masterfrom Oct 21, 2025
Merged
store ascii start/continue bitmaps in u128 instead of slices of bool#40dtolnay merged 1 commit intodtolnay:masterfrom
dtolnay merged 1 commit intodtolnay:masterfrom
Conversation
Reduce execution time by 44% in 0%-nonascii case and by 32% in 10%-nonascii. Reduce the static data size ~200 bytes.
Contributor
Author
|
Tested also on Apple M1 with a time reduction around 20-30% |
Contributor
Author
|
Hello @dtolnay, do you have any feedback about this? |
dtolnay
approved these changes
Oct 21, 2025
Owner
dtolnay
left a comment
There was a problem hiding this comment.
Thank you! This is an extraordinary performance improvement.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Hello,
Using slices of bools to store ASCII CONTINUE and START data results in using 2*128 bytes of memory since one entire byte is used to store each boolean.
Since ASCII chars are 128, bitmaps fit perfectly in u128 consts.
This change reduces execution time by 44% in 0%-nonascii case and by 32% in 10%-nonascii, according to the crate's benches, and reduces the static storage by ~200 bytes.
Benchmarks were run on an Intel i5-12400 system.
Thanks