Skip to content

Conversation

@AaronO
Copy link
Contributor

@AaronO AaronO commented Apr 25, 2023

The SWAR code now operates on a register of the host CPU at a time as intended.

Note this might actually not be faster on 32-bit, I would have to bench it but in some cases 4 memory reads / lookup-table reads might be faster than blockwide-operations

AaronO added 2 commits May 4, 2023 04:57
The SWAR code now operates on a register of the host CPU at a time as intended.

Note this might actually not be faster on 32-bit, I would have to bench it but in some cases 4 memory reads / lookup-table reads might be faster than blockwide-operations
@AaronO AaronO force-pushed the cleanup/swar-drop-64bit-assumptions branch from df301b6 to e10d41e Compare May 4, 2023 07:57
@seanmonstar seanmonstar merged commit 1c5faf8 into seanmonstar:master May 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants