Cleanup byte swapping utilities to generate optimal code on the platforms we care about. by resistor · Pull Request #11394 · pytorch/pytorch

resistor · 2018-09-07T19:03:27Z

While the use of memcpy as part of the byte swapping sequence looks funky, all major
compilers recognize and optimize this pattern reliably, resulting in essentially
optimal code generation.

For example, decodeUInt32LE goes from this on iOS arm64:

    ldrb    w8, [x0, #3]
    ldrb    w9, [x0, #2]
    bfi     w8, w9, #8, #8
    ldrb    w9, [x0, #1]
    bfi     w8, w9, #16, #8
    ldrb    w9, [x0]
    bfi     w8, w9, #24, #8
    mov     x0, x8
    ret

To this:

    ldr     w8, [x0]
    rev     w0, w8
    ret
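For readers unfamiliar with the idiom, here is a minimal sketch of the memcpy-based pattern described above. It is not the code from this pull request: the names `swapBytes32`, `hostIsLittleEndian`, `decodeLE32` and `decodeBE32` are hypothetical, and the shift-and-mask swap is just one portable way to write the reversal. The assumption is that an optimizing compiler turns the fixed-size `memcpy` into a single load and the swap into a single byte-reverse instruction, so whichever decode needs a swap on the host should compile to a load followed by something like `rev`.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helper: reverse the four bytes of a 32-bit value.
// Compilers commonly recognize this shape and emit one instruction.
static inline uint32_t swapBytes32(uint32_t v) {
  return ((v & 0x000000FFu) << 24) | ((v & 0x0000FF00u) << 8) |
         ((v & 0x00FF0000u) >> 8) | ((v & 0xFF000000u) >> 24);
}

// Hypothetical helper: check host byte order by inspecting the first
// byte of a known value. Usually folded to a constant when optimizing.
static inline bool hostIsLittleEndian() {
  const uint16_t probe = 1;
  uint8_t first = 0;
  std::memcpy(&first, &probe, 1);
  return first == 1;
}

// Decode a little-endian 32-bit value from a possibly unaligned buffer.
// The memcpy avoids both unaligned access and aliasing violations.
static inline uint32_t decodeLE32(const uint8_t* data) {
  uint32_t value;
  std::memcpy(&value, data, sizeof(value));
  return hostIsLittleEndian() ? value : swapBytes32(value);
}

// Decode a big-endian 32-bit value from a possibly unaligned buffer.
static inline uint32_t decodeBE32(const uint8_t* data) {
  uint32_t value;
  std::memcpy(&value, data, sizeof(value));
  return hostIsLittleEndian() ? swapBytes32(value) : value;
}

// Small usage example: the same four bytes read in both byte orders.
static inline void exampleUsage() {
  const uint8_t buf[4] = {0x78, 0x56, 0x34, 0x12};
  assert(decodeLE32(buf) == 0x12345678u);
  assert(decodeBE32(buf) == 0x78563412u);
}
```

Compared with assembling the value from four separate byte loads, this form leaves the compiler free to pick the widest safe load for the target, which is where the shorter instruction sequence is expected to come from.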

facebook-github-bot

resistor has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

resistor is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

resistor has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

…orms we care about. (pytorch#11394)

Summary:
While the use of memcpy as part of the byte swapping sequence looks funky, all major compilers recognize and optimize this pattern reliably, resulting in essentially optimal code generation.

For example, decodeUInt32LE goes from this on iOS arm64:

    ldrb    w8, [x0, #3]
    ldrb    w9, [x0, #2]
    bfi     w8, w9, #8, #8
    ldrb    w9, [x0, #1]
    bfi     w8, w9, #16, #8
    ldrb    w9, [x0]
    bfi     w8, w9, #24, #8
    mov     x0, x8
    ret

To this:

    ldr     w8, [x0]
    rev     w0, w8
    ret

Pull Request resolved: pytorch#11394
Reviewed By: SsnL
Differential Revision: D9728659
Pulled By: resistor
fbshipit-source-id: 9afbd4adfad1d1fb7b01f1179e6707ee21fa726f

resistor requested review from apaszke, colesbury, ezyang, gchanan, soumith and zdevito as code owners September 7, 2018 19:03

soumith approved these changes Sep 7, 2018

resistor force-pushed the byteorder branch 2 times, most recently from bc610a9 to 232402d Compare September 7, 2018 20:43

resistor force-pushed the byteorder branch from 232402d to 4f714de Compare September 7, 2018 20:45

facebook-github-bot reviewed Sep 7, 2018

facebook-github-bot reviewed Sep 10, 2018

facebook-github-bot closed this in 0b78ae8 Sep 10, 2018

ezyang added the merged label Jun 26, 2019

resistor wants to merge 1 commit into pytorch:master from resistor:byteorder
