
Binaries without AVX512 kernels shouldn't report CPU Capability as AVX512 on machines with AVX512 support#66703

Closed
imaginary-person wants to merge 5 commits intopytorch:masterfrom
imaginary-person:patch-37

Conversation

@imaginary-person (Contributor)

BUG

If a PyTorch binary is built with a compiler that doesn't support all the AVX512 intrinsics in the codebase, it won't have ATen AVX512 kernels, yet at runtime the CPU capability would still be incorrectly reported as AVX512 on a machine that supports AVX512. gcc versions below 9.0 don't support all the AVX512 intrinsics in the codebase, such as _mm512_set_epi16. Since PyTorch Linux releases appear to be built on CentOS with gcc 7.3, this bug would manifest in the 1.10 release unless a fix such as this one is added.

FIX

CPU capability is reported as AVX512 at runtime only if the binary was built with a compiler that supports all the AVX512 intrinsics in the codebase, and the hardware the binary is running on supports all the required AVX512 instruction sets.


pytorch-probot bot commented Oct 15, 2021

CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/imaginary-person/pytorch-1/blob/fa5df1c7536750152b557f4eb25d6ae6c6ac1261/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default

Triggered Workflows

| Workflow | Labels (bold = enabled) | Status |
| --- | --- | --- |
| linux-bionic-py3.6-clang9 | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/linux, ciflow/noarch, ciflow/xla | ✅ triggered |
| linux-vulkan-bionic-py3.6-clang9 | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/linux, ciflow/vulkan | ✅ triggered |
| linux-xenial-cuda11.3-py3.6-gcc7 | ciflow/all, ciflow/cuda, **ciflow/default**, ciflow/linux | ✅ triggered |
| linux-xenial-py3.6-clang7-asan | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/linux, ciflow/sanitizers | ✅ triggered |
| linux-xenial-py3.6-clang7-onnx | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/linux, ciflow/onnx | ✅ triggered |
| linux-xenial-py3.6-gcc5.4 | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/linux | ✅ triggered |
| linux-xenial-py3.6-gcc7-bazel-test | ciflow/all, ciflow/bazel, ciflow/cpu, **ciflow/default**, ciflow/linux | ✅ triggered |
| win-vs2019-cpu-py3 | ciflow/all, ciflow/cpu, **ciflow/default**, ciflow/win | ✅ triggered |
| win-vs2019-cuda11.3-py3 | ciflow/all, ciflow/cuda, **ciflow/default**, ciflow/win | ✅ triggered |

Skipped Workflows

| Workflow | Labels | Status |
| --- | --- | --- |
| libtorch-linux-xenial-cuda10.2-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux | 🚫 skipped |
| libtorch-linux-xenial-cuda11.3-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux | 🚫 skipped |
| linux-bionic-cuda10.2-py3.9-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow | 🚫 skipped |
| linux-xenial-cuda10.2-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow | 🚫 skipped |
| parallelnative-linux-xenial-py3.6-gcc5.4 | ciflow/all, ciflow/cpu, ciflow/linux | 🚫 skipped |
| periodic-libtorch-linux-xenial-cuda11.1-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled | 🚫 skipped |
| periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled, ciflow/slow, ciflow/slow-gradcheck | 🚫 skipped |
| periodic-linux-xenial-cuda11.1-py3.6-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled | 🚫 skipped |
| periodic-win-vs2019-cuda11.1-py3 | ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win | 🚫 skipped |
| puretorch-linux-xenial-py3.6-gcc5.4 | ciflow/all, ciflow/cpu, ciflow/linux | 🚫 skipped |

You can add a comment to the PR and tag @pytorchbot with the following commands:

```
# ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun

# ciflow rerun with additional labels "-l <ciflow/label_name>", which is
# equivalent to adding these labels manually and triggering the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow
```

For more information, please take a look at the CI Flow Wiki.


facebook-github-bot commented Oct 15, 2021

🔗 Helpful links

💊 CI failures summary and remediations

As of commit fa5df1c (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

@imaginary-person imaginary-person changed the title Binaries built with compilers not supporting all AVX512 intrinsics shouldn't report CPU Capability as AVX512 Binaries without AVX512 kernels shouldn't report CPU Capability as AVX512 on machines with AVX512 support Oct 15, 2021
imaginary-person and others added 2 commits October 15, 2021 15:24
If a machine without AVX2 support is used with the environment variable `ATEN_CPU_CAPABILITY=avx2`, then SIGILLs would happen without this check.

Co-authored-by: Nikita Shulga <nikita.shulga@gmail.com>
If binaries are built with only the default ATen CPU capability, then we won't call cpuinfo functions to query AVX2 support.
@zou3519 zou3519 added the triaged label (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) Oct 18, 2021
@facebook-github-bot (Contributor)

@malfet has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

cpuinfo is not guaranteed to work correctly on newer versions of CPUs, and users should have the freedom to override faulty auto-detection, which may also be useful for debugging hardware emulators.
@facebook-github-bot (Contributor)

@malfet has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot (Contributor)

@malfet merged this pull request in b696d64.


Labels

cla signed · Merged · open source · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)


5 participants