
Get rid of some template arguments in GPU loop#33308

Closed
zasdfgbnm wants to merge 4 commits into pytorch:master from zasdfgbnm:reduce_template_args

Conversation

@zasdfgbnm
Collaborator

@zasdfgbnm zasdfgbnm commented Feb 13, 2020

Globally define

```C++
constexpr int num_threads = C10_WARP_SIZE * 2;
constexpr int thread_work_size = 4;
constexpr int block_work_size = thread_work_size * num_threads;
```

and kill all the template arguments passing these values.

These values are effectively global, but they are currently passed around as template arguments, which makes the code unnecessarily cumbersome.

@dr-ci

dr-ci bot commented Feb 13, 2020

💊 CircleCI build failures summary and remediations

As of commit 48ba22b:

  • 1/2 failures introduced in this PR
  • 1/2 recognized as flaky ❄️

Detailed failure analysis

One may explore the probable reasons each build failed interactively on the Dr. CI website.

🕵️ 1 new failure recognized by patterns

The following build failures do not appear to be due to upstream breakage:

See CircleCI build pytorch_xla_linux_xenial_py3_6_clang7_build (1/1)

Step: "Build" (full log | pattern match details)

Feb 13 20:28:57 -- Build files have been written to: /var/lib/jenkins/workspace/xla/test/cpp/build 
Feb 13 20:28:57 + make -j 
Feb 13 20:28:57 Scanning dependencies of target googletest 
Feb 13 20:28:57 [  4%] Creating directories for 'googletest' 
Feb 13 20:28:57 [  9%] Performing download step (git clone) for 'googletest' 
Feb 13 20:28:59 -- googletest download command succeeded.  See also /var/lib/jenkins/workspace/xla/test/cpp/build/gtest/src/googletest-stamp/googletest-download-*.log 
Feb 13 20:28:59 [ 19%] No patch step for 'googletest' 
Feb 13 20:28:59 [ 19%] Performing update step for 'googletest' 
Feb 13 20:28:59 Current branch master is up to date. 
Feb 13 20:28:59 [ 23%] Performing configure step for 'googletest' 
Feb 13 20:29:01 CMake Error at /var/lib/jenkins/workspace/xla/test/cpp/build/gtest/src/googletest-stamp/googletest-configure-Release.cmake:16 (message): 
Feb 13 20:29:01   Command failed: 1 
Feb 13 20:29:01  
Feb 13 20:29:01    '/usr/bin/cmake' '-GUnix Makefiles' '/var/lib/jenkins/workspace/xla/test/cpp/build/gtest/src/googletest-src' 
Feb 13 20:29:01  
Feb 13 20:29:01   See also 
Feb 13 20:29:01  
Feb 13 20:29:01     /var/lib/jenkins/workspace/xla/test/cpp/build/gtest/src/googletest-stamp/googletest-configure-*.log 
Feb 13 20:29:01  
Feb 13 20:29:01  
Feb 13 20:29:01 CMakeFiles/googletest.dir/build.make:105: recipe for target 'gtest/src/googletest-stamp/googletest-configure' failed 

❄️ 1 failure recognized as flaky

The following build failures have been detected as flaky and may not be your fault:

See CircleCI build pytorch_windows_test2 (1/1)

Step: "Test" (full log | pattern match details) ❄️

====================================================================== 
FAIL: test_cuda_extension (__main__.TestCppExtensionAOT) 
---------------------------------------------------------------------- 
Traceback (most recent call last): 
  File "test_cpp_extensions_aot.py", line 67, in test_cuda_extension 
    self.assertEqual(z, torch.ones_like(z)) 
  File "C:\Users\circleci\project\build\win_tmp\build\torch\testing\_internal\common_utils.py", line 862, in assertEqual 
    assertTensorsEqual(x, y) 
  File "C:\Users\circleci\project\build\win_tmp\build\torch\testing\_internal\common_utils.py", line 832, in assertTensorsEqual 
    self.assertLessEqual(max_err, prec, message) 
AssertionError: tensor(1.) not less than or equal to 1e-05 :  
 
---------------------------------------------------------------------- 
Ran 10 tests in 1.339s 
 
FAILED (failures=1, skipped=1) 
Traceback (most recent call last): 
  File "run_test.py", line 486, in <module> 
    main() 
  File "run_test.py", line 479, in main 
    raise RuntimeError(message) 

This comment was automatically generated by Dr. CI.

@zasdfgbnm zasdfgbnm changed the title from "[WIP] Get rid of some template arguments in GPU loop" to "Get rid of some template arguments in GPU loop" Feb 13, 2020
@zasdfgbnm zasdfgbnm requested a review from ngimel February 13, 2020 21:09
@zasdfgbnm
Collaborator Author

Test failures are unrelated.

```diff
-namespace at { namespace native { namespace modern { namespace detail {
+namespace at { namespace native {

 constexpr int num_threads = C10_WARP_SIZE * 2;
```

I assume those values are unchanged from what they used to be for ROCm? And at present we don't need to specialize them on a per-kernel basis? If so, then making them constexpr rather than passing them as template arguments makes sense.

@ngimel
Collaborator

ngimel commented Feb 13, 2020

cc @iotamudelta. This PR does not change any existing behavior, just makes some template arguments that were never changed into constexprs.

Contributor

@facebook-github-bot facebook-github-bot left a comment


@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


@facebook-github-bot
Contributor

@ngimel merged this pull request in cd038c0.

ttumiel pushed a commit to ttumiel/pytorch that referenced this pull request Mar 4, 2020
Summary:
Globally define
```C++
constexpr int num_threads = C10_WARP_SIZE * 2;
constexpr int thread_work_size = 4;
constexpr int block_work_size = thread_work_size * num_threads;
```
and kill all the template arguments passing these values.

These values are effectively global, but they are currently passed around as template arguments, which makes the code unnecessarily cumbersome.
Pull Request resolved: pytorch#33308

Differential Revision: D19907250

Pulled By: ngimel

fbshipit-source-id: 4623b69baea7e6e77f460ffdfa07cf9f8cba588a
