[TensorExpr] TensorExprKernel: don't do any compilation or lowering in run(). by ZolotukhinM · Pull Request #37948 · pytorch/pytorch

ZolotukhinM · 2020-05-06T19:15:35Z

Stack from ghstack:

- #37948 [TensorExpr] TensorExprKernel: don't do any compilation or lowering in run().

The input JIT graph contains all the information needed to perform the
entire compilation at construction time, so no steps need to be postponed
until execution time. The graph also always tells us which device the
kernel will execute on, so TensorExprKernel does not need a CodeGen
cache: there is always exactly one CodeGen.

Differential Revision: D21432145
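
The idea in the description can be illustrated with a minimal, self-contained sketch. It is not the code from this PR: `Graph`, `CodeGen`, `Kernel`, `compile`, and `compile_count` are hypothetical stand-ins, and the "compilation" is a toy closure. The sketch only assumes what the description states: the graph already determines the device and the computation, so the kernel can build its single code generator in the constructor and `run()` only has to execute it.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <vector>

// Hypothetical stand-ins for illustration; these are NOT the real PyTorch types.
enum class Device { CPU, CUDA };

struct Graph {
  Device device;  // assumption: the target device is known from the graph
  float scale;    // toy computation: multiply every input by `scale`
};

struct CodeGen {
  Device device;
  std::function<float(float)> fn;  // stands in for compiled code
};

// Counts how many times "compilation" happens (illustration only).
int compile_count = 0;

// Toy lowering + compilation step: needs nothing but the graph.
std::unique_ptr<CodeGen> compile(const Graph& g) {
  ++compile_count;
  const float scale = g.scale;
  return std::make_unique<CodeGen>(
      CodeGen{g.device, [scale](float x) { return x * scale; }});
}

class Kernel {
 public:
  // All compilation happens here, once, at construction time.
  explicit Kernel(const Graph& g) : codegen_(compile(g)) {}

  // run() only executes the already-built CodeGen; there is no cache lookup
  // and no lazy compilation on this path.
  std::vector<float> run(const std::vector<float>& inputs) const {
    std::vector<float> out;
    out.reserve(inputs.size());
    for (float x : inputs) {
      out.push_back(codegen_->fn(x));
    }
    return out;
  }

 private:
  std::unique_ptr<CodeGen> codegen_;  // exactly one CodeGen per kernel
};
```

In this sketch, constructing one `Kernel` and calling `run()` any number of times increments `compile_count` exactly once, which is the property the PR title describes.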



dr-ci · 2020-05-06T20:25:31Z

💊 CI failures summary and remediations

As of commit de9aeb5 (more details on the Dr. CI page):

1/1 failures possibly* introduced in this PR
- 1/1 non-CircleCI failure(s)

ci.pytorch.org: 1 failed

Failed: pr/py3.6-clang7-rocmdeb-ubuntu16.04



protonu · 2020-05-13T17:18:12Z

@ZolotukhinM, does this work with the multi-GPU tests in test_jit_fuser_te.py?

ZolotukhinM · 2020-05-13T17:33:06Z

> @ZolotukhinM, does this work with the multi-GPU tests in test_jit_fuser_te.py?

All tests pass with this change.

facebook-github-bot · 2020-05-13T22:17:52Z

@ZolotukhinM merged this pull request in 6e13146.

…n run(). (pytorch#37948)

Summary: Pull Request resolved: pytorch#37948

The input JIT graph has all the information we need to perform the entire compilation at the construction time. We don't need to postpone any steps until the execution time. Also, from the graph we always know what device we will be executing on and thus we don't need to have a CodeGen cache in TensorExprKernel - we always have one and only one CodeGen.

Test Plan: Imported from OSS

Reviewed By: protonu

Differential Revision: D21432145

Pulled By: ZolotukhinM

fbshipit-source-id: 8dc86b891713056b2c62f30170cd4a168912f027

ZolotukhinM requested a review from apaszke as a code owner May 6, 2020 19:15

ZolotukhinM mentioned this pull request May 6, 2020

[TensorExpr] Support Bool dtype in Or, Xor, And ops and in TensorExprKernel::bindInput. #37938

Closed

facebook-github-bot added the oncall: jit label May 6, 2020

protonu self-assigned this May 6, 2020

ZolotukhinM requested review from bertmaher, protonu and zheng-xq May 7, 2020 04:29

protonu approved these changes May 13, 2020


facebook-github-bot closed this in 6e13146 May 13, 2020

facebook-github-bot added the merged label May 13, 2020

facebook-github-bot deleted the gh/ZolotukhinM/239/head branch May 17, 2020 14:18

mruberry added the Merged label Oct 28, 2020

ZolotukhinM wants to merge 4 commits into gh/ZolotukhinM/239/base from gh/ZolotukhinM/239/head
