benchmarks/torchbench_model: skip benchmarks that fail to load by cota · Pull Request #6199 · pytorch/xla

cota · 2023-12-18T19:09:24Z

In "dfcf306e7 Apply precision config env vars in the root process. (#6152)"
we started running load_benchmark() from experiment_runner's
main process. Unfortunately, load_benchmark() for
some models does exit the calling process, which results
in experiment_runner exiting prematurely.

Work around this issue by adding these models to the deny list,
so that experiment_runner does not die early.

frgossen

Thanks! If this is a wider spread issue, we should investigate further. Can we add an issue for this one?

cota · 2023-12-18T19:42:31Z

pytorch_unet also needs to be added. Will update the PR once the run finishes -- there might be more additions to the deny list needed.
Will add an issue once I gather them all.

cota · 2023-12-19T06:28:31Z

I've added the necessary models to the deny list as a workaround. I've also filed issue #6207 and referenced it from the code change so that we do not forget why they were added to the deny list.

In "dfcf306e7 Apply precision config env vars in the root process. (pytorch#6152)" we started running load_benchmark() from experiment_runner's main process. Unfortunately, load_benchmark() for some models does exit the calling process, which results in experiment_runner exiting prematurely. Work around this issue by adding these models to the deny list, so that experiment_runner does not die early.

Add Inductor to the deny list for two benchmarks. Note: I should have added these already in "3ccb4ed2a benchmarks/torchbench_model: skip benchmarks that fail to load (pytorch#6199)". This fixes that oversight.

…ch#6199) In "dfcf306e7 Apply precision config env vars in the root process. (pytorch#6152)" we started running load_benchmark() from experiment_runner's main process. Unfortunately, load_benchmark() for some models does exit the calling process, which results in experiment_runner exiting prematurely. Work around this issue by adding these models to the deny list, so that experiment_runner does not die early.

In "dfcf306e7 Apply precision config env vars in the root process. (#6152)" we started running load_benchmark() from experiment_runner's main process. Unfortunately, load_benchmark() for some models does exit the calling process, which results in experiment_runner exiting prematurely. Work around this issue by adding these models to the deny list, so that experiment_runner does not die early.

cota requested review from frgossen and golechwierowicz December 18, 2023 19:09

cota force-pushed the pix2pix branch 2 times, most recently from a067171 to fbf0916 Compare December 18, 2023 19:17

frgossen approved these changes Dec 18, 2023

View reviewed changes

cota force-pushed the pix2pix branch from fbf0916 to 102b3b4 Compare December 19, 2023 06:07

cota changed the title ~~benchmarks/torchbench_model: skip pytorch_CycleGAN_and_pix2pix in XLA~~ benchmarks/torchbench_model: skip benchmarks that fail to load Dec 19, 2023

cota force-pushed the pix2pix branch from 102b3b4 to ab24fe2 Compare December 19, 2023 06:10

cota mentioned this pull request Dec 19, 2023

benchmarks/torchbench_model: some benchmarks fail to load and kill experiment_runner's main process #6207

Closed

cota force-pushed the pix2pix branch 2 times, most recently from d931dc8 to 54599a6 Compare December 19, 2023 06:27

golechwierowicz approved these changes Dec 19, 2023

View reviewed changes

cota force-pushed the pix2pix branch from 54599a6 to 0b02a84 Compare December 19, 2023 20:09

cota force-pushed the pix2pix branch from 0b02a84 to fa2bca8 Compare December 19, 2023 20:13

cota merged commit 3ccb4ed into pytorch:master Dec 20, 2023

cota deleted the pix2pix branch December 20, 2023 17:40

cota mentioned this pull request Dec 21, 2023

benchmarks/torchbench_model: add more benchmarks that fail to load #6226

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

benchmarks/torchbench_model: skip benchmarks that fail to load#6199

benchmarks/torchbench_model: skip benchmarks that fail to load#6199
cota merged 1 commit intopytorch:masterfrom
cota:pix2pix

cota commented Dec 18, 2023 •

edited

Loading

Uh oh!

frgossen left a comment

Uh oh!

cota commented Dec 18, 2023 •

edited

Loading

Uh oh!

cota commented Dec 19, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

cota commented Dec 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

frgossen left a comment

Choose a reason for hiding this comment

Uh oh!

cota commented Dec 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cota commented Dec 19, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cota commented Dec 18, 2023 •

edited

Loading

cota commented Dec 18, 2023 •

edited

Loading