Skip to content

benchmarks/torchbench_model: skip benchmarks that fail to load#6199

Merged
cota merged 1 commit intopytorch:masterfrom
cota:pix2pix
Dec 20, 2023
Merged

benchmarks/torchbench_model: skip benchmarks that fail to load#6199
cota merged 1 commit intopytorch:masterfrom
cota:pix2pix

Conversation

@cota
Copy link
Copy Markdown
Collaborator

@cota cota commented Dec 18, 2023

In "dfcf306e7 Apply precision config env vars in the root process. (#6152)"
we started running load_benchmark() from experiment_runner's
main process. Unfortunately, load_benchmark() for
some models does exit the calling process, which results
in experiment_runner exiting prematurely.

Work around this issue by adding these models to the deny list,
so that experiment_runner does not die early.

@cota cota force-pushed the pix2pix branch 2 times, most recently from a067171 to fbf0916 Compare December 18, 2023 19:17
Copy link
Copy Markdown
Collaborator

@frgossen frgossen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks! If this is a wider spread issue, we should investigate further. Can we add an issue for this one?

@cota
Copy link
Copy Markdown
Collaborator Author

cota commented Dec 18, 2023

pytorch_unet also needs to be added. Will update the PR once the run finishes -- there might be more additions to the deny list needed.
Will add an issue once I gather them all.

@cota cota changed the title benchmarks/torchbench_model: skip pytorch_CycleGAN_and_pix2pix in XLA benchmarks/torchbench_model: skip benchmarks that fail to load Dec 19, 2023
@cota cota force-pushed the pix2pix branch 2 times, most recently from d931dc8 to 54599a6 Compare December 19, 2023 06:27
@cota
Copy link
Copy Markdown
Collaborator Author

cota commented Dec 19, 2023

I've added the necessary models to the deny list as a workaround. I've also filed issue #6207 and referenced it from the code change so that we do not forget why they were added to the deny list.

In "dfcf306e7 Apply precision config env vars in the root process. (pytorch#6152)"
we started running load_benchmark() from experiment_runner's
main process. Unfortunately, load_benchmark() for
some models does exit the calling process, which results
in experiment_runner exiting prematurely.

Work around this issue by adding these models to the deny list,
so that experiment_runner does not die early.
@cota cota merged commit 3ccb4ed into pytorch:master Dec 20, 2023
@cota cota deleted the pix2pix branch December 20, 2023 17:40
cota added a commit to cota/pytorch-xla that referenced this pull request Dec 21, 2023
Add Inductor to the deny list for two benchmarks.

Note: I should have added these already in "3ccb4ed2a
benchmarks/torchbench_model: skip benchmarks that fail to load (pytorch#6199)".
This fixes that oversight.
mbzomowski pushed a commit to mbzomowski-test-org/xla that referenced this pull request Jan 3, 2024
…ch#6199)

In "dfcf306e7 Apply precision config env vars in the root process. (pytorch#6152)"
we started running load_benchmark() from experiment_runner's
main process. Unfortunately, load_benchmark() for
some models does exit the calling process, which results
in experiment_runner exiting prematurely.

Work around this issue by adding these models to the deny list,
so that experiment_runner does not die early.
golechwierowicz pushed a commit that referenced this pull request Jan 12, 2024
In "dfcf306e7 Apply precision config env vars in the root process. (#6152)"
we started running load_benchmark() from experiment_runner's
main process. Unfortunately, load_benchmark() for
some models does exit the calling process, which results
in experiment_runner exiting prematurely.

Work around this issue by adding these models to the deny list,
so that experiment_runner does not die early.
bhavya01 pushed a commit that referenced this pull request Apr 22, 2024
In "dfcf306e7 Apply precision config env vars in the root process. (#6152)"
we started running load_benchmark() from experiment_runner's
main process. Unfortunately, load_benchmark() for
some models does exit the calling process, which results
in experiment_runner exiting prematurely.

Work around this issue by adding these models to the deny list,
so that experiment_runner does not die early.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants