Enabled pytorch-linux-backward-compatibility-check-test in GHA#61973
Enabled pytorch-linux-backward-compatibility-check-test in GHA#61973MaxMotovilov wants to merge 8 commits intopytorch:masterfrom
Conversation
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit 21013ce (more details on the Dr. CI page): ✅ None of the CI failures appear to be your fault 💚
❄️ 1 failure tentatively classified as flakybut reruns have not yet been triggered to confirm:
|
|
@MaxMotovilov has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
zhouzhuojie
left a comment
There was a problem hiding this comment.
it still triggers a full build, is it possible that the backward tests can be included in the existing linux CI workflow? For example, just like the build_docs job, it's only enabled for pytorch-linux-xenial-py3.6-gcc5.4, and reuse the build artifacts
| docker_image_base=f"{DOCKER_REGISTRY}/pytorch/pytorch-linux-xenial-py3.6-gcc5.4", | ||
| test_runner_type=LINUX_CPU_TEST_RUNNER, | ||
| on_pull_request=True, | ||
| num_test_shards=2, |
There was a problem hiding this comment.
I don't think test_shards is handled for backward tests, can you help check the logs from both shards - I guess they are testing same things, if yes might want to set it to 1 or remove this field
There was a problem hiding this comment.
I don't think test_shards is handled for backward tests, can you help check the logs from both shards - I guess they are testing same things, if yes might want to set it to 1 or remove this field
Pretty sure they are... I was wondering where the /0 and /1 are coming from.
There was a problem hiding this comment.
Shards does not really work for backward compat tests, so let's keep them at one
There was a problem hiding this comment.
Is an entirely new workflow really needed for this? Seems like this is just an extension of pytorch-linux-xenial-py3.6-gcc5.4 in general
@samestep did a lot of work to enable separate configurations within that workflow
There was a problem hiding this comment.
As far as I could understand, the script for the bw compat test installs a nightly build of PyTorch anyway -
pytorch/.jenkins/pytorch/test.sh
Line 369 in d6e776d
There was a problem hiding this comment.
the main goal of bc check is to build with current commit and compare against latest nightly API FunctionSchemas.
so it is not "overwritten" --> once the FunctionSchema from nightly wheel got extracted it is actually not useful anymore.
That being said, it would be nice to just have some way to host the API schema instead of having to download the entire nightly build.
Given that the first thing it does for the actual test step is installs a nightly, perhaps it doesn't even need the build dependency? Strange that CircleCI version seems to have it... |
oh wait, so basically it just test the backward compatibility for nightly builds? if so, then yes, probably can skip the build step and just go for the test |
But what docker image should I use? Need something that has [a fresh enough] |
That's a base docker image we rebuilt not frequently, e.g. some low-level dependencies will be put there to prepare the pytorch build. The image can be found from the |
e17c636 to
b7ec4be
Compare
...in hopes that resolves the submodule issues during CI
Merging upstream to resolve CI issue w/submodules
walterddr
left a comment
There was a problem hiding this comment.
in order to achieve what @zhouzhuojie mentioned I think you also need to change generate_pytorch_test_matrix.py to generate additional test step for backward_compatibility.
this statement is not true, it is comparing against nightly as baseline (it is test for the current commit) |
Wait - why? Current code in |
|
Yes I am just reflecting on the original comment from test.sh and I am sure there's a reason why it needs to be on its own job instead of being ran as part of other tests. also this will run bc check on both test1 and test2 right? |
Says "before" - which makes sense given that it overwrites current build with a nightly. Shouldn't have a problem with "after" though?
Ah! Yes, I meant to fix that and forgot. Will do - just need to pull this all back on a devserver because the one I was using has already lapsed after I finished bootcamp. |
|
Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as |
Summary:
Test Plan: Execute GHA, verify results
Reviewers: Zhuojie Zhou
Subscribers:
Tasks: T96112383
Tags: