[_shard] add copy_ to shardedtensor#82508

Closed
wanchaol wants to merge 2 commits into gh/wanchaol/226/base from gh/wanchaol/226/head

@wanchaol
Collaborator

@wanchaol wanchaol commented Jul 29, 2022

Stack from ghstack (oldest at bottom):

as titled

Differential Revision: D38290442

@facebook-github-bot
Contributor

facebook-github-bot commented Jul 29, 2022

❌ 11 New Failures

As of commit 38a6d7d (more details on the Dr. CI page):

  • 11/11 failures introduced in this PR

🕵️ 11 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages

See GitHub Actions build periodic / linux-bionic-cuda11.6-py3.7-gcc7-debug / build (1/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:43.8271406Z ##[error]The operation was canceled.
2022-08-01T18:02:05.1810836Z [ 97%] Building CXX object test_api/CMakeFiles/test_api.dir/ordered_dict.cpp.o
2022-08-01T18:02:05.2458781Z [ 97%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_create_autodiff_subgraphs.cpp.o
2022-08-01T18:02:13.0646542Z [ 97%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_custom_class.cpp.o
2022-08-01T18:02:14.3248193Z [ 97%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_custom_class_registrations.cpp.o
2022-08-01T18:02:22.7372106Z [ 97%] Building CXX object nvfuser_bench/CMakeFiles/nvfuser_bench.dir/lstm_cell.cpp.o
2022-08-01T18:02:23.0568671Z [ 97%] Building CXX object nvfuser_bench/CMakeFiles/nvfuser_bench.dir/reduction.cpp.o
2022-08-01T18:02:26.8506982Z [ 97%] Building CXX object nvfuser_bench/CMakeFiles/nvfuser_bench.dir/softmax.cpp.o
2022-08-01T18:02:28.6525480Z [ 97%] Building CXX object nvfuser_bench/CMakeFiles/nvfuser_bench.dir/softmax_backward.cpp.o
2022-08-01T18:02:35.7336004Z [ 97%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_custom_operators.cpp.o
2022-08-01T18:02:43.0347202Z [ 97%] Building CXX object test_api/CMakeFiles/test_api.dir/rnn.cpp.o
2022-08-01T18:02:43.8271406Z ##[error]The operation was canceled.
2022-08-01T18:02:43.8335286Z Prepare all required actions
2022-08-01T18:02:43.8404415Z ##[group]Run ./.github/actions/teardown-linux
2022-08-01T18:02:43.8404750Z with:
2022-08-01T18:02:43.8405000Z ##[endgroup]
2022-08-01T18:02:43.8453129Z ##[group]Run .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:43.8453593Z .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:43.8743716Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0}
2022-08-01T18:02:43.8744118Z ##[endgroup]
2022-08-01T18:02:43.8821642Z Holding runner for 2 hours until all ssh sessions have logged out
2022-08-01T18:02:43.8933146Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty

See GitHub Actions build periodic / ios-12-5-1-arm64-metal / build (2/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:44.2565210Z ##[error]The operation was canceled.
2022-08-01T18:02:00.7571110Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qhardswish.cpp.o
2022-08-01T18:02:11.4647400Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qlinear.cpp.o
2022-08-01T18:02:11.9204060Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qlinear_dynamic.cpp.o
2022-08-01T18:02:12.4884750Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qconv_dynamic.cpp.o
2022-08-01T18:02:24.6681690Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/LinearUnpackImpl.cpp.o
2022-08-01T18:02:24.6692780Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qlinear_prepack.cpp.o
2022-08-01T18:02:24.6732120Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qmatmul.cpp.o
2022-08-01T18:02:35.4429710Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qmul.cpp.o
2022-08-01T18:02:35.5796880Z [ 92%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/qnormalization.cpp.o
2022-08-01T18:02:36.6286980Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/Pooling.cpp.o
2022-08-01T18:02:44.2565210Z ##[error]The operation was canceled.
2022-08-01T18:02:44.2673720Z Post job cleanup.
2022-08-01T18:02:44.2740110Z Post job cleanup.
2022-08-01T18:02:44.6563990Z [command]/usr/local/bin/git version
2022-08-01T18:02:44.7143720Z git version 2.37.1
2022-08-01T18:02:46.0691690Z Copying '/Users/runner/.gitconfig' to '/Users/runner/work/_temp/7deb79bd-edec-4a98-a401-38b7cae1ecc0/.gitconfig'
2022-08-01T18:02:46.1008770Z Temporarily overriding HOME='/Users/runner/work/_temp/7deb79bd-edec-4a98-a401-38b7cae1ecc0' before making global git config changes
2022-08-01T18:02:46.1110730Z Adding repository directory to the temporary git global config as a safe directory
2022-08-01T18:02:46.1338380Z [command]/usr/local/bin/git config --global --add safe.directory /Users/runner/work/pytorch/pytorch
2022-08-01T18:02:46.1491800Z [command]/usr/local/bin/git config --local --name-only --get-regexp core\.sshCommand
2022-08-01T18:02:46.1493320Z [command]/usr/local/bin/git submodule foreach --recursive git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :

See GitHub Actions build periodic / ios-12-5-1-arm64-coreml / build (3/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:42.2574460Z ##[error]The operation was canceled.
2022-08-01T18:01:59.2945220Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/BatchLinearAlgebraKernel.cpp.o
2022-08-01T18:02:06.3666340Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/Batching.cpp.o
2022-08-01T18:02:12.4672000Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/BinaryOps.cpp.o
2022-08-01T18:02:14.6311670Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/Blas.cpp.o
2022-08-01T18:02:16.7221550Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/BlasKernel.cpp.o
2022-08-01T18:02:26.5862700Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/Bucketization.cpp.o
2022-08-01T18:02:27.3541840Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/CPUBlas.cpp.o
2022-08-01T18:02:27.3974960Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/ChanelShuffle.cpp.o
2022-08-01T18:02:38.5394000Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/Col2Im.cpp.o
2022-08-01T18:02:41.9265870Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/PadNd.cpp.o
2022-08-01T18:02:42.2574460Z ##[error]The operation was canceled.
2022-08-01T18:02:42.2681800Z Post job cleanup.
2022-08-01T18:02:42.2746020Z Post job cleanup.
2022-08-01T18:02:42.6083600Z [command]/usr/local/bin/git version
2022-08-01T18:02:42.6185600Z git version 2.37.1
2022-08-01T18:02:42.6421840Z Copying '/Users/runner/.gitconfig' to '/Users/runner/work/_temp/c55642dc-0315-482e-a25e-254fb6afd7be/.gitconfig'
2022-08-01T18:02:42.7345410Z Temporarily overriding HOME='/Users/runner/work/_temp/c55642dc-0315-482e-a25e-254fb6afd7be' before making global git config changes
2022-08-01T18:02:42.7447250Z Adding repository directory to the temporary git global config as a safe directory
2022-08-01T18:02:42.7549460Z [command]/usr/local/bin/git config --global --add safe.directory /Users/runner/work/pytorch/pytorch
2022-08-01T18:02:42.7651970Z [command]/usr/local/bin/git config --local --name-only --get-regexp core\.sshCommand
2022-08-01T18:02:42.7755130Z [command]/usr/local/bin/git submodule foreach --recursive git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :

See GitHub Actions build periodic / ios-12-5-1-arm64 / build (4/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:42.6932220Z ##[error]The operation was canceled.
2022-08-01T18:02:12.1470600Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/TensorFactories.cpp.o
2022-08-01T18:02:17.7524560Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/AffineQuantizer.cpp.o
2022-08-01T18:02:20.6679730Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/AffineQuantizerBase.cpp.o
2022-08-01T18:02:22.8205670Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/FakeQuantPerChannelAffine.cpp.o
2022-08-01T18:02:23.3486130Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/FakeQuantPerTensorAffine.cpp.o
2022-08-01T18:02:29.9697640Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/library.cpp.o
2022-08-01T18:02:34.6662370Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/TensorAdvancedIndexing.cpp.o
2022-08-01T18:02:34.7999380Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/RuyUtils.cpp.o
2022-08-01T18:02:34.9211540Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/cpu/XnnpackUtils.cpp.o
2022-08-01T18:02:41.1606770Z [ 93%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/quantized/qlinear_unpack.cpp.o
2022-08-01T18:02:42.6932220Z ##[error]The operation was canceled.
2022-08-01T18:02:42.7136210Z Post job cleanup.
2022-08-01T18:02:42.7266010Z Post job cleanup.
2022-08-01T18:02:43.7146360Z [command]/usr/local/bin/git version
2022-08-01T18:02:44.7314660Z git version 2.37.1
2022-08-01T18:02:44.7575330Z Copying '/Users/runner/.gitconfig' to '/Users/runner/work/_temp/2cf10220-e75d-44df-84c6-b179edf59438/.gitconfig'
2022-08-01T18:02:44.7821560Z Temporarily overriding HOME='/Users/runner/work/_temp/2cf10220-e75d-44df-84c6-b179edf59438' before making global git config changes
2022-08-01T18:02:44.7923920Z Adding repository directory to the temporary git global config as a safe directory
2022-08-01T18:02:44.7975510Z [command]/usr/local/bin/git config --global --add safe.directory /Users/runner/work/pytorch/pytorch
2022-08-01T18:02:44.8067170Z [command]/usr/local/bin/git config --local --name-only --get-regexp core\.sshCommand
2022-08-01T18:02:44.8126370Z [command]/usr/local/bin/git submodule foreach --recursive git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :

See GitHub Actions build periodic / linux-bionic-cuda10.2-py3.9-gcc7 / build (5/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:44.8502033Z ##[error]The operation was canceled.
2022-08-01T18:02:05.7574405Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_lite_trainer.cpp.o
2022-08-01T18:02:10.3576287Z [ 98%] Building CXX object caffe2/torch/CMakeFiles/torch_python.dir/csrc/jit/codegen/cuda/python_frontend/python_bindings.cpp.o
2022-08-01T18:02:15.8427450Z [ 98%] Building CXX object caffe2/torch/CMakeFiles/torch_python.dir/csrc/jit/codegen/cuda/python_frontend/fusion_definition.cpp.o
2022-08-01T18:02:20.5852865Z [ 98%] Building CXX object caffe2/torch/CMakeFiles/torch_python.dir/csrc/jit/python/init.cpp.o
2022-08-01T18:02:25.5940447Z [ 98%] Building CXX object caffe2/torch/CMakeFiles/torch_python.dir/csrc/jit/passes/onnx.cpp.o
2022-08-01T18:02:26.0839077Z [ 98%] Building CXX object test_api/CMakeFiles/test_api.dir/torch_include.cpp.o
2022-08-01T18:02:30.7940580Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_memory_dag.cpp.o
2022-08-01T18:02:31.9087233Z [ 98%] Building CXX object test_api/CMakeFiles/test_api.dir/inference_mode.cpp.o
2022-08-01T18:02:39.2799078Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_misc.cpp.o
2022-08-01T18:02:44.0155461Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_mobile_type_parser.cpp.o
2022-08-01T18:02:44.8502033Z ##[error]The operation was canceled.
2022-08-01T18:02:44.8552791Z Prepare all required actions
2022-08-01T18:02:44.8580929Z ##[group]Run ./.github/actions/teardown-linux
2022-08-01T18:02:44.8581257Z with:
2022-08-01T18:02:44.8581533Z ##[endgroup]
2022-08-01T18:02:44.8602770Z ##[group]Run .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:44.8603212Z .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:44.8631032Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0}
2022-08-01T18:02:44.8631395Z ##[endgroup]
2022-08-01T18:02:44.8719063Z Holding runner for 2 hours until all ssh sessions have logged out
2022-08-01T18:02:44.8819569Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty

See GitHub Actions build periodic / ios-12-5-1-x86-64-coreml / build (6/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:44.1016950Z ##[error]The operation was canceled.
2022-08-01T18:02:38.2465480Z [ 88%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/exp-f32-avx512f-rr2-p5-scalef.c.o
2022-08-01T18:02:38.7062350Z [ 88%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/exp-f32-avx512f-rr2-p5.c.o
2022-08-01T18:02:38.9909940Z [ 88%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/core/BackendSelectFallbackKernel.cpp.o
2022-08-01T18:02:39.4545170Z [ 88%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/expm1minus-f32-avx512f-rr1-lut16-p3-perm.c.o
2022-08-01T18:02:40.1895520Z [ 88%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/expm1minus-f32-avx512f-rr1-p6.c.o
2022-08-01T18:02:40.9047190Z [ 88%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/extexp-avx512f-p5.c.o
2022-08-01T18:02:41.6314770Z [ 89%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/sigmoid-f32-avx512f-rr1-lut16-p3-perm-scalef-div.c.o
2022-08-01T18:02:42.3804600Z [ 89%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/sigmoid-f32-avx512f-rr1-lut16-p3-perm-scalef-nr1fma.c.o
2022-08-01T18:02:43.1147170Z [ 89%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/sigmoid-f32-avx512f-rr1-lut16-p3-perm-scalef-nr1fma1adj.c.o
2022-08-01T18:02:43.8545840Z [ 89%] Building C object confu-deps/XNNPACK/CMakeFiles/all_microkernels.dir/src/math/sigmoid-f32-avx512f-rr1-lut32-p2-perm2-scalef-div.c.o
2022-08-01T18:02:44.1016950Z ##[error]The operation was canceled.
2022-08-01T18:02:44.1125310Z Post job cleanup.
2022-08-01T18:02:44.1198090Z Post job cleanup.
2022-08-01T18:02:44.9352770Z [command]/usr/local/bin/git version
2022-08-01T18:02:44.9411550Z git version 2.37.1
2022-08-01T18:02:45.9940110Z Copying '/Users/runner/.gitconfig' to '/Users/runner/work/_temp/ab499524-42a4-470c-a518-ee045f743d26/.gitconfig'
2022-08-01T18:02:46.0056060Z Temporarily overriding HOME='/Users/runner/work/_temp/ab499524-42a4-470c-a518-ee045f743d26' before making global git config changes
2022-08-01T18:02:46.0255660Z Adding repository directory to the temporary git global config as a safe directory
2022-08-01T18:02:46.0424620Z [command]/usr/local/bin/git config --global --add safe.directory /Users/runner/work/pytorch/pytorch
2022-08-01T18:02:46.0584160Z [command]/usr/local/bin/git config --local --name-only --get-regexp core\.sshCommand
2022-08-01T18:02:46.0585450Z [command]/usr/local/bin/git submodule foreach --recursive git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :

See GitHub Actions build periodic / linux-focal-rocm5.2-py3.7-slow / build (7/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:44.0456676Z ##[error]The operation was canceled.
2022-08-01T18:02:41.1847259Z cc1plus: warning: unrecognized command line option ‘-Wno-inconsistent-missing-override’
2022-08-01T18:02:41.1847736Z cc1plus: warning: unrecognized command line option ‘-Wno-macro-redefined’
2022-08-01T18:02:41.1898460Z [ 97%] Building CXX object caffe2/torch/CMakeFiles/torch_python.dir/csrc/Stream.cpp.o
2022-08-01T18:02:42.2682200Z cc1plus: warning: command line option ‘-Wno-duplicate-decl-specifier’ is valid for C/ObjC but not for C++
2022-08-01T18:02:42.2683232Z cc1plus: warning: unrecognized command line option ‘-Wno-implicit-int-float-conversion’
2022-08-01T18:02:42.2688838Z cc1plus: warning: unrecognized command line option ‘-Wno-unused-command-line-argument’
2022-08-01T18:02:42.2689379Z cc1plus: warning: unrecognized command line option ‘-Wno-exceptions’
2022-08-01T18:02:42.2690142Z cc1plus: warning: unrecognized command line option ‘-Wno-inconsistent-missing-override’
2022-08-01T18:02:42.2690657Z cc1plus: warning: unrecognized command line option ‘-Wno-macro-redefined’
2022-08-01T18:02:42.2739460Z [ 97%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_cs_debug_info_serialization.cpp.o
2022-08-01T18:02:44.0456676Z ##[error]The operation was canceled.
2022-08-01T18:02:44.0496910Z Prepare all required actions
2022-08-01T18:02:44.0524347Z ##[group]Run ./.github/actions/teardown-linux
2022-08-01T18:02:44.0524710Z with:
2022-08-01T18:02:44.0524970Z ##[endgroup]
2022-08-01T18:02:44.0545415Z ##[group]Run .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:44.0545860Z .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:44.0569524Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0}
2022-08-01T18:02:44.0569913Z ##[endgroup]
2022-08-01T18:02:44.0631985Z Holding runner for 2 hours until all ssh sessions have logged out
2022-08-01T18:02:44.0727019Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty

See GitHub Actions build periodic / buck-build-test / buck-build-test (8/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T17:26:51.6045244Z head: cannot open ...va/cacerts' for reading: No such file or directory
2022-08-01T17:26:51.4381563Z Preparing to unpack .../7-watchman_4.9.0-3build1_amd64.deb ...
2022-08-01T17:26:51.4390889Z Unpacking watchman (4.9.0-3build1) ...
2022-08-01T17:26:51.4954538Z Selecting previously unselected package nailgun.
2022-08-01T17:26:51.5175936Z Preparing to unpack .../8-nailgun_0.9.3-3_amd64.deb ...
2022-08-01T17:26:51.5182643Z Unpacking nailgun (0.9.3-3) ...
2022-08-01T17:26:51.5786003Z Setting up watchman (4.9.0-3build1) ...
2022-08-01T17:26:51.5814970Z Setting up libjna-jni (4.5.2-1build2) ...
2022-08-01T17:26:51.5842541Z Setting up libpcsclite1:amd64 (1.8.26-3) ...
2022-08-01T17:26:51.5869015Z Setting up libjna-java (4.5.2-1build2) ...
2022-08-01T17:26:51.5896768Z Setting up ca-certificates-java (20190405ubuntu1) ...
2022-08-01T17:26:51.6045244Z head: cannot open '/etc/ssl/certs/java/cacerts' for reading: No such file or directory
2022-08-01T17:26:51.8213328Z Adding debian:ISRG_Root_X1.pem
2022-08-01T17:26:51.8379655Z Adding debian:GlobalSign_Root_E46.pem
2022-08-01T17:26:51.8405447Z Adding debian:Baltimore_CyberTrust_Root.pem
2022-08-01T17:26:51.8439562Z Adding debian:Certum_EC-384_CA.pem
2022-08-01T17:26:51.8530739Z Adding debian:Secure_Global_CA.pem
2022-08-01T17:26:51.8565095Z Adding debian:QuoVadis_Root_CA_3_G3.pem
2022-08-01T17:26:51.8604923Z Adding debian:emSign_Root_CA_-_C1.pem
2022-08-01T17:26:51.8633707Z Adding debian:GlobalSign_Root_R46.pem
2022-08-01T17:26:51.8664999Z Adding debian:emSign_ECC_Root_CA_-_C3.pem
2022-08-01T17:26:51.8680581Z Adding debian:Staat_der_Nederlanden_EV_Root_CA.pem

See GitHub Actions build periodic / linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck / build (9/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:43.8289425Z ##[error]The operation was canceled.
2022-08-01T18:02:06.9966830Z [ 98%] Building CXX object test_api/CMakeFiles/test_api.dir/tensor_options.cpp.o
2022-08-01T18:02:12.8743559Z [ 98%] Building CXX object test_api/CMakeFiles/test_api.dir/tensor.cpp.o
2022-08-01T18:02:13.5500808Z [ 98%] Building CXX object caffe2/torch/CMakeFiles/torch_python.dir/csrc/jit/codegen/cuda/python_frontend/python_bindings.cpp.o
2022-08-01T18:02:14.4892864Z [ 98%] Building CXX object test_api/CMakeFiles/test_api.dir/torch_include.cpp.o
2022-08-01T18:02:19.7820184Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_lite_interpreter_direct.cpp.o
2022-08-01T18:02:22.7363021Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_lite_trainer.cpp.o
2022-08-01T18:02:25.7111510Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_memory_dag.cpp.o
2022-08-01T18:02:31.3202373Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_misc.cpp.o
2022-08-01T18:02:33.2794758Z [ 98%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_mobile_type_parser.cpp.o
2022-08-01T18:02:43.0368234Z [ 98%] Building CXX object test_api/CMakeFiles/test_api.dir/inference_mode.cpp.o
2022-08-01T18:02:43.8289425Z ##[error]The operation was canceled.
2022-08-01T18:02:43.8336231Z Prepare all required actions
2022-08-01T18:02:43.8363387Z ##[group]Run ./.github/actions/teardown-linux
2022-08-01T18:02:43.8363688Z with:
2022-08-01T18:02:43.8363947Z ##[endgroup]
2022-08-01T18:02:43.8383721Z ##[group]Run .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:43.8384119Z .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:43.8406462Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0}
2022-08-01T18:02:43.8406815Z ##[endgroup]
2022-08-01T18:02:43.8515719Z Holding runner for 2 hours until all ssh sessions have logged out
2022-08-01T18:02:43.8626230Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty

See GitHub Actions build periodic / ios-12-5-1-arm64-custom-ops / build (10/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:44.3069570Z ##[error]The operation was canceled.
2022-08-01T18:02:06.6658050Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/ForeachOpsKernels.cpp.o
2022-08-01T18:02:10.6747140Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/FractionalMaxPool2d.cpp.o
2022-08-01T18:02:17.7897280Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/FractionalMaxPool3d.cpp.o
2022-08-01T18:02:18.5268080Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/FunctionOfAMatrixUtils.cpp.o
2022-08-01T18:02:23.5737450Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/GatedLinearUnit.cpp.o
2022-08-01T18:02:24.1263990Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/GridSampler.cpp.o
2022-08-01T18:02:31.3596150Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/Histogram.cpp.o
2022-08-01T18:02:34.2092610Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/Im2Col.cpp.o
2022-08-01T18:02:38.2179290Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/IndexingUtils.cpp.o
2022-08-01T18:02:42.8332900Z [ 94%] Building CXX object caffe2/CMakeFiles/torch_cpu.dir/__/aten/src/ATen/native/Integration.cpp.o
2022-08-01T18:02:44.3069570Z ##[error]The operation was canceled.
2022-08-01T18:02:44.3189860Z Post job cleanup.
2022-08-01T18:02:44.3254360Z Post job cleanup.
2022-08-01T18:02:44.6548510Z [command]/usr/local/bin/git version
2022-08-01T18:02:44.8627090Z git version 2.37.1
2022-08-01T18:02:44.9030040Z Copying '/Users/runner/.gitconfig' to '/Users/runner/work/_temp/81a939d1-d2b2-4881-a68a-04ab6aba7206/.gitconfig'
2022-08-01T18:02:44.9132130Z Temporarily overriding HOME='/Users/runner/work/_temp/81a939d1-d2b2-4881-a68a-04ab6aba7206' before making global git config changes
2022-08-01T18:02:44.9232220Z Adding repository directory to the temporary git global config as a safe directory
2022-08-01T18:02:44.9300400Z [command]/usr/local/bin/git config --global --add safe.directory /Users/runner/work/pytorch/pytorch
2022-08-01T18:02:44.9383280Z [command]/usr/local/bin/git config --local --name-only --get-regexp core\.sshCommand
2022-08-01T18:02:44.9485460Z [command]/usr/local/bin/git submodule foreach --recursive git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :

See GitHub Actions build periodic / linux-focal-rocm5.2-py3.7-distributed / build (11/11)

Step: "Unknown" (full log | diagnosis details)

2022-08-01T18:02:44.8446715Z ##[error]The operation was canceled.
2022-08-01T18:02:42.5384723Z cc1plus: warning: unrecognized command line option ‘-Wno-inconsistent-missing-override’
2022-08-01T18:02:42.5385216Z cc1plus: warning: unrecognized command line option ‘-Wno-macro-redefined’
2022-08-01T18:02:42.5443443Z [ 97%] Building CXX object test_api/CMakeFiles/test_api.dir/tensor_indexing.cpp.o
2022-08-01T18:02:44.2763235Z cc1plus: warning: command line option ‘-Wno-duplicate-decl-specifier’ is valid for C/ObjC but not for C++
2022-08-01T18:02:44.2764299Z cc1plus: warning: unrecognized command line option ‘-Wno-implicit-int-float-conversion’
2022-08-01T18:02:44.2765264Z cc1plus: warning: unrecognized command line option ‘-Wno-unused-command-line-argument’
2022-08-01T18:02:44.2766105Z cc1plus: warning: unrecognized command line option ‘-Wno-exceptions’
2022-08-01T18:02:44.2766907Z cc1plus: warning: unrecognized command line option ‘-Wno-inconsistent-missing-override’
2022-08-01T18:02:44.2767706Z cc1plus: warning: unrecognized command line option ‘-Wno-macro-redefined’
2022-08-01T18:02:44.2832292Z [ 97%] Building CXX object test_jit/CMakeFiles/test_jit.dir/test_interpreter.cpp.o
2022-08-01T18:02:44.8446715Z ##[error]The operation was canceled.
2022-08-01T18:02:44.8485995Z Prepare all required actions
2022-08-01T18:02:44.8514196Z ##[group]Run ./.github/actions/teardown-linux
2022-08-01T18:02:44.8514527Z with:
2022-08-01T18:02:44.8514768Z ##[endgroup]
2022-08-01T18:02:44.8535168Z ##[group]Run .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:44.8535573Z .github/scripts/wait_for_ssh_to_drain.sh
2022-08-01T18:02:44.8559642Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0}
2022-08-01T18:02:44.8560017Z ##[endgroup]
2022-08-01T18:02:44.8665428Z Holding runner for 2 hours until all ssh sessions have logged out
2022-08-01T18:02:44.8777178Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.


@facebook-github-bot facebook-github-bot added the "oncall: distributed" label Jul 29, 2022
wanchaol added a commit that referenced this pull request Jul 29, 2022
as titled

ghstack-source-id: c924880
Pull Request resolved: #82508
@wanchaol wanchaol requested review from fduwjj, kumpera and yhcharles July 29, 2022 22:33
@wanchaol
Collaborator Author

@wanchaol has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Contributor

@fduwjj fduwjj left a comment

lgtm

@wanchaol wanchaol added the "ciflow/periodic" label Jul 29, 2022
wanchaol added a commit that referenced this pull request Aug 1, 2022
Pull Request resolved: #82508


as titled

Differential Revision: [D38290442](https://our.internmc.facebook.com/intern/diff/D38290442/)
ghstack-source-id: 163154614
@facebook-github-bot
Contributor

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

@pytorchmergebot
Collaborator

@pytorchbot successfully started a merge job. Check the current status here

@github-actions
Contributor

github-actions bot commented Aug 1, 2022

Hey @wanchaol.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

facebook-github-bot pushed a commit that referenced this pull request Aug 1, 2022
Summary:
Pull Request resolved: #82508

as titled

Test Plan: Imported from OSS

Reviewed By: fduwjj

Differential Revision: D38290442

Pulled By: wanchaol

fbshipit-source-id: 9938705b9e5e6f14d36e942e6379ab9cbc2540c3
@facebook-github-bot facebook-github-bot deleted the gh/wanchaol/226/head branch August 5, 2022 14:20

Labels

  • ciflow/periodic: Trigger jobs ran periodically on master (periodic.yml) on the PR
  • cla signed
  • Merged
  • oncall: distributed: Add this issue/PR to distributed oncall triage queue

4 participants