
Disable tests that use DataLoader with multiple workers for Windows#5322

Merged
soumith merged 1 commit into pytorch:master from yf225:num_workers
Feb 21, 2018

Conversation

Contributor

@yf225 yf225 commented Feb 21, 2018

It seems that DataLoader with multiple workers has been causing CUDA out-of-memory errors in the Windows CI tests (such as test_batch_sampler, test_multi_keep, and test_multi_drop). @ssnl is looking into this issue.

Added to #4092.
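A minimal sketch of how such tests can be gated on the platform. This is illustrative, not the actual diff in this PR: the decorator name and the test body are assumptions, and the real tests exercise DataLoader with num_workers > 0.

```python
import sys
import unittest

# Skip multi-worker DataLoader tests on Windows, where they were
# hitting CUDA out-of-memory errors in CI. The condition checks the
# platform at collection time, so the test still runs elsewhere.
skip_if_windows = unittest.skipIf(
    sys.platform == "win32",
    "DataLoader with num_workers > 0 hits CUDA OOM on Windows CI",
)

class TestDataLoaderWorkers(unittest.TestCase):
    @skip_if_windows
    def test_multi_keep(self):
        # Would construct a DataLoader with num_workers=4 and iterate;
        # elided here to keep the sketch self-contained.
        self.assertTrue(True)
```

On Windows the test is reported as skipped rather than failed, which keeps CI green while the underlying OOM issue is investigated.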

Collaborator

@ssnl ssnl left a comment


LGTM, thanks!

@soumith soumith merged commit 0340e46 into pytorch:master Feb 21, 2018
@yf225 yf225 deleted the num_workers branch February 22, 2018 22:27
AlexanderRadionov added a commit to AlexanderRadionov/pytorch that referenced this pull request Mar 6, 2018
ezyang pushed a commit that referenced this pull request Mar 23, 2018
Added an ind_worker_queue parameter to data.DataLoader. It makes preprocessing deterministic.

DataLoader in multiprocessing mode may cause non-deterministic results. Even if random_seed is frozen, each subprocess may receive tasks in an unstable order, because I/O times vary while data loads. If you use augmentation during data loading, this makes results unreproducible. See https://discuss.pytorch.org/t/deterministic-non-deterministic-results-with-pytorch/9087

To fix this issue I have added an individual queue for each worker, so each worker receives its tasks in a stable order. As a result, the subprocesses produce stable results.

To reproduce the issue, change ind_worker_queue to False and run the script several times.
Code to reproduce the issue is in the corresponding PR.

* TestIndividualWorkerQueue added to DataLoader tests

* Review fixes

* "Simplify" code by removing itertools

* Rebase conflicts fix

* Review fixes

* Fixed shutdown behavior

* Removed ind_worker_queue flag.

* Rebase on master

* Disable tests that use DataLoader with multiple workers (#5322)
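The per-worker-queue idea described above can be sketched in a few lines. This is a toy model, not the DataLoader internals: the function name and the use of threads instead of subprocesses are assumptions, and it only demonstrates the stable batch-to-worker mapping, not augmentation.

```python
import queue
import threading

def deterministic_fetch(dataset, batches, num_workers=2):
    """Assign batch i to worker i % num_workers via an individual
    queue per worker, so each worker sees its tasks in a fixed order
    regardless of how long individual loads take."""
    in_queues = [queue.Queue() for _ in range(num_workers)]
    results = {}
    lock = threading.Lock()

    def worker(wid):
        while True:
            task = in_queues[wid].get()
            if task is None:  # sentinel: no more work for this worker
                return
            idx, indices = task
            loaded = [dataset[i] for i in indices]
            with lock:
                results[idx] = loaded

    threads = [threading.Thread(target=worker, args=(w,))
               for w in range(num_workers)]
    for t in threads:
        t.start()
    # Round-robin assignment: the batch -> worker mapping never
    # depends on which worker happens to finish first.
    for i, batch in enumerate(batches):
        in_queues[i % num_workers].put((i, batch))
    for q in in_queues:
        q.put(None)
    for t in threads:
        t.join()
    return [results[i] for i in range(len(batches))]
```

With a single shared queue, a slow load lets a faster worker steal the next task, so the worker that processes a given batch (and hence that worker's RNG state during augmentation) varies between runs; the per-worker queues pin that mapping down.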
yf225 pushed a commit to yf225/pytorch that referenced this pull request Mar 29, 2018
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026

3 participants