Use multiple processes when extracting ImageNet training archive

## 🚀 Feature

Use multiple processes when extracting `ImageNet` training archive.

## Motivation

I recently extracting the `ImageNet` training archive with the code of `torchvision` and was suprised how long it took. I realised that after extracting the main archive, we only extract the subarchives one after another:

https://github.com/pytorch/vision/blob/3c254fb7af5f8af252c24e89949c54a3461ff0be/torchvision/datasets/imagenet.py#L183-L184

## Pitch

I think we can speed that up significantly by using multiple processes to do this simultaneously.  IMO doing this would have no drawbacks. 

## Additional context

If we want this feature, I could take it up, albeit with a low priority.


cc @pmeier

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use multiple processes when extracting ImageNet training archive #2023

🚀 Feature

Motivation

Pitch

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	for archive in archives:
	extract_archive(archive, os.path.splitext(archive)[0], remove_finished=True)

Use multiple processes when extracting ImageNet training archive #2023

Description

🚀 Feature

Motivation

Pitch

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions