Adds dim argument to torch.unique #10423
Conversation
facebook-github-bot
left a comment
colesbury has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
facebook-github-bot
left a comment
soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: Initial version of `unique` supporting a `dim` argument. As discussed in [this issue](pytorch/pytorch#9997), I added the `dim` argument to `torch.unique` with the same behavior as [numpy](https://docs.scipy.org/doc/numpy-1.14.0/reference/generated/numpy.unique.html). Since the implementation is based on `std/thrust::unique`, the `tensor` always needs to be sorted, so the `sorted` argument of `torch.unique` has no effect, just as in the CUDA version of the plain `torch.unique`. To check the performance and equal behavior of `torch.unique` against `np.unique`, I've used [this gist](https://gist.github.com/ptrblck/ac0dc862f4e1766f0e1036c252cdb105). Currently we achieve the following timings for an input of `x = torch.randint(2, (1000, 1000))` (each value is the average of the times over both dimensions):

| Device | PyTorch (return_inverse=False) | Numpy (return_inverse=False) | PyTorch (return_inverse=True) | Numpy (return_inverse=True) |
| --- | --- | --- | --- | --- |
| CPU | ~0.007331s | ~0.022452s | ~0.011139s | ~0.044800s |
| GPU | ~0.006154s | - | ~0.105373s | - |

Many thanks to colesbury for the awesome mentoring and the valuable advice on the general implementation and performance issues!

Pull Request resolved: pytorch/pytorch#10423

Differential Revision: D9517289

Pulled By: soumith

fbshipit-source-id: a4754f805223589c2847c98b8e4e39d8c3ddb7b5
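As a sketch of the behavior this PR adds (tensor values here are illustrative, not from the PR): with `dim=0`, `unique` deduplicates whole subtensors along that dimension, i.e. unique rows for a 2-D tensor, matching `np.unique(..., axis=0)`. Because the implementation sorts, the unique slices come back in sorted order, and `return_inverse` maps each input slice to its position in the output.

```python
import torch

# A tensor with a duplicated row (row 0 == row 2).
x = torch.tensor([[1, 2],
                  [3, 4],
                  [1, 2]])

# Unique rows along dim 0; the result is sorted because the
# implementation relies on std/thrust::unique over sorted data.
rows = torch.unique(x, dim=0)
print(rows)  # tensor([[1, 2], [3, 4]])

# return_inverse gives, for each input row, its index into the
# unique rows, so rows[inverse] reconstructs the original tensor.
rows, inverse = torch.unique(x, dim=0, return_inverse=True)
print(inverse)  # tensor([0, 1, 0])
```

Note that `rows[inverse]` reproduces `x` exactly, which is the same round-trip guarantee `np.unique` offers via its inverse indices.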