Propose ADR-0018: Add `dim` argument to `bin` and `hist` #3498

SimonHeybrock · 2024-07-12T07:07:49Z

Proposal as discussed on Slack. Please comment and/or approve. See also #3435.

docs/development/adr/0018-bin-hist-reduction-dims.md

Co-authored-by: Sunyoung Yoo <luysunyoung9@gmail.com>

docs/development/adr/0018-bin-hist-reduction-dims.md

nvaytet · 2024-07-30T11:26:26Z

docs/development/adr/0018-bin-hist-reduction-dims.md

+
+### Proposed solution
+
+1. Add a `dim` argument to all `bin` and `hist` functions to specify the dimension(s) to be replaced.


I am not sure the name dim is explicit enough in what it is doing.
I don't know if we can pick another name like remove or something better?

I can see that in our reduction operations, using the argument dim often means that this dimension will disappear in the output, but we do have some other operations like concat where the dim will appear in the output.

I think dim is best, since I feel it is consistent. concat is probably the only case where it is used for something else? But note that sc.concat and da.bins.concat both use dim for different things, for a function with the same name!

I still think that da.hist(z=50, dim='y') is not so clear what it is doing.
In a reduction operation, I think it is clearer, as it is like the numpy axis argument: da.min(dim='y') is pretty clear that we apply the min along the y dim.

In make_binned we have erase as an argument. Would erase be clearer here?

That said, if we expect this to be used only by 'advanced' users, maybe the naming is not crucial?

Have a look at NumPy, or the Array API standard. Arguments like this are always named axis. In Scipp we are (hopefully?) consistently naming them dim. I have always imagined that this would be least confusing, since no matter what function you use it is always dim, with no need to read the documentation (provided that you know or guess the function has such an argument).

nvaytet · 2024-07-30T11:27:50Z

docs/development/adr/0018-bin-hist-reduction-dims.md

+    arg_dict: dict[str, int | Variable] | None = None,
+    /,
+    *,
+    dim: tuple[str, ...] | str | None = None,


Are we ok with the name clash? If someone needs to have the dim dim in their output, they can use the dict syntax, but it is weird to have such a speacial case. Is it uncommon enough to allow this?

Personally I have never used a dimension named dim. I think the regular use where a keyword-arg is useful outnumber these hypothetical clashes (which can simply use a dict) 100:1. I believe we should make the 99% convenient and not worry about the 1% (given that there is a really simple alternative syntax).

docs/development/adr/0018-bin-hist-reduction-dims.md

SimonHeybrock added 5 commits July 12, 2024 06:45

Begin op table

de52318

Binned data table

58ea944

Notes

cb8ee0b

Add alternative table for binned data behavior

1a83fc0

Finalize ADR proposal

282df57

SimonHeybrock marked this pull request as ready for review July 12, 2024 11:10

Merge branch 'main' into adr-hist-bin-api

6a1292b

SimonHeybrock mentioned this pull request Jul 12, 2024

Is the mechanism for controlling whether hist replaces dimensions too indirect? #3435

Closed

YooSunYoung approved these changes Jul 18, 2024

View reviewed changes

docs/development/adr/0018-bin-hist-reduction-dims.md Outdated Show resolved Hide resolved

Update docs/development/adr/0018-bin-hist-reduction-dims.md

905c470

Co-authored-by: Sunyoung Yoo <luysunyoung9@gmail.com>

SimonHeybrock commented Jul 18, 2024

View reviewed changes

docs/development/adr/0018-bin-hist-reduction-dims.md Outdated Show resolved Hide resolved

Update docs/development/adr/0018-bin-hist-reduction-dims.md

459a996

nvaytet reviewed Jul 30, 2024

View reviewed changes

nvaytet approved these changes Aug 2, 2024

View reviewed changes

SimonHeybrock commented Aug 2, 2024

View reviewed changes

docs/development/adr/0018-bin-hist-reduction-dims.md Outdated Show resolved Hide resolved

Update docs/development/adr/0018-bin-hist-reduction-dims.md

8bedbd7

SimonHeybrock merged commit c169fcb into main Aug 2, 2024

SimonHeybrock deleted the adr-hist-bin-api branch August 2, 2024 08:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Propose ADR-0018: Add `dim` argument to `bin` and `hist` #3498

Propose ADR-0018: Add `dim` argument to `bin` and `hist` #3498

Uh oh!

SimonHeybrock commented Jul 12, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

nvaytet Jul 30, 2024

Uh oh!

SimonHeybrock Jul 30, 2024

Uh oh!

nvaytet Aug 2, 2024

Uh oh!

SimonHeybrock Aug 2, 2024

Uh oh!

nvaytet Jul 30, 2024

Uh oh!

SimonHeybrock Jul 30, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		### Proposed solution

		1. Add a `dim` argument to all `bin` and `hist` functions to specify the dimension(s) to be replaced.

Propose ADR-0018: Add dim argument to bin and hist #3498

Propose ADR-0018: Add dim argument to bin and hist #3498

Uh oh!

Conversation

SimonHeybrock commented Jul 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nvaytet Jul 30, 2024

Choose a reason for hiding this comment

Uh oh!

SimonHeybrock Jul 30, 2024

Choose a reason for hiding this comment

Uh oh!

nvaytet Aug 2, 2024

Choose a reason for hiding this comment

Uh oh!

SimonHeybrock Aug 2, 2024

Choose a reason for hiding this comment

Uh oh!

nvaytet Jul 30, 2024

Choose a reason for hiding this comment

Uh oh!

SimonHeybrock Jul 30, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Propose ADR-0018: Add `dim` argument to `bin` and `hist` #3498

Propose ADR-0018: Add `dim` argument to `bin` and `hist` #3498

SimonHeybrock commented Jul 12, 2024 •

edited

Loading