Document behavior of min/max/nanmin/nanmax for empty inputs #3287

SimonHeybrock · 2023-10-12T13:04:20Z

jokasimr

Scipp returns DBL_MAX for empty inputs, while NumPy returns NaN. For integer inputs, Scipp returns INT_MAX, while NumPy raises.

It is not really correct to say that Numpy returns Nan when the input is empty, or at least it is a bit unclear. Both of the below examples raises ValueError: zero-size array to reduction operation minimum which has no identity independent of the data-type.

np.min(np.ones(0, dtype=np.int64))
np.min(np.ones(0, dtype=np.float64))

However, Numpy.min does returns nan if the input contains nan.

As a side note that probably should be the topic of another issue: When looking into this I noticed that sc.nanmin and sc.min does the same thing

# Both return -1
sc.array(dims=['x'], values=[1, float('nan'), -1]).min()
sc.array(dims=['x'], values=[1, float('nan'), -1]).nanmin()

while

# Only nanmin returns -1
np.min([1, float('nan'), -1])
np.nanmin([1, float('nan'), -1])

I think the reason is that in the implementation of sc.min we use std::min(a, b) equivalent to (b < a) ? b : a; and if b=nan we get a.

SimonHeybrock · 2023-10-16T04:02:30Z

Hmm, and from numpy.ma.masked_array one gets a special value (of type numpy.ma.core.MaskedConstant):

import numpy as np
import numpy.ma as ma

ma.masked_array(np.ones(1), mask=[True]).min()  # masked

jl-wynen · 2023-10-19T10:58:22Z

src/scipp/core/reduction.py

-    inputs, Scipp returns INT_MAX, while NumPy raises. Note that in the case of
-    :py:class:`DataArray`, inputs can also be "empty" if all elements contributing
-    to an output element are masked.
+    Scipp returns DBL_MAX and INT_MAX for empty inputs of float or int dtype,


Suggested change

Scipp returns DBL_MAX and INT_MAX for empty inputs of float or int dtype,

Scipp returns DBL_MAX or INT_MAX for empty inputs of float or int dtype,

And in the other docstrings, too.

jl-wynen · 2023-10-19T10:58:30Z

src/scipp/core/reduction.py

-    :py:class:`DataArray`, inputs can also be "empty" if all elements contributing
-    to an output element are masked.
+    Scipp returns DBL_MAX and INT_MAX for empty inputs of float or int dtype,
+    respectively, while NumPy rases. Note that in the case of :py:class:`DataArray`,


Suggested change

respectively, while NumPy rases. Note that in the case of :py:class:`DataArray`,

respectively, while NumPy raises. Note that in the case of :py:class:`DataArray`,

Document behavior of min/max/nanmin/nanmax for empty inputs

05e73e5

SimonHeybrock requested a review from jokasimr October 12, 2023 13:04

jl-wynen approved these changes Oct 12, 2023

View reviewed changes

jokasimr reviewed Oct 12, 2023

View reviewed changes

jl-wynen mentioned this pull request Oct 12, 2023

min ignores nan #3288

Closed

NumPy raises.

7af391a

SimonHeybrock requested a review from jokasimr October 19, 2023 10:23

jl-wynen reviewed Oct 19, 2023

View reviewed changes

Spelling

6d81c90

jl-wynen approved these changes Oct 19, 2023

View reviewed changes

SimonHeybrock enabled auto-merge October 19, 2023 11:24

jokasimr approved these changes Oct 19, 2023

View reviewed changes

SimonHeybrock disabled auto-merge October 19, 2023 11:55

SimonHeybrock merged commit 2cedc60 into main Oct 19, 2023

SimonHeybrock deleted the docs-reduction-ops branch October 19, 2023 11:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Document behavior of min/max/nanmin/nanmax for empty inputs #3287

Document behavior of min/max/nanmin/nanmax for empty inputs #3287

Uh oh!

SimonHeybrock commented Oct 12, 2023

Uh oh!

jokasimr left a comment •

edited

Loading

Uh oh!

SimonHeybrock commented Oct 16, 2023

Uh oh!

jl-wynen Oct 19, 2023

Uh oh!

jl-wynen Oct 19, 2023

Uh oh!

jl-wynen Oct 19, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	Scipp returns DBL_MAX and INT_MAX for empty inputs of float or int dtype,
	Scipp returns DBL_MAX or INT_MAX for empty inputs of float or int dtype,

	respectively, while NumPy rases. Note that in the case of :py:class:`DataArray`,
	respectively, while NumPy raises. Note that in the case of :py:class:`DataArray`,

Document behavior of min/max/nanmin/nanmax for empty inputs #3287

Document behavior of min/max/nanmin/nanmax for empty inputs #3287

Uh oh!

Conversation

SimonHeybrock commented Oct 12, 2023

Uh oh!

jokasimr left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SimonHeybrock commented Oct 16, 2023

Uh oh!

jl-wynen Oct 19, 2023

Choose a reason for hiding this comment

Uh oh!

jl-wynen Oct 19, 2023

Choose a reason for hiding this comment

Uh oh!

jl-wynen Oct 19, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jokasimr left a comment •

edited

Loading