Conversation

@eendebakpt
Contributor

@eendebakpt eendebakpt commented May 31, 2025

Fixes #29003

The approach taken to address the issue is to perform the index calculations with increased precision, while keeping the behaviour that for an arr of dtype float16 the output of np.quantile(arr, q) has the same dtype.
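To illustrate the overflow (a minimal sketch, not code from the PR; 65504 is float16's largest finite value):

```python
import numpy as np

# float16 cannot represent values above 65504, so computing the virtual
# index (n - 1) * q directly in float16 overflows for large arrays.
n = 100_000
q = np.float16(0.5)
print(np.float16(n - 1) * q)  # inf: the index calculation overflows
print(np.float64(n - 1) * q)  # 49999.5: fine with increased precision
```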

Preserving the input type makes sense for float input, but it cannot hold for integer input: e.g. we want np.quantile([1, 2], 0.5, method='linear') to output 1.5. For that reason it might also be acceptable to simply upcast the type to float64.
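A small illustration of the intended dtype behaviour (a sketch, assuming the behaviour after this PR):

```python
import numpy as np

a16 = np.arange(10, dtype=np.float16)
print(np.quantile(a16, 0.5).dtype)  # float16: float input dtype is preserved
print(np.quantile([1, 2], 0.5))     # 1.5 as float64: integer input must upcast
```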

There are also some corner cases, for example input (of either the array or the quantile) of type Fraction; the output type could then be either Fraction or float64.
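A hypothetical sketch of that corner case (the names and the choice of q are illustrative only):

```python
from fractions import Fraction
import numpy as np

# Exact rationals end up in an object array; whether the result is a
# Fraction or a float64 is exactly the open question above.
arr = np.array([Fraction(0), Fraction(1)], dtype=object)
result = np.quantile(arr, 0.5)  # the quantile q could itself be a Fraction
print(result, type(result))
```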

@eendebakpt eendebakpt force-pushed the percentile_float16 branch from 2f15050 to 74a93b4 Compare May 31, 2025 22:46
@eendebakpt eendebakpt force-pushed the percentile_float16 branch from 74a93b4 to 6766c8d Compare June 1, 2025 18:06
@eendebakpt eendebakpt changed the title Draft: BUG: Allow np.percentile to operate on float16 data BUG: Allow np.percentile to operate on float16 data Jun 1, 2025
@charris
Member

charris commented Jun 3, 2025

Could use a release note.

@eendebakpt
Contributor Author

> Could use a release note.

I agree. Not sure how to phrase it though, suggestions are welcome.

@ericpre

ericpre commented Nov 8, 2025

We hit the issue that this PR fixes in hyperspy/hyperspy#3560. Could you let us know the status of this PR, and when we can expect the fix to be included in a release? Thank you! :)

@eendebakpt eendebakpt requested a review from mhvk November 8, 2025 11:38
@charris charris added this to the 2.4.0 release milestone Nov 8, 2025
Contributor

@mhvk mhvk left a comment

@eendebakpt - this is not really my cup of tea, but I certainly like that quite a few of the work-arounds could be removed! Still, there is a larger question about ensuring that all methods will now work with large float16 arrays.

Plus nitpicks about the tests... For the tests, perhaps add some comments about why they are being done. They do fail on current main, I presume?
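As a sketch of the kind of regression check being discussed (hypothetical, not the PR's actual test; the failure on unfixed main is per the linked issue report):

```python
import numpy as np

# A float16 array long enough that the virtual index (n - 1) * q
# would overflow if computed in float16 (n > 65504).
arr = np.ones(70_000, dtype=np.float16)
result = np.percentile(arr, 50)
assert np.isfinite(result)           # fails on pre-fix main, per the report
assert result.dtype == np.float16    # the input dtype is preserved
```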


def test_percentile_gh_29003_Fraction(self):
    zero = Fraction(0)
    one = Fraction(0)
Contributor

Here again I assume it was meant to be Fraction(1) -- though perhaps something like nonzero = Fraction(22, 7) would make more sense?

    # They are mathematically equivalent.
    'linear': {
-       'get_virtual_index': lambda n, quantiles: (n - 1) * quantiles,
+       'get_virtual_index': lambda n, quantiles: (n - np.int64(1)) * quantiles,
Contributor

This one worries me, as it feels indirect: is this effectively to upcast float16/32 to float64? I can see how that solves the original issue, where what is presumably a float16 would be multiplied with a large n.

But if that is the reason, should something similar not happen for all the other methods too? I think a solution that would cover them all is to ensure that n is an int64 when get_virtual_index is called.
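For reference, a sketch of the promotion at play under NumPy 2.x (NEP 50) rules; this is my reading of why the np.int64(1) helps, using plain NumPy behaviour rather than code from the PR:

```python
import numpy as np

# An int64 operand promotes float16/float32 quantiles up to float64,
# because int64 values do not fit in the smaller float types.
print(np.result_type(np.int64, np.float16))  # float64
print(np.result_type(np.int64, np.float32))  # float64

q = np.float16(0.5)
print(((np.int64(100_000) - np.int64(1)) * q).dtype)  # float64
```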

Contributor Author

This indeed looks a bit odd. I do not recall exactly how I ended up here, but the case for method linear is not needed so I removed it.

Contributor

@mhvk mhvk left a comment

OK, that actually is even better than before: it removed many work-arounds and introduced only one place where the dtype is explicitly used, nice!

I had a comment on the changelog entry, but really, probably not worth running CI for again! (if you do, add [skip actions] [skip azp] [skip cirrus] to the commit message).

@@ -0,0 +1 @@
+* The accuracy of ``np.quantile`` and ``np.percentile`` for 16- and 32-bit floating point input data has been improved.
\ No newline at end of file
Contributor

Would it be correct to say "by casting intermediate results to 64 bit"? (If so, I would add that!)

@mhvk
Contributor

mhvk commented Nov 9, 2025

Actually, let's just get it in, thanks @eendebakpt!

@mhvk mhvk merged commit 884aec9 into numpy:main Nov 9, 2025
77 checks passed
@ericpre

ericpre commented Nov 9, 2025

Thank you @eendebakpt and @mhvk! :)

cakedev0 pushed a commit to cakedev0/numpy that referenced this pull request Dec 5, 2025
* BUG: Allow np.percentile to operate on float16 data

* add an extra regression test

* add an extra regression test

* remove unused default value

* add release note

* review comments: part1

* review comments: part 2

* review comments: part 3
IndifferentArea pushed a commit to IndifferentArea/numpy that referenced this pull request Dec 7, 2025
@seberg
Member

seberg commented Jan 6, 2026

Hmmm, CuPy noticed this as a change in result dtype (my understanding). I have to investigate, but I suspect there may be an issue here. The intention is (I guess) to use default precision for the q computation but later use the weak one, but this now disregards the precision of q itself in the final result.
I.e. the old one effectively used result_type(q.dtype, arr.dtype, 0.0) (via true-divide resolution, though). The code here just uses arr.dtype.
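A sketch of the dtype discrepancy being described (my reading of the comment, using plain promotion rules):

```python
import numpy as np

arr = np.arange(10, dtype=np.float16)
q = np.float64(0.5)
# Old effective rule, via true-divide resolution: q's dtype participates.
print(np.result_type(q.dtype, arr.dtype, 0.0))  # float64
# New behaviour: only the array's dtype determines the result.
print(arr.dtype)                                # float16
```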


Successfully merging this pull request may close these issues.

BUG: np.percentile fails with internal overflow when using float16 input on large arrays
