running_median() with windowed data by rhettinger · Pull Request #1040 · more-itertools/more-itertools

rhettinger · 2025-08-01T19:25:25Z

Open questions:

Possibly rename the iterable parameter to data, consistent with statistics.median? Or keep as-is to match more_itertools conventions and to emphasize the lazy evaluation which is the principal use case for running_median()?
Happy with the O(n) but mostly fast steps to maintain a sorted window? Or add more complex O(log n) code (IndexableSkiplist, blist, SortedContainers, etc)? Based on Grant Jenks' notes, the current list insort/bisect/del technique can be expected to win for window sizes up to several thousand.

Solved questions:

Named the size parameter maxlen because it caps the size of the window and also allows smaller sizes, like the maxlen parameter for deque.
Let running_median start yielding values before the window is full. This is more convenient to use. Also, it invariant that the unwindowed case gives the same result as having a window larger than the input data.

more_itertools/recipes.py

rhettinger · 2025-08-02T16:49:49Z

For the record, the issue with Decimal context rounding is that double negation does not always round-trip:

>>> from decimal import Decimal
>>> from math import pi
>>> Decimal(pi)
Decimal('3.141592653589793115997963468544185161590576171875')
>>> - - Decimal(pi)
Decimal('3.141592653589793115997963469')
>>> Decimal(pi) == - - Decimal(pi)
False

So a value that goes into the negated lo heap may not exactly match that value that comes out. This only happens when the inputs have more precision than the current context precision. That arises when converting binary-floats to decimal-floats.

…l order.

bbayles · 2025-08-03T15:10:43Z

Possibly rename the iterable parameter to data, consistent with statistics.median? Or keep as-is to match more_itertools conventions and to emphasize the lazy evaluation which is the principal use case for running_median()?

+0 for as-is on this one.

Happy with the O(n) but mostly fast steps to maintain a sorted window? Or add more complex O(log n) code (IndexableSkiplist, blist, SortedContainers, etc)? Based on Grant Jenks' notes, the current list insort/bisect/del technique can be expected to win for window sizes up to several thousand.

+1 on the current implementation - it's reasonably clear what's going on, and the other options are probably overkill for this general purpose library.

rhettinger · 2025-08-03T18:34:47Z

Okay, I think we're good to go.

bbayles · 2025-08-03T19:26:09Z

This is great, thanks - can't wait to find a place to use it.

Draft of running_median() with windowed data

5c8336c

bbayles reviewed Aug 1, 2025

View reviewed changes

more_itertools/recipes.py Outdated Show resolved Hide resolved

rhettinger added 3 commits August 1, 2025 16:59

Various small improvments

f8ccab2

Consistent use of "maxlen" formal parameter

f559403

Remove speculative part of the comment block

fffcd7a

rhettinger changed the title ~~Draft of running_median() with windowed data~~ running_median() with windowed data Aug 2, 2025

rhettinger added 2 commits August 2, 2025 11:12

Fold _sorted_window into _running_median_windowed

5b79a63

Test expected relationships

4369939

rhettinger added 6 commits August 2, 2025 13:17

Adjust "no cover" pragmas

c083f73

Improve variable names. The "window" slides across the data in arriva…

be6c8d7

…l order.

Simplify with a conditional expression

aebf56c

Pretty is as pretty does

7646cd0

Test non-integer case

2f220f7

Test for window size 2

671e5f1

Depend on more_itertools pairwise() which is available for Py3.9

262fcaf

bbayles approved these changes Aug 3, 2025

View reviewed changes

bbayles merged commit aea2ffb into more-itertools:master Aug 3, 2025
6 checks passed

rhettinger deleted the windowed_median branch August 15, 2025 14:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

running_median() with windowed data#1040

running_median() with windowed data#1040
bbayles merged 13 commits intomore-itertools:masterfrom
rhettinger:windowed_median

rhettinger commented Aug 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

rhettinger commented Aug 2, 2025 •

edited

Loading

Uh oh!

bbayles commented Aug 3, 2025

Uh oh!

rhettinger commented Aug 3, 2025

Uh oh!

bbayles commented Aug 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rhettinger commented Aug 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

rhettinger commented Aug 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bbayles commented Aug 3, 2025

Uh oh!

rhettinger commented Aug 3, 2025

Uh oh!

bbayles commented Aug 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rhettinger commented Aug 1, 2025 •

edited

Loading

rhettinger commented Aug 2, 2025 •

edited

Loading