Fix naive model with last strategy for cases with trailing NaN values by Flix6x · Pull Request #1130 · sktime/sktime

Flix6x · 2021-07-12T20:39:04Z

Reference Issues/PRs

Fixes #918.

What does this implement/fix? Explain your changes.

The logic for the "last" strategy is now much more similar to the logic for the "mean" strategy. I refactored some of the logic to internal util functions so that it's clear that both strategies transform the last_window according to the seasonal periodicity in the same way, and also tile the results in the same way. The only difference is now that the "last" strategy selects the last non-NaN value from each seasonally periodic row, while the "mean" strategy computes the mean of said row.

Just to note, to me it would make sense if the "drift" strategy would use the same util functions to:

make it robust against NaN values, both at the end of the time series and at the start, and
let it account for seasonal periodicity.
I think this would be a separate issue, but just to be clear: I am not considering working on it, unless someone points me to a use case for this particular strategy.

Does your contribution introduce a new dependency? If yes, which one?

No.

What should a reviewer concentrate their feedback on?

Any other comments?

PR checklist

For all contributions

I've added myself to the list of contributors.
Optionally, I've updated sktime's CODEOWNERS to receive notifications about future changes to these files.
I've added unit tests and made sure they pass locally.

For new estimators

I've added the estimator to the online documentation.
I've updated the existing example notebooks or provided a new one to showcase how my estimator works.

…sp logic of mean strategy

fkiraly

To me, this looks very sensible! Thanks for the change.
I think this makes the method a lot more robust.

Now, I have to admit, I wanted to review this earlier, but I had to take some time for this since the docstrings aren't too great - otherwise I would have been done with this much earlier.

May I hence kindly ask:

in the class docstring, explain better what is meant with "last window"
in the class docstring, explain how nans are now handled
add docstrings to the new and old utility functions, you will do future developers a big favour...
add docstrings to the new tests - what does this tests for, under which conditions are what errors raised?

Another point: should the tag "handles-missing-values" now be switched to True? The default is False if it's not set.

fkiraly · 2021-08-21T09:34:50Z

@Flix6x, are you still working on this?

Flix6x · 2021-08-23T14:01:08Z

@Flix6x, are you still working on this?

Little busy, but I plan to get back to this on Wednesday evening or Monday evening. Your documentation requests seem reasonable. I'd need to find out how I could set that tag, though. Is that on a class level or strategy level? If it's on a class level, I don't think the drift strategy is robust against missing values, so then it should still be False, but maybe made explicit and with a corresponding # todo: switch to True if GH<number of issue that addresses robustness against nan values for the "drift" strategy> is fixed?

fkiraly · 2021-08-31T15:52:38Z

If it's on a class level, I don't think the drift strategy is robust against missing values, so then it should still be False

on the class level, it should be the "worst possible value". If you want, you can do it on the object/strategy level by using set_tags.

…comment

Flix6x · 2021-09-01T12:21:59Z

If you want, you can do it on the object/strategy level by using set_tags.

Thanks for the hint. I did just that.

add docstrings to the new tests - what does this tests for, under which conditions are what errors raised?

I added slightly more detail to the docstring and an inline comment. It's already pretty detailed compared to other tests.

sktime/forecasting/naive.py

sktime/forecasting/tests/test_naive.py

fkiraly

looks good from my side, I now understand what this is doing - I added minor comments, neither a blocker.

fkiraly · 2021-09-01T12:48:10Z

but, obviously, you also need to fix:

doc quality checks
should be up-to-date with main

(can't be merged without this)

mloning

Looks all good to me - can we merge @fkiraly?

fkiraly

yes, all fine - doc checks pass now.

mloning · 2021-09-09T18:55:34Z

Thanks @Flix6x and sorry again for the very long delay on this one - now it's merged 🎉

Flix6x added 2 commits July 12, 2021 22:06

BUG GH918: Fix by selecting most recent non-NaN value and by reusing …

c3f42ef

…sp logic of mean strategy

TST GH918: Update test

217d879

Flix6x requested a review from mloning as a code owner July 12, 2021 20:39

Flix6x and others added 2 commits July 14, 2021 14:21

CLN flake8

bf32287

Merge branch 'main' into fix-GH918

291109c

fkiraly requested review from aiwalter and fkiraly July 27, 2021 20:45

fkiraly added the module:forecasting forecasting module: forecasting, incl probabilistic and hierarchical forecasting label Jul 31, 2021

fkiraly requested changes Aug 1, 2021

View reviewed changes

Merge remote-tracking branch 'turing-origin/main' into fix-GH918

4bf5082

Flix6x mentioned this pull request Aug 31, 2021

[ENH] Make naive model with drift strategy robust against NaN values and account for seasonal periodicity #1367

Open

Flix6x added 6 commits August 31, 2021 11:22

DOC GH918: better explanation of last window and NaN handling

c70ab95

DOC GH918: add author

ae400c2

DOC GH918: add code owner

4aa2b7d

DOC GH918: add docstring to internal util functions

9dc4292

DOC GH918: add class tag for handles-missing-data + todo

b8f64ba

DOC GH918: fix in-line comments

79f1771

Flix6x added 3 commits September 1, 2021 14:06

DOC GH918: override handles-missing-data tag depending on strategy

2a5880c

TST GH918: check handles-missing-data tag

4a9cbec

DOC GH918: add author to test, expand test docstring and add in-line …

47323d3

…comment

Flix6x requested a review from fkiraly September 1, 2021 12:22

fkiraly reviewed Sep 1, 2021

View reviewed changes

sktime/forecasting/naive.py Outdated Show resolved Hide resolved

fkiraly reviewed Sep 1, 2021

View reviewed changes

sktime/forecasting/naive.py Show resolved Hide resolved

fkiraly reviewed Sep 1, 2021

View reviewed changes

sktime/forecasting/tests/test_naive.py Outdated Show resolved Hide resolved

fkiraly previously approved these changes Sep 1, 2021

View reviewed changes

Flix6x added 3 commits September 1, 2021 15:54

DOC GH918: fix doc-quality

1d863b7

DOC GH918: switch author names to GitHub IDs

150cfe2

Merge remote-tracking branch 'turing-origin/main' into fix-GH918

eaa6eeb

Flix6x dismissed fkiraly’s stale review via eaa6eeb September 1, 2021 13:58

Flix6x added 4 commits September 6, 2021 09:56

DOC GH918: fix doc-quality

2c5e7e3

Merge remote-tracking branch 'turing-origin/main' into fix-GH918

1936ee7

DOC GH918: fix doc-quality

2fa8c16

DOC GH918: fix code-quality

713b439

Flix6x requested a review from fkiraly September 6, 2021 10:23

mloning approved these changes Sep 8, 2021

View reviewed changes

fkiraly approved these changes Sep 9, 2021

View reviewed changes

Flix6x deleted the fix-GH918 branch September 10, 2021 09:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix naive model with last strategy for cases with trailing NaN values#1130

Fix naive model with last strategy for cases with trailing NaN values#1130
mloning merged 21 commits intosktime:mainfrom
SeitaBV:fix-GH918

Flix6x commented Jul 12, 2021 •

edited

Loading

Uh oh!

fkiraly left a comment •

edited

Loading

Uh oh!

fkiraly commented Aug 21, 2021

Uh oh!

Flix6x commented Aug 23, 2021

Uh oh!

fkiraly commented Aug 31, 2021

Uh oh!

Flix6x commented Sep 1, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fkiraly left a comment

Uh oh!

fkiraly commented Sep 1, 2021

Uh oh!

mloning left a comment

Uh oh!

fkiraly left a comment

Uh oh!

mloning commented Sep 9, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

Flix6x commented Jul 12, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

What should a reviewer concentrate their feedback on?

Any other comments?

PR checklist

For all contributions

For new estimators

Uh oh!

fkiraly left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fkiraly commented Aug 21, 2021

Uh oh!

Flix6x commented Aug 23, 2021

Uh oh!

fkiraly commented Aug 31, 2021

Uh oh!

Flix6x commented Sep 1, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

fkiraly commented Sep 1, 2021

Uh oh!

mloning left a comment

Choose a reason for hiding this comment

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

mloning commented Sep 9, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Flix6x commented Jul 12, 2021 •

edited

Loading

fkiraly left a comment •

edited

Loading