Implement wanted error handling for alerting and actions by mikecote · Pull Request #41917 · elastic/kibana

mikecote · 2019-07-24T18:16:40Z

This PR implements what is wanted for error handling in alerting and actions as outlined in #39349.

The following changes have been made:

Action types have a new maxAttempts attribute that is optional
Executions returning status of error will be able to recommend retry logic via retry attribute
Executors are no longer passed scheduledRunAt and previousScheduledRunAt. Those are replaced with startedAt and previousStartedAt.

elasticmachine · 2019-07-24T18:16:43Z

Pinging @elastic/kibana-stack-services

x-pack/legacy/plugins/actions/server/action_type_registry.test.ts

pmuellr · 2019-07-29T18:58:47Z

x-pack/legacy/plugins/actions/server/action_type_registry.ts

+            return error.retry == null ? true : error.retry;
+          }
+          // Retry other kinds of errors
+          return true;


I think we want to have this return false by default. Idea being that an action will know when something is retry-able - setting retry to true or a Date - and any other case, should not be retryable.

Yeah I guess by looking here https://github.com/elastic/kibana/blob/master/x-pack/legacy/plugins/actions/server/lib/execute.ts#L29-L74 any code that could throw here is already being handled and wrapped into a proper ExecutionError. Some may need to return retry: false ex validation?

I went ahead and made it false for other kinds of errors. I can't think of a reason to retry totally unhandled errors.

pmuellr

Made a few comments - just one in the review, a few "single" comments.

My only real concern is defaulting getRetry() to true in x-pack/legacy/plugins/actions/server/action_type_registry.ts; perhaps I'm not thinking about that right. And we could always tweak it later ...

pmuellr · 2019-07-29T19:04:39Z

x-pack/legacy/plugins/alerting/server/lib/get_create_task_runner_function.test.ts

  expect(runnerResult).toMatchInlineSnapshot(`
    Object {
-      "runAt": 2019-06-03T18:55:30.982Z,
+      "runAt": 1970-01-01T00:00:10.000Z,


Interesting the date is set to such an old epoch-y date. My only thought is that at some point in the future, if we ended up needing to calculate a previous date from this, somehow, the epoch millis might go negative. Which node should handle, but I wonder if some other part of the system might choke on negative numbers.

$ node -p 'new Date(-1)' 1969-12-31T23:59:59.999Z

OTOH, I guess would actually be a good test, so ... leave it in!?!?!?

Yeah this seems to be what happens when you use fake timers, everything becomes new Date(0) by default 🤔

…rmat is returned with status: error

bmcconaghy

Code LGTM, just wondering about defaulting to retrying.

x-pack/legacy/plugins/actions/server/action_type_registry.ts

…r-handling

elasticmachine · 2019-07-30T14:19:31Z

💚 Build Succeeded

continuous-integration/kibana-ci/pull-request

) * Implement wanted error handling * Cleanup * Add retry logic to actions * Leverage startedAt from task manager * Fix broken jest tests * Add missing unit test * Add unit tests for getRetry * Add test for rate limit * Remove fake timers * Don't retry errors by default for actions unless the proper result format is returned with status: error * Don't retry unless attribute specified * Fix tests

… (#42255) * Implement wanted error handling for alerting and actions (#41917) * Implement wanted error handling * Cleanup * Add retry logic to actions * Leverage startedAt from task manager * Fix broken jest tests * Add missing unit test * Add unit tests for getRetry * Add test for rate limit * Remove fake timers * Don't retry errors by default for actions unless the proper result format is returned with status: error * Don't retry unless attribute specified * Fix tests * Increase retry timeout to prevent flaky tests (#42291)

mikecote added Feature:Alerting v8.0.0 release_note:skip Skip the PR/issue when compiling release notes Team:Stack Services v7.4.0 labels Jul 24, 2019

mikecote self-assigned this Jul 24, 2019

This comment has been minimized.

Sign in to view

mikecote force-pushed the alerting/error-handling branch 3 times, most recently from 88012ae to d145f25 Compare July 26, 2019 16:47

This comment has been minimized.

Sign in to view

mikecote added 5 commits July 29, 2019 08:45

Implement wanted error handling

7eb2e1b

Cleanup

82a2f57

Add retry logic to actions

78cd509

Leverage startedAt from task manager

0459929

Fix broken jest tests

13ae86a

mikecote force-pushed the alerting/error-handling branch from cad9add to 13ae86a Compare July 29, 2019 12:46

mikecote added 2 commits July 29, 2019 09:14

Add missing unit test

d9fedaa

Add unit tests for getRetry

cbb666c

mikecote marked this pull request as ready for review July 29, 2019 13:28

mikecote requested a review from a team July 29, 2019 13:28

mikecote added the review label Jul 29, 2019

This comment has been minimized.

Sign in to view

Add test for rate limit

4b751f1

pmuellr reviewed Jul 29, 2019

View reviewed changes

x-pack/legacy/plugins/actions/server/action_type_registry.test.ts Outdated Show resolved Hide resolved

Remove fake timers

1ec4571

pmuellr reviewed Jul 29, 2019

View reviewed changes

pmuellr approved these changes Jul 29, 2019

View reviewed changes

Don't retry errors by default for actions unless the proper result fo…

f6c0353

…rmat is returned with status: error

This comment has been minimized.

Sign in to view

bmcconaghy approved these changes Jul 30, 2019

View reviewed changes

x-pack/legacy/plugins/actions/server/action_type_registry.ts Outdated Show resolved Hide resolved

mikecote added 3 commits July 30, 2019 09:13

Merge branch 'master' of github.com:elastic/kibana into alerting/erro…

ede0ced

…r-handling

Don't retry unless attribute specified

4bb44c4

Fix tests

2f1f593

mikecote merged commit 9a321ae into elastic:master Jul 30, 2019

mikecote mentioned this pull request Jul 30, 2019

[7.x] Implement wanted error handling for alerting and actions (#41917) #42255

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement wanted error handling for alerting and actions#41917

Implement wanted error handling for alerting and actions#41917
mikecote merged 13 commits intoelastic:masterfrom
mikecote:alerting/error-handling

mikecote commented Jul 24, 2019 •

edited

Loading

Uh oh!

elasticmachine commented Jul 24, 2019

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Uh oh!

pmuellr Jul 29, 2019

Uh oh!

mikecote Jul 29, 2019

Uh oh!

mikecote Jul 29, 2019

Uh oh!

pmuellr left a comment

Uh oh!

pmuellr Jul 29, 2019

Uh oh!

mikecote Jul 29, 2019

Uh oh!

This comment has been minimized.

bmcconaghy left a comment

Uh oh!

Uh oh!

elasticmachine commented Jul 30, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mikecote commented Jul 24, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticmachine commented Jul 24, 2019

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Uh oh!

pmuellr Jul 29, 2019

Choose a reason for hiding this comment

Uh oh!

mikecote Jul 29, 2019

Choose a reason for hiding this comment

Uh oh!

mikecote Jul 29, 2019

Choose a reason for hiding this comment

Uh oh!

pmuellr left a comment

Choose a reason for hiding this comment

Uh oh!

pmuellr Jul 29, 2019

Choose a reason for hiding this comment

Uh oh!

mikecote Jul 29, 2019

Choose a reason for hiding this comment

Uh oh!

This comment has been minimized.

bmcconaghy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

elasticmachine commented Jul 30, 2019

💚 Build Succeeded

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mikecote commented Jul 24, 2019 •

edited

Loading