jobs: add expiration for paused jobs #84598

@amruss

Description

Paused jobs are easy to forget about. Long-paused jobs can cause cluster issues because of the garbage they can accumulate. Additionally, supporting long-paused jobs through cluster upgrades is hard.

We should:

  • Expire jobs after they have been paused for 3 days, and add the expiration time as a column in the jobs table
  • If a job is unpaused then paused, the expiration counter should reset

Rationale behind 3 days: you have enough working-day time to have done something about this

Follow up work:

Jira issue: CRDB-17756

Epic: CRDB-20422

Metadata


Labels

C-enhancement: Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
T-jobs
