Tracking issue: safe, lazy, managed DROP TABLE

`DROP TABLE` is a risky MySQL operation in production. There seem to be multiple components involved, the major being that if the table has pages in InnoDB's buffer pool (one or many), then the buffer pool is locked for the duration of the `DROP`. The duration of the `DROP` is also related with the time it takes the operating system to delete the `.ibd` file associated with the table (assuming `innodb_file_per_table`).

Noteworthy that the problem is in particular on the `primary` (`master`) node; Replicas are not affected as much.

Different companies solve `DROP TABLE` in different ways. An interesting discussion is found on `gh-ost`'s repo and on mysql bugs:

- https://github.com/github/gh-ost/issues/307
- https://bugs.mysql.com/bug.php?id=91977

Solutions differ in implementation, but all suggest _waiting_ for some time before actually dropping the table. That alone requires management around `DROP TABLE` operations. As explained below, _waiting_ enables reverting the operation.

Vitess should automate table drops and make the problem transparent to the user as much as possible. Breakdown of the suggested solution follows.

## Illustrating the DROP steps

We can make the `DROP` management stateful or stateless. I opt for stateless: no meta tables to describe the progress of the `DROP`. The state should be inferred from the tables themselves. Specifically, we will encode hints in the table names.

We wish to manage `DROP` requests. Most managed `DROP` requests will _wait_ before destroying data. If the user issued a `DROP TABLE` only to realize the app still expects the table to exist, then we make it possible to revert the operation.

This is done by first issuing a `RENAME TABLE my_table TO something_else`. To the app, it seems like the table is gone; but the user may easily restore it by running the revert query: `RENAME TABLE something_else TO my_table`.

That `something_else` name can be e.g. `_vt_HOLD_2201058f_f266_11ea_bab4_0242c0a8b007_20200910113042`.

At some point we decide that we can destroy the data. The "hold" period can either be determined by vitess or explicitly by the user. e.g. On a successful schema migration completion, Vitess can choose to purge the "old" table right away.
At that stage we rename the table to e.g. `_vt_PURGE_63b5db0c_f25c_11ea_bab4_0242c0a8b007_20200911070228`.
A table by that name is eligible to have its data purged.

In my experience (see `gh-ost` issue above), a safe method to purge data is to slowly remove rows, until the table is empty. Note:

- This operation needs to be throttled (see #6661, #6668,  https://github.com/vitessio/website/pull/512)
- We can `SET SQL_LOG_BIN=0` and only purge the table on the `primary`. This reduces binlog size and also does not introduce replication lag. One may argue that lag-based throttling is not needed, but in my experience it's still wise to use, since replication lag can imply load on `primary`, and it's best to not overload the `primary`.
- We issue a `DELETE FROM my_table LIMIT 50` ; 10-100 are normally good chunk sizes. Order of purging does not matter.

It's important to note that the `DELETE` statement actually causes table pages to _load into the buffer pool_, which works against us.

Once all rows are purged from a table, we rename it again to e.g. `_vt_DROP_8a797518_f25c_11ea_bab4_0242c0a8b007_20200911234156`. At this time we point out that `20200911234156` is actually a readable timestamp, and stands for `2020-09-11 23:41:56`. That timestamp can tell us when the table was last renamed. 

Vitess can then run an actual `DROP TABLE` for `_vt_DROP_...` tables whose timestamp is older than, say, 2 days. As mentioned above, purging the table actually caused the table to load onto the buffer pool, and we need to wait for it to naturally get evicted, before dropping it.


 
## Asking for a safe DROP

We can introduce a new `DROP TABLE` syntax. The user will issue `DROP TABLE` with hints to Vitess. Examples could be:

- `DROP IN '24h' TABLE my_table`: `my_table` renamed to `_vt_HOLD_...` where it will spend at least 24h before transitioning to `_vt_PURGE...`. The user will have 24h to regret and revert the `DROP`.
- `DROP IN '30min' TABLE my_table`: same, 30min only
- `DROP NOWAIT TABLE my_table`: the user indicates they're sure they won't regret the `DROP`. We save time and immediately rename to `_vt_PURGE_...`
- `DROP DEFAULT TABLE my_table`: Vitess chooses the `HOLD` time (e.g. 3 days)

Vitess can internally choose to drop tables, my immediate example is with [automated schema migrations](https://github.com/vitessio/vitess/pull/6547). Whether successful or failed, it's generally safe to purge the artifact tables immediately.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking issue: safe, lazy, managed DROP TABLE #6689

Illustrating the DROP steps

Asking for a safe DROP

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Tracking issue: safe, lazy, managed DROP TABLE #6689

Description

Illustrating the DROP steps

Asking for a safe DROP

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions