importer: AddSSTable sent after import cancellation #91418

@stevendanna

Description

Describe the problem

We've observed AddSSTable requests being applied after an IMPORT job's OnFailOrCancel callback attempted to clear the data added by the import.

As a result, the table contains data from the cancelled IMPORT.

We believe this happens because the import processor on the remote node is still running. While the import processor will eventually see a context cancellation, occasionally that context cancellation isn't seen until after another node has already adopted the IMPORT job and run the OnFailOrCancel hook.

We've identified a few possible causes of this:

  1. We have at least one goroutine that we do not wait for:

    // We don't have to worry about this go routine leaking because next we loop over progCh
    // which is closed only after the go routine returns.
    go func() {
        defer close(idp.progCh)
        idp.summary, idp.importErr = runImport(ctx, idp.flowCtx, &idp.spec, idp.progCh,
            idp.seqChunkProvider)
    }()

    Despite the comment in that code, in the case of cancellation we've observed this goroutine outliving the processor.

  2. Since jobs: Clear out claim info when pausing #89014, OnFailOrCancel is eligible for execution on another node immediately after the Resumer's context has been cancelled. We've observed via logs that OnFailOrCancel can run before the Resumer has exited.

  3. Even with (1) and (2) fixed, we've observed that dsp.Run

    dsp.Run(ctx, planCtx, nil, p, recv, &evalCtxCopy, nil /* finishedSetupFn */)

    returns before the processors have exited. Typically, a processor will shut down or observe the cancelled context before it can make a successful AddSSTable request, but occasionally it does not.

Possible Solutions

The following are possible solutions we've discussed in the past for this problem:

  • Review our distSQL flow and ensure we are using the correct contexts and implementing the correct callbacks to ensure as orderly a shutdown as possible.

  • Add new code in the job coordinator to explicitly wait for an affirmative shutdown from all processors. This would certainly help on the happy path, but it wouldn't cover all cases, since the node responsible for doing the waiting may itself fail before the wait completes.

  • Add a safety timeout before issuing any DeleteRange requests.

  • Periodically broadcast a timestamp to all processors that the processors will use for writing AddSSTables (rather than allowing them to use the batch timestamps). The node responsible for cancellation would then know the last timestamp at which nodes were possibly writing.

  • For IMPORT INTO on empty tables, as a special case, we could write into a different index and then swap over to that index on success.

  • A new KV feature, "span admin lock", which would lock a span for admin operations. Any AddSSTable request that arrived with a wrong or stale lock token would be rejected.

To Reproduce

The failure can be seen in the unit test found here: #91407 when run under stress for a few minutes.

Jira issue: CRDB-21252

Labels

A-disaster-recovery, C-bug (code not up to spec/doc, specs & docs deemed correct; solution expected to change code/behavior), T-disaster-recovery
