sql: introduce new internal executor interfaces by ZhouXing19 · Pull Request #82477 · cockroachdb/cockroach

ZhouXing19 · 2022-06-06T18:27:48Z

This PR aims to provide a set of safer interfaces for the internal executor, making it less easy to abuse.

Currently, each conn executor underneath the internal executor (we call it “child executor”) has its own set of information, such as descriptor collection, job collection, schema change jobs, etc, even when it’s run with a not-nil outer kv.Txn, or there're multiple SQL executions under the same kv.Txn.
This is not intuitive, since it violates a rather deep principle that a descs.Collection and a SQL txn have a 1:1 relationship. The code doesn’t enforce that, but it ought to. The more places that make it possible to decouple this, the more anxious we get.

Ideally, internal executor with a not-nil txn is either planner or collectionFactory oriented, so that the txn is always tightly coupled with the descriptor collection. We thus propose a set of new interfaces to ensure this coupling.

Currently, the usage of an internal executor query function (e.g. InternalExecutor.ExecEx()) falls into the following 3 categories:

The query is run under a planner context and with a not-nil kv.Txn from this planner.
The query is run without a kv.Txn. (e.g. InternalExecutor.ExecEx(..., nil /* txn */, stmt...)
The query is running with a not-nil kv.Txn but not under the planner context.

For usage 1, the descriptor collections, txn state, job collections, and session data from the parent planner are expected to be passed to the internal executor's child conn executor.
For usage 2 and 3, if multiple SQL statements are run under the same txn, these executions should share the descs.Collection, txn state machine, job collections and session data for their conn executors.

To suit these 3 use cases, we proposed 3 interfaces for each of the query function:
(In the following we use InternalExecutor.ExecEx as the example)

For case 1, refactor to use func (p *planner) ExecExUpdated(), where the internal executor is always initialized with descs.Collection, TxnState and etc. from the sql.planner.
For case 2, refactor to use ieFactory.WithoutTxn(), where the query is always run with a nil kv.Txn.
For case 3, refactor to use CollectionFactory.TxnWithExecutor(). In this function, the internal executor is generated and passed to the call back function to run the query.

We also tried refactoring some of the existing use cases to give an example of the new interface.

(Note that the ultimate goal of this improvement is to deprecate all the "free-hanging" InternalExecutor objects (such as sql.ExecutorConfig.InternalExecutor) and replace them with an InternalExecutorFactory field. InternalExecutorFactory is to initialize a REAL internal executor, but it cannot be used directly to run SQL statement queries.
Instead, we wrap the initialization of an internal executor inside each query function, i.e. init it only when you really need to run a query. In other words, the creation of an internal executor becomes closer to the query running.)

fixes #69495
fixes #78998

Release Note: None

cockroach-teamcity · 2022-06-06T18:28:02Z

This change is

knz

Could you explain to us what you recommend happens in the following "common" use case:

err := db.Txn(ctx, func(ctx context.Context, txn *kv.Txn) error {
   ... := internalExecutor.QueryRowEx(ctx, ..., txn, query)
})

FYI, What this pattern does is to run the query but make it robust to transaction retry errors: the code inside db.Txn() will re-execute the closure (containing the QueryRowEx call) every time a retry error is encountered, automatically.

Now, what would happen with the proposed APIs? It seems to me that it would be "expensive" to re-initialize an internalexecutor from scratch every time db.Txn invokes the closure.

Would there be a way to "refresh" the kv.Txn object inside the executor, the same way that sql.connExecutor does internally for auto-retries (like in prepareTxnForRetryWithRewind())?

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @miretskiy, and @rafiss)

ajwerner · 2022-06-16T14:24:31Z

If I understand correctly, nothing really has been made worse by this interface change. I'd like to unpack the extent to which it has.

Could you explain to us what you recommend happens in the following "common" use case:

The db.Txn use case is not that common, it's pattern (3) in the list. A similarly common pattern today is to pass the nil /* txn */ (pattern (2)). In the nil txn case, the connExecutor does all of the restart management, as you propose.

It seems to me that it would be "expensive" to re-initialize an internalexecutor from scratch every time db.Txn invokes the closure.

I don't think I buy this. The internalExecutor we have today is shockingly thin. We already build a new connExecutor for each invocation. That means this change is not worse than any existing code which uses an explicit txn argument, right?

cockroach/pkg/sql/internal.go

Lines 129 to 140 in cc6aa33

    
           // initConnEx creates a connExecutor and runs it on a separate goroutine. It 
        
           // takes in a StmtBuf into which commands can be pushed and a WaitGroup that 
        
           // will be signaled when connEx.run() returns. 
        
           // 
        
           // If txn is not nil, the statement will be executed in the respective txn. 
        
           // 
        
           // The ieResultWriter coordinates communicating results to the client. It may 
        
           // block execution when rows are being sent in order to prevent hazardous 
        
           // concurrency. 
        
           // 
        
           // sd will constitute the executor's session state. 
        
           func (ie *InternalExecutor) initConnEx(

knz · 2022-06-16T14:29:33Z

ok, that's a good explanation thanks

ZhouXing19 · 2022-06-16T15:49:10Z

Thanks @ajwerner for the explanation! Yes the understanding is correct.

Would there be a way to "refresh" the kv.Txn object inside the executor, the same way that sql.connExecutor does internally for auto-retries (like in prepareTxnForRetryWithRewind())?

As Andrew mentioned, the initialization of an internal executor itself is to call sql.MakeInternalExecutor() coupled with some addition setting for session data, descs.Collection and etc. It does create and start new monitor for the new ie, but it doesn't seem that expensive.

So if it's acceptable to re-init the internal executor for each retry in the db.Txn() example, I don't think we need to enable "refreshing" the kv.Txn inside the internal executor, since in each retry, a new kv.Txn will be passed to the new ie.

rafiss · 2022-08-08T14:44:48Z

My gut is that this work is on a path to somewhere better than the internal executor we started with, but if we just leave things like this, we'll have made the story worse.

I disagree that it makes the story worse; I view this as an incremental improvement. This is evident by the fact that we already see the roundtrip tests improving. If your concern is just about tech debt, then with clear commenting and documentation, I really don't think we are setting ourselves for the disaster that you fear.

I hope we can get rid of the extraTxnState object soon and that we can actually make the APIs reflect the lifecycles which you're starting to create. If the status quo created here exists in 23.1, I'll be very sad.

I feel that we need these new interfaces merged for us to have common ground when discussing how to make further improvements. I feel pretty strongly that the way forward to this goal is to merge this PR and to tackle the other issues you raise in future PRs. I worry that trying to tackle each issue in this PR might have been making it harder to iterate on addressing the concerns you are raising.

what's acceptable in terms of the state of the world for 22.2.

We still plan to work through tech debt in this area during the stability period.

I hope it's clear that I'm not trying to dismiss your feedback. I'm trying to say that the best way to address your valuable feedback is to merge this large PR and continue working on improvements.

cucaroach · 2022-08-08T14:48:48Z

Does this PR fix #70888?

ZhouXing19 · 2022-08-08T15:05:26Z

Does this PR fix #70888?

Sadly no. This PR doesn't deprecate the existing ExecCfg.InternalExecutor, which we proposed to do after merging this one. This PR proposes a new interface to initialize an internal executor, which is expected to eventually replace the current ExecCfg.InternalExecutor and solve this issue. But given the large number of usages of the latter, we'd prefer to have the replacement in a future PR.

cucaroach · 2022-08-08T15:21:17Z

Okay thanks for the update! No worries, Rome wasn't built in a day. Could we get an Informs: in your commit message to that issue just so things are looped together? Thanks!

ZhouXing19 · 2022-08-08T15:37:14Z

Could we get an Informs: in your commit message to that issue just so things are looped together?

Yeah, sure! I'll add that.

ZhouXing19 · 2022-08-09T14:50:09Z

Made changes per comments, except changing sqlutil.InternalExecutorFactory into an interface, for I'm not 100% sure about how to define the InternalExecutorFactory field of structs: #82477 (comment)

ZhouXing19 · 2022-08-09T23:49:57Z

Made changes including changing sqlutil.InternalExecutorFactory to an interface. Ready for another look.

ZhouXing19 · 2022-08-10T17:42:34Z

Friendly reping-ing @ajwerner for another look. Thanks!

ajwerner

This is more or less where I want it. I think it'd be good if we delegate the functionality we need in the descs layer to a method in the sql layer on the InternalExecutorFactory, but otherwise, I'm content with the shape this has taken. Thanks for all the iteration.

ajwerner · 2022-08-11T16:16:45Z

pkg/sql/catalog/descs/factory.go

+// InternalExecutorFactoryWithTxn is used to create an internal executor
+// with associated extra txn state information.
+type InternalExecutorFactoryWithTxn func(


it feels to me like this should just be a method on the InternalExecutorFactory and that this library should describe the interface method. That way, we can avoid the closure in the server package.

Yeah I thought about that but it can be a bit difficult -- we will have to import descs package to sqlutil for the collection parameter, but that will again bring dependency loop.

I don't think that's true, sqlutil doesn't need to include this method in its interface, just the sql.InternalExecutorFactory has to implement the method. As in, we define an interface in descs which is distinct from the interface in sqlutil.

Ah, i see what you meant. Yeah, that makes sense, will do!

…are passed This commit adds a boolean field `fromOuterTxn` to the conn executor's extraTxnState. It's set true when the conn executor is run with a not-nil txn passed from the internal executor, and hence the collection and the job records, which are passed from the caller of the internal executor to the conn executor, should not be released when the conn executor close. Instead, we leave the caller to release them. This commit also changed the descriptor collection and schema changer state stored in conn executor's `ExtraTxnState` to pointer. We also deprecated `collectionFactory.MakeCollection()` with `collectionFactory.NewCollection()`. Release note: None

This commit 1. renamed the original `sqlutil.SessionBoundInternalExecutorFactory` to a more general name `sqlutil.InternalExecutorFactory`, and 2. change this factory from a function to an interface, that include a `NewInternalExecutor()` method with the same logic as the original function. Release note: none

This commit modified how a child conn executor is initialized under internal executor. We modified the logic of initializing a conn executor under internal executor. If there's a descriptor collection, txn state, job collection and schema change job passed to the internal executor, we let the child conn executor inherit them, instead of creating a new set for itself. Release note: None

Currently, the internal executor always create its own descriptor collections, txn state, job collection and etc. for its conn executor, even though it's run underneath a "parent" query. These recreation can unneccesarily reduce the query efficiency in some use cases, such as when an internal executor is used under a planner context. In this case, the internal executor is expected to inherit these info from the planner, rather than creating its own. To make this rule more explicit, this commit adds a series of query functions under `sql.planner`. Each of these functions wrap both the init of an internal executor and the query execution. In this way, the internal executor always stores the info inherited from the parent planner, and will pass it to its child conn executor. fixes cockroachdb#69495 Release note: None

…Executor() This commit introduces two functions that allow users to run sql statements with an internal executor. We intend to limit the usage of a real internal executor only inside these functions, instead of free-floating or hanging off certain structs. In other words, we restrict the init of an internal executor. The motivation is that if an internal executor is used to run multiple sql statements in a txn manner, these executions are expected to use the same set of info (such as descriptor collections) among their conn executor. While this rule can be easily forgot by the user of internal executors. Hence we provide an interface that wraps the initialization of internal executors with the query executions, so that the users won't need to be worried. Informs: once all existing usages of the internal executors are replaced with the new interfaces proposed here, cockroachdb#70888 should be solved. Release note: None

This commit provide an example to refactor the current use cases with the new internal executor interfaces. In this example, originally, the internal executor was used with a nil txn. We now replace it with ieFactory.RunWithoutTxn(). Release Note: None Release note (<category, see below>): <what> <show> <why>

….WithTxn() This commit is to provides example to refactor the usages of internal executor with the new interfaces. Idealy, if a planner is involved, use the query functions for `sql.planner`. Otherwise, if the query is to run with a not-nil txn, we should use collectionFactory.WithTxn(). Release note: None

ZhouXing19 · 2022-08-12T00:18:36Z

Thanks all for reviewing!
bors r=rafiss,ajwerner

craig · 2022-08-12T04:43:01Z

Build failed (retrying...):

Bazel Essential CI (Cockroach)

craig · 2022-08-12T06:02:39Z

Build succeeded:

Bazel Essential CI (Cockroach)

andreimatei · 2022-08-29T21:58:31Z

pkg/sql/internal.go

+
+// RunWithoutTxn is to create an internal executor without binding to a txn,
+// and run the passed function with this internal executor.
+func (ief *InternalExecutorFactory) RunWithoutTxn(


Why was this method necessary / how is it different from NewInternalExecutor() ? Can't the caller run exactly the two lines inside this function?
The comment talks about "binding to a txn" without other explanations. I don't think either fewer or more words would be needed since most callers don't care about that / don't know what "binding to a txn" means.

I agree that this function is far from ideal... We made this function with the hope of avoiding use cases where an internal executor is created without binding to any txn-related metadata, but is used to run queries with a not nil-txn. In other words, we wanted to make the usages "with" and "without" an outer txn more distinct from each other, and let callers think twice about which one they should use.
I think we can add a comment saying that it's disallowed to use this function to run DDLs or multiple statement in a transactional manner.

You can have a NewInternalExecutorWithoutTxn if you insist on the importance of having the "without part" in your face, but I don't see why the caller needs to structure its logic into a closure if they're not getting anything in return.

I think if we only have NewInternalExecutorWithoutTxn, it can still happen to the caller to use it to run statements with a not-nil txn, which is wrong. To wrap it in this function is to make it more explicit that you shouldn't do this (though it's true that we can't truly disallow it here)

I think the ideal case is to remove the txn field in internal executor's query functions (e.g. ie.QueryRowEx()). The txn should be bound to the internal executor, rather than each statement execution. With that, I think it's totally fine for us to remove this function and just do ie := NewInternalExecutorWithoutTxn ()

ZhouXing19 force-pushed the ie-new-0601 branch from a0be410 to 06ef82a Compare June 6, 2022 19:48

ZhouXing19 changed the title ~~[DNM] internl execurot improvement~~ [DNM] internl executor improvement Jun 6, 2022

ZhouXing19 force-pushed the ie-new-0601 branch from 06ef82a to 287e4ff Compare June 15, 2022 21:18

ZhouXing19 changed the title ~~[DNM] internl executor improvement~~ sql: introduce new internal executor interface Jun 15, 2022

ZhouXing19 marked this pull request as ready for review June 15, 2022 21:21

ZhouXing19 requested review from a team as code owners June 15, 2022 21:21

ZhouXing19 requested a review from a team June 15, 2022 21:21

ZhouXing19 requested a review from a team as a code owner June 15, 2022 21:21

ZhouXing19 requested a review from a team June 15, 2022 21:21

ZhouXing19 requested a review from a team as a code owner June 15, 2022 21:21

ZhouXing19 requested review from a team, ajwerner, miretskiy and rafiss and removed request for a team June 15, 2022 21:21

ZhouXing19 changed the title ~~sql: introduce new internal executor interface~~ sql: introduce new internal executor interfaces Jun 15, 2022

knz reviewed Jun 16, 2022

View reviewed changes

ZhouXing19 force-pushed the ie-new-0601 branch from 287e4ff to 062e366 Compare June 16, 2022 16:10

ZhouXing19 requested a review from a team as a code owner June 16, 2022 16:10

ZhouXing19 force-pushed the ie-new-0601 branch 3 times, most recently from 22de15b to a305630 Compare June 21, 2022 15:03

abarganier removed the request for review from a team June 21, 2022 16:32

ZhouXing19 force-pushed the ie-new-0601 branch 2 times, most recently from 445e55e to a9b858c Compare June 21, 2022 17:10

ZhouXing19 force-pushed the ie-new-0601 branch from 8d3aeca to cd75c45 Compare August 9, 2022 14:46

ZhouXing19 force-pushed the ie-new-0601 branch from cd75c45 to 068f90b Compare August 9, 2022 18:17

ZhouXing19 requested review from a team and ajwerner August 9, 2022 18:17

ZhouXing19 force-pushed the ie-new-0601 branch from 068f90b to 3df480e Compare August 10, 2022 16:15

ajwerner approved these changes Aug 11, 2022

View reviewed changes

ZhouXing19 added 7 commits August 11, 2022 14:39

ZhouXing19 force-pushed the ie-new-0601 branch from 3df480e to afb5db4 Compare August 11, 2022 19:46

ZhouXing19 mentioned this pull request Aug 17, 2022

*: add restriction to running DDL with internal executors #86334

Merged

ajwerner mentioned this pull request Aug 18, 2022

kv: remove PrepareRetryableError from the (*kv.Txn) API #86361

Closed

ZhouXing19 mentioned this pull request Aug 29, 2022

sql: add support to the InternalExecutor to execute multiple SQL queries #71467

Closed

andreimatei reviewed Aug 29, 2022

View reviewed changes

ZhouXing19 mentioned this pull request Oct 31, 2022

*: migrate the creation of internal executor to the new interfaces #91004

Closed

Conversation

ZhouXing19 commented Jun 6, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cockroach-teamcity commented Jun 6, 2022

Uh oh!

knz left a comment

Choose a reason for hiding this comment

Uh oh!

ajwerner commented Jun 16, 2022

Uh oh!

knz commented Jun 16, 2022

Uh oh!

ZhouXing19 commented Jun 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rafiss commented Aug 8, 2022

Uh oh!

cucaroach commented Aug 8, 2022

Uh oh!

ZhouXing19 commented Aug 8, 2022

Uh oh!

cucaroach commented Aug 8, 2022

Uh oh!

ZhouXing19 commented Aug 8, 2022

Uh oh!

ZhouXing19 commented Aug 9, 2022

Uh oh!

ZhouXing19 commented Aug 9, 2022

Uh oh!

ZhouXing19 commented Aug 10, 2022

Uh oh!

ajwerner left a comment

Choose a reason for hiding this comment

Uh oh!

ajwerner Aug 11, 2022

Choose a reason for hiding this comment

Uh oh!

ZhouXing19 Aug 11, 2022

Choose a reason for hiding this comment

Uh oh!

ajwerner Aug 11, 2022

Choose a reason for hiding this comment

Uh oh!

ZhouXing19 Aug 11, 2022

Choose a reason for hiding this comment

Uh oh!

ZhouXing19 Aug 11, 2022

Choose a reason for hiding this comment

Uh oh!

ZhouXing19 commented Aug 12, 2022

Uh oh!

craig bot commented Aug 12, 2022

Uh oh!

craig bot commented Aug 12, 2022

Uh oh!

andreimatei Aug 29, 2022

Choose a reason for hiding this comment

Uh oh!

ZhouXing19 Aug 29, 2022

Choose a reason for hiding this comment

Uh oh!

andreimatei Aug 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ZhouXing19 Aug 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

ZhouXing19 commented Jun 6, 2022 •

edited

Loading

ZhouXing19 commented Jun 16, 2022 •

edited

Loading

andreimatei Aug 29, 2022 •

edited

Loading

ZhouXing19 Aug 29, 2022 •

edited

Loading