
fix!: Rename MaxGoRoutines and enforce it #103

Closed

erezrokah wants to merge 3 commits into cloudquery:main from erezrokah:fix/goroutines

Conversation

Member

@erezrokah erezrokah commented Sep 14, 2022

Summary

When testing the Azure plugin I noticed we print a message on the number of Go routines used. From the other logs it seemed that the number is not enforced (All table fetch start logs were printed at the same time).
Since we can't really enforce a number of Go routines as sometimes we'd like to fetch related resources in parallel and we don't know those in advance, I renamed the configuration to Concurrency as I think we should still have a setting to control the load CloudQuery puts on the APIs (right now it creates a Go routine per top level table).
Also I switched to using channels instead of semaphore as seems more suitable for this use case, and we use channels in other places in the code.
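The channel-based limit described above can be sketched as follows. This is a minimal illustration of the pattern, not the PR's actual code; `fetchAll` and `fetchTable` are hypothetical names. A buffered channel acts as a semaphore: sends block once the buffer is full, capping how many top-level fetches run at once.

```go
package main

import (
	"fmt"
	"sync"
)

// fetchTable stands in for resolving one top-level table.
func fetchTable(name string) string { return "fetched " + name }

// fetchAll resolves every table, but never runs more than
// `concurrency` fetches at the same time.
func fetchAll(tables []string, concurrency int) []string {
	sem := make(chan struct{}, concurrency)
	var (
		mu      sync.Mutex
		wg      sync.WaitGroup
		results []string
	)
	for _, t := range tables {
		wg.Add(1)
		sem <- struct{}{} // acquire a slot; blocks when `concurrency` fetches are in flight
		go func(t string) {
			defer wg.Done()
			defer func() { <-sem }() // release the slot
			r := fetchTable(t)
			mu.Lock()
			results = append(results, r)
			mu.Unlock()
		}(t)
	}
	wg.Wait()
	return results
}

func main() {
	fmt.Println(len(fetchAll([]string{"a", "b", "c", "d", "e"}, 2))) // 5
}
```

Compared to a semaphore library, the buffered channel needs no extra dependency and composes with `select` if a non-blocking acquire is ever needed.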


Use the following steps to ensure your PR is ready to be reviewed

  • Read the contribution guidelines 🧑‍🎓
  • Run go fmt to format your code 🖊
  • Lint your changes via golangci-lint run 🚨 (install golangci-lint here)
  • Update or add tests 🧪
  • Ensure the status checks below are successful ✅

}
}

totalResources++
Member Author

@erezrokah erezrokah Sep 14, 2022


It seems clearer to me to count each resource when we send it to the channel, instead of the current approach. Happy to revert, as this is a personal preference.

@github-actions github-actions bot added breaking and removed fix labels Sep 14, 2022
@yevgenypats
Contributor

I'd like to give it a go as well, as I was dealing with a lot of issues around our concurrency and locks. I commented some of the stuff out because of lack of time, and because I encountered some race conditions which I didn't have time to fix at the time.

@github-actions github-actions bot added breaking and removed fix labels Sep 15, 2022
Member Author

erezrokah commented Sep 15, 2022

> I'd like to give it a go as well, as I was dealing with a lot of issues around our concurrency and locks. I commented some of the stuff out because of lack of time, and because I encountered some race conditions which I didn't have time to fix at the time.

💯 I understand, as this is a core change. I think the challenge is that we can't predict how many table.resolve jobs we have: when a table has a relation, we resolve it once per parent resource that was fetched. Hence we can only enforce the number of Go routines at the top level.
The only way I could think of to enforce the max Go routines is, instead of resolving the relations in the same Go routine as the parent, to send each relation as a new job back to the worker channel/queue. The problem with that approach is that we need to keep all the relation jobs in memory until they are scheduled. For example, if a top-level table fetches 10,000 items and has 4 relations, we'll queue 40,000 new jobs.

I deleted the comments, as code in comments usually gets stale and leads to confusion later.
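The fan-out described above can be made concrete with a toy calculation, using the numbers from the comment (`jobsQueued` is a hypothetical helper, not part of the SDK):

```go
package main

import "fmt"

// jobsQueued estimates how many relation jobs the "one level per job"
// design would hold in memory: each fetched parent resource produces
// one job per relation.
func jobsQueued(parentResources, relations int) int {
	return parentResources * relations
}

func main() {
	// The example from the comment: 10,000 items and 4 relations.
	fmt.Println(jobsQueued(10000, 4)) // 40000 jobs queued at once
}
```

With deeper relation trees the growth compounds, since each of those 40,000 jobs can in turn fan out into its own children.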

@yevgenypats
Contributor

> > I'd like to give it a go as well, as I was dealing with a lot of issues around our concurrency and locks. I commented some of the stuff out because of lack of time, and because I encountered some race conditions which I didn't have time to fix at the time.
>
> 💯 I understand, as this is a core change. I think the challenge is that we can't predict how many table.resolve jobs we have: when a table has a relation, we resolve it once per parent resource that was fetched. Hence we can only enforce the number of Go routines at the top level. The only way I could think of to enforce the max Go routines is, instead of resolving the relations in the same Go routine as the parent, to send each relation as a new job back to the worker channel/queue. The problem with that approach is that we need to keep all the relation jobs in memory until they are scheduled. For example, if a top-level table fetches 10,000 items and has 4 relations, we'll queue 40,000 new jobs.
>
> I deleted the comments, as code in comments usually gets stale and leads to confusion later.

I think this still won't solve it and will always risk deadlock. The only way I can see to solve it is with X parameters:

concurrency_relations_1, concurrency_relations_2, and so on. A limit on the number of jobs won't solve it, as child relations can always cause a deadlock for the parent; it doesn't matter whether it's a queue, worker channel, etc.
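The deadlock being described can be sketched: with a bounded job queue that is drained only by the workers themselves, a worker that blocks while enqueuing a child job can wedge the whole pool. Here is a toy demonstration; it uses a non-blocking send (`tryEnqueue` is a hypothetical helper) so the example itself doesn't hang:

```go
package main

import "fmt"

// tryEnqueue attempts a non-blocking send; it returns false when the
// queue is full, i.e. exactly the situation in which a blocking send
// from inside a worker would deadlock the pool.
func tryEnqueue(jobs chan string, j string) bool {
	select {
	case jobs <- j:
		return true
	default:
		return false
	}
}

func main() {
	const workers = 2
	jobs := make(chan string, workers) // small bounded queue

	// Pretend every worker is busy resolving a parent table, so the
	// queue is already full.
	jobs <- "parent-1"
	jobs <- "parent-2"

	// A worker resolving parent-1 now tries to enqueue its child job.
	// With a blocking send, every worker would be waiting to send into
	// a queue that only workers can drain: a deadlock.
	fmt.Println(tryEnqueue(jobs, "child-of-parent-1")) // false
}
```

This is why a single job limit cannot be safe at every relation depth: once all slots are held by parents, their children have nowhere to go.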

@erezrokah
Member Author

> concurrency_relations_1, concurrency_relations_2

Yeah, I thought about that too, but that's a dynamic number based on the depth of relations, and it's also very hard to expose as a config.

> A limit on the number of jobs won't solve it, as child relations can always cause a deadlock for the parent; it doesn't matter whether it's a queue, worker channel, etc.

I think if you resolve only one level at a time, and each relation is a new job, it won't deadlock, because the parent has already finished running (if we do a non-blocking send or use a very large channel). But again, that leads to memory issues as the number of jobs grows very large.

Hence this PR only implements a limit on the top-level tables and avoids the term Go routines.
If we don't have a limit at all, the only way to control concurrency is to use multiple configuration files with a different set of resources per configuration.

On another note, Azure has 55 top-level tables, so if someone has 4 subscriptions the current code will create 220 Go routines. I'm actually not worried about the number of Go routines, but about the number of API calls and rate limiting.
⬆️ This might not be a problem yet, but it would be nice to be able to control it.


totalResources := 0
for i := 0; i < jobsCount; i++ {
result := <-results
Member


I would make the channels unbuffered and collect in a separate goroutine, to save on memory.

Member Author


I'll make this change once @yevgenypats approves of the new approach
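The reviewer's suggestion (unbuffered result channel drained by a separate collector goroutine) might look like the following sketch; `collect` is a hypothetical name and the shape is an assumption, not the PR's code. Because the channel is unbuffered, each result is handed straight to the collector instead of piling up in a buffer sized to the job count:

```go
package main

import (
	"fmt"
	"sync"
)

// collect runs one goroutine per job plus a concurrent collector that
// sums the results as they arrive.
func collect(jobs []int) int {
	results := make(chan int) // unbuffered: no per-job memory held
	done := make(chan int)

	// Collector: drains results while the workers are still running.
	go func() {
		total := 0
		for r := range results {
			total += r
		}
		done <- total
	}()

	var wg sync.WaitGroup
	for _, j := range jobs {
		wg.Add(1)
		go func(j int) {
			defer wg.Done()
			results <- j // blocks only until the collector receives
		}(j)
	}
	wg.Wait()
	close(results)
	return <-done
}

func main() {
	fmt.Println(collect([]int{1, 2, 3, 4, 5})) // 15
}
```

Closing `results` only after `wg.Wait()` guarantees the collector's `range` loop terminates exactly once all workers have sent.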

Destinations []string `json:"destinations,omitempty"`
Spec interface{} `json:"spec,omitempty"`
Registry Registry `json:"registry,omitempty"`
Concurrency uint64 `json:"concurrency,omitempty"`
Member


I welcome this change as the MaxGoRoutines name is a bit too implementation-detaily.

@erezrokah
Member Author

Closing in favor of #129

@erezrokah erezrokah closed this Sep 19, 2022
@erezrokah erezrokah deleted the fix/goroutines branch September 19, 2022 10:53