store/bucket: wait until chunk loading ends in Close() by GiedriusS · Pull Request #6582 · thanos-io/thanos

GiedriusS · 2023-08-04T10:39:56Z

The chunk reader needs to wait until the chunk loading ends in Close() because otherwise there will be a race between appending to r.chunkBytes and reading from it. This is because Close() only cancels the context but populateChunk() cannot check the context in a way not to cause a race. So, if the context is canceled between getting data and populating chunks then there's a race.

fpetkovski · 2023-08-04T12:16:19Z

pkg/store/bucket.go

 		r.stats.chunksTouched++
 		r.stats.ChunksTouchedSizeSum += units.Base2Bytes(int(chunkDataLen))
-
-		r.block.chunkPool.Put(nb)


Hm, where did this go?

Idea was to put everything into the slice that is Put() back during Close() but it seems like this refactoring was faulty. I removed it.

Chunk reader needs to wait until the chunk loading ends in Close() because otherwise there will be a race between appending to r.chunkBytes and reading from it. Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>

douglascamata · 2023-08-10T15:29:22Z

This looks like a good place and time to apply https://pkg.go.dev/sync#Cond?

douglascamata

Leaving some suggestions on how I think sync.Cond could be use here. Feel free to decide whether to adopt.

douglascamata · 2023-08-10T15:32:44Z

pkg/store/bucket.go

+	loadingChunksMtx  sync.Mutex
+	loadingChunks     bool
+	finishLoadingChks chan struct{}


Suggested change

loadingChunksMtx sync.Mutex

loadingChunks bool

finishLoadingChks chan struct{}

loadingChunksCond *sync.Cond

loadingChunks bool

I think sync.Cond is hard to use and understand, and I would prefer to have a simpler solution with a mutex or an atomic variable.

@fpetkovski that's also a great idea. If we bump our minimum Go version high enough, we could use generic version of atomics to easily swap something of any type.

But I guess the critical part here is "waiting for a condition to be fulfilled": lock released when chunks are finally loaded.

An atomic variable won't work because we must wait until the chunk loading finishes. sync.Cond doesn't work because at the end of load() it will signal only once whereas we need a permanent state of either on or off. As far as I can tell, with your suggestion in a normal operation Close() will hang forever because it will never receive a signal.

I think the only way to simplify this is to get rid of the bool and create the channel under the lock. 🤔

@GiedriusS even with sync.Cond you will see that my suggestions keep the bool variable. The instance of sync.Cond only manages the pause/resume of execution when the bool variable (the condition) fails. You can see it in the suggestion below, where we don't even check the sync.Cond of the bool says blocks are loaded:

r.loadingChunksCond.L.Lock() if r.loadingChunks { r.loadingChunksCond.Wait() } r.loadingChunksCond.L.Unlock()

In a way, the sync.Cond is only abstracting the channel. We need still the condition's bool variable to be checked.

Mhm, so maybe we can merge this now and clean up later to unblock the release?

Let's go ahead with this then, and iterate on it. 🙂

douglascamata · 2023-08-10T15:34:06Z

pkg/store/bucket.go

+	r.loadingChunksMtx.Lock()
+	r.loadingChunks = false
+	r.finishLoadingChks = make(chan struct{})
+	r.loadingChunksMtx.Unlock()


Suggested change

r.loadingChunksMtx.Lock()

r.loadingChunks = false

r.finishLoadingChks = make(chan struct{})

r.loadingChunksMtx.Unlock()

r.loadingChunksCond = sync.NewCond(&sync.Mutex{})

No need to initialize r.loadingChunks as the default value for a bool is false.

douglascamata · 2023-08-10T15:34:41Z

pkg/store/bucket.go

+	// NOTE(GiedriusS): we need to wait until loading chunks because loading
+	// chunks modifies r.block.chunkPool.
+	r.loadingChunksMtx.Lock()
+	loadingChks := r.loadingChunks
+	r.loadingChunksMtx.Unlock()
+
+	if loadingChks {
+		<-r.finishLoadingChks
+	}


Suggested change

// NOTE(GiedriusS): we need to wait until loading chunks because loading

// chunks modifies r.block.chunkPool.

r.loadingChunksMtx.Lock()

loadingChks := r.loadingChunks

r.loadingChunksMtx.Unlock()

if loadingChks {

<-r.finishLoadingChks

}

// Locks the condition and wait for a signal.

r.loadingChunksCond.L.Lock()

if r.loadingChunks {

r.loadingChunksCond.Wait()

}

r.loadingChunksCond.L.Unlock()

douglascamata · 2023-08-10T15:41:51Z

pkg/store/bucket.go

+	r.loadingChunksMtx.Lock()
+	r.loadingChunks = true
+	r.loadingChunksMtx.Unlock()
+
+	defer func() {
+		r.loadingChunksMtx.Lock()
+		r.loadingChunks = false
+		r.loadingChunksMtx.Unlock()
+
+		close(r.finishLoadingChks)
+	}()
+


Suggested change

r.loadingChunksMtx.Lock()

r.loadingChunks = true

r.loadingChunksMtx.Unlock()

defer func() {

r.loadingChunksMtx.Lock()

r.loadingChunks = false

r.loadingChunksMtx.Unlock()

close(r.finishLoadingChks)

}()

// when done loading, signal to anyone waiting on chunks to be loaded.

defer r.loadingChunksCond.Signal()

Could use r.loadingCunks.Broadcast() here if there could be multiple Go routines waiting for chunks to be loaded.

Chunk reader needs to wait until the chunk loading ends in Close() because otherwise there will be a race between appending to r.chunkBytes and reading from it. Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>

pull-request-size bot added the size/S label Aug 4, 2023

fpetkovski reviewed Aug 4, 2023

View reviewed changes

GiedriusS force-pushed the wait_until_chunkloading_ends branch from 9bf6078 to 08da27e Compare August 9, 2023 08:14

pull-request-size bot added size/XS and removed size/S labels Aug 9, 2023

GiedriusS force-pushed the wait_until_chunkloading_ends branch from 08da27e to cfc6e39 Compare August 9, 2023 09:09

GiedriusS marked this pull request as draft August 9, 2023 09:53

GiedriusS force-pushed the wait_until_chunkloading_ends branch from cfc6e39 to 5273244 Compare August 9, 2023 10:51

pull-request-size bot added size/S and removed size/XS labels Aug 9, 2023

GiedriusS force-pushed the wait_until_chunkloading_ends branch 4 times, most recently from 1d0523c to e55687c Compare August 9, 2023 13:22

GiedriusS force-pushed the wait_until_chunkloading_ends branch from e55687c to 6eb3eb7 Compare August 9, 2023 14:00

GiedriusS marked this pull request as ready for review August 9, 2023 14:26

GiedriusS requested a review from fpetkovski August 9, 2023 14:26

douglascamata reviewed Aug 10, 2023

View reviewed changes

saswatamcode mentioned this pull request Aug 16, 2023

CHANGELOG: Mark v0.32 as in progress #6617

Merged

2 tasks

saswatamcode approved these changes Aug 16, 2023

View reviewed changes

saswatamcode merged commit 51da039 into main Aug 16, 2023

Conversation

GiedriusS commented Aug 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

douglascamata commented Aug 10, 2023

Uh oh!

douglascamata left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fpetkovski Aug 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

douglascamata Aug 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

douglascamata Aug 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

GiedriusS commented Aug 4, 2023 •

edited

Loading

fpetkovski Aug 16, 2023 •

edited

Loading

douglascamata Aug 10, 2023 •

edited

Loading

douglascamata Aug 10, 2023 •

edited

Loading