libnetwork: use go-immutable-radix instead of radix by tiborvass · Pull Request #44501 · moby/moby

tiborvass · 2022-11-18T22:15:32Z

This commit allows to remove dependency on the mutable version armon/go-radix.

The go-immutable-radix package is better maintained.

It is likely that a bit more memory will be used when using the immutable version, though discarded nodes are being reused in a pool. These changes happen when networks are added/removed or nodes come and go in a cluster, so we are still talking about a relatively low frequency event.

The major changes compared to the old radix are when modifying (insert or delete) a tree, and those are pretty self-contained: we replace the entire immutable tree under a lock.

Signed-off-by: Tibor Vass teabee89@gmail.com

tiborvass · 2022-11-18T22:15:53Z

I'm curious to see if things break in CI because of this.

thaJeztah · 2022-11-21T09:44:13Z

libnetwork/networkdb/cluster.go

 	// The lock is taken at the beginning of the cycle and the deletion is inline
 	for _, nid := range nodeNetworks {
 		nDB.Lock()


Still trying to get my head around this weird construct (added in moby/libnetwork@3feb3aa) even more with the "timeCompensate" below.

Do we know why does the lock/unlock have to happen inside the loop here (but the others lock outside)?

The comment above the for loop is attempting to explain it but I'm not sure I understand it. Could it be that releasing the lock in every iteration allows other goroutines to continue work instead of lagging too much? Either way, this PR is not modifying that logic.

Actually the commit message of the commit you posted confirms it.

Yeah, was looking at that description and wondered if that changed now that there's an RWMutex, but yeah, this code is complicated, so perhaps that didn't work

thaJeztah · 2022-11-21T09:49:09Z

libnetwork/networkdb/cluster.go

 			}

-			params := strings.Split(path[1:], "/")
+			params := strings.Split(string(path[1:]), "/")


Perhaps a micro-optimisation, but if these loops run many times, perhaps worth it while we're changing;

how about using bytes.Split or even bytes.SplitN (if we expect these to have 4 components)? Quick (naive) benchmark;

Details

package main import ( "bytes" "strings" "testing" ) func BenchmarkSplitString(b *testing.B) { path := []byte("/one/two/three/four") sep := "/" b.ReportAllocs() for i := 0; i < b.N; i++ { params := strings.Split(string(path[1:]), sep) nid := params[0] tname := params[1] key := params[2] _, _, _ = nid, tname, key } } func BenchmarkSplitBytes(b *testing.B) { path := []byte("/one/two/three/four") sep := []byte("/") b.ReportAllocs() for i := 0; i < b.N; i++ { params := bytes.Split(path[1:], sep) nid := string(params[0]) tname := string(params[1]) key := string(params[2]) _, _, _ = nid, tname, key } } func BenchmarkSplitNString(b *testing.B) { path := []byte("/one/two/three/four") sep := "/" b.ReportAllocs() for i := 0; i < b.N; i++ { params := strings.SplitN(string(path[1:]), sep, 4) nid := params[0] tname := params[1] key := params[2] _, _, _ = nid, tname, key } } func BenchmarkSplitNBytes(b *testing.B) { path := []byte("/one/two/three/four") sep := []byte("/") b.ReportAllocs() for i := 0; i < b.N; i++ { params := bytes.SplitN(path[1:], sep, 4) nid := string(params[0]) tname := string(params[1]) key := string(params[2]) _, _, _ = nid, tname, key } }

BenchmarkSplitString BenchmarkSplitString-10 14783635 79.58 ns/op 88 B/op 2 allocs/op BenchmarkSplitBytes BenchmarkSplitBytes-10 16331946 72.41 ns/op 96 B/op 1 allocs/op BenchmarkSplitNString BenchmarkSplitNString-10 16576970 71.87 ns/op 88 B/op 2 allocs/op BenchmarkSplitNBytes BenchmarkSplitNBytes-10 19375464 63.30 ns/op 96 B/op 1 allocs/op

I initially had that that but whatever you gain by that is offset by the fact that you need to redo the conversion in deleteEntry.

thaJeztah · 2022-11-21T11:14:45Z

libnetwork/networkdb/cluster.go


-			params := strings.Split(path[1:], "/")
+			params := strings.Split(string(path[1:]), "/")
 			nid := params[0]


Looks like this is redundant, as we already know the prefix (which is what we're passing to iterate), which means we can also (instead of only stripping the / prefix), strip /+nid+/, and take params[0], params[1] (instead of 1 and 2);

// Format is "/<networkID>/<tableName>/<endpointID>/<value>", // trim "/+nid+/" before splitting. params := bytes.Split(path[len(nid)+2:], []byte("/")) tname := string(params[0]) key := string(params[1])

deleteEntry needs both /<tname>/<nid>/<key> and /<nid>/<tname>/<key> so there's no way around it. I mean we could pass the whole string for /<nid>/<tname>/<key> but then we'd have to pass tname nid and key anyway for the first path.

thaJeztah · 2022-11-21T12:14:25Z

libnetwork/networkdb/cluster.go

+			params := strings.Split(string(path[1:]), "/")
 			nid := params[0]
 			tname := params[1]
 			key := params[2]


Silly question (I can't comment on the line below); as the problem was that we were deleting inside the loop, and as we're already taking a lock, would it make sense to collect the list of entries to delete in WalkPrefix, and then delete them outside of it? Would that make it more performant?

Also wondering; it seems we're using the WalkPrefix to collect all entries with the given prefix so that we can delete all of them; if (IIUC), the format is /<networkID>/<tableName>/<endpointID>/<value>, but we only collect /<networkID>/<tableName>/<endpointID>, that means we're manually performing a delete multiple times (once for each <value>). Perhaps we could make use of DeletePrefix() 🤔

Yeah I thought about this too. The current logic is very hairy because it's not simply a deleting a bunch of radixes, it also checks for each entry if it was successfully deleted and only then it updates the state of the number of entries in that network: https://github.com/moby/moby/pull/44501/files#diff-7b5363044492af3bcc6d8117470bfc51476d389c7a90670cfd5c62a83c32445aR777

And there's that whole timecompensation logic too. So yeah, again, I tried to change the least amount of logic in an area I'm not very comfortable in. If somebody wants to decipher the timecompensation logic and use DeletePrefix that's just an optimization which is not the purpose of this PR.

thaJeztah · 2022-11-21T12:27:58Z

libnetwork/networkdb/networkdb.go


 func (nDB *NetworkDB) getEntry(tname, nid, key string) (*entry, error) {
-	e, ok := nDB.indexes[byTable].Get(fmt.Sprintf("/%s/%s/%s", tname, nid, key))
+	e, ok := nDB.indexes[byTable].Get([]byte(fmt.Sprintf("/%s/%s/%s", tname, nid, key)))


We should probably also consider using concatenation for these simple cases (instead of Sprintf())

I don't think it's worth to sacrifice readability in this case.

thaJeztah · 2022-11-21T12:32:43Z

libnetwork/networkdb/networkdb.go

+func (nDB *NetworkDB) deleteEntry(nid, tname, key string) (okTable bool, okNetwork bool) {
+	nDB.indexes[byTable], _, okTable = nDB.indexes[byTable].Delete([]byte(fmt.Sprintf("/%s/%s/%s", tname, nid, key)))
+	nDB.indexes[byNetwork], _, okNetwork = nDB.indexes[byNetwork].Delete([]byte(fmt.Sprintf("/%s/%s/%s", nid, tname, key)))


See my other comment; perhaps if we drop the key for these, we can use DeletePrefix;

Suggested change

func (nDB *NetworkDB) deleteEntry(nid, tname, key string) (okTable bool, okNetwork bool) {

nDB.indexes[byTable], _, okTable = nDB.indexes[byTable].Delete([]byte(fmt.Sprintf("/%s/%s/%s", tname, nid, key)))

nDB.indexes[byNetwork], _, okNetwork = nDB.indexes[byNetwork].Delete([]byte(fmt.Sprintf("/%s/%s/%s", nid, tname, key)))

func (nDB *NetworkDB) deleteEntry(nid, tname string) (okTable bool, okNetwork bool) {

nDB.indexes[byTable], okTable = nDB.indexes[byTable].DeletePrefix([]byte(fmt.Sprintf("/%s/%s", tname, nid)))

nDB.indexes[byNetwork], okNetwork = nDB.indexes[byNetwork].DeletePrefix([]byte(fmt.Sprintf("/%s/%s", nid, tname)))

We may want to do the actual delete outside of the WalkPrefix function in that case though to remove duplicates first (although it looks like DeletePrefix will just ignore if it was already deleted).

What happens when for some unknown reason (something something timeCompensation that I don't understand) not all entries in a prefix are to be deleted? Then you cannot use DeletePrefix, you have to redo a walk for each entry.

This commit allows to remove dependency on the mutable version armon/go-radix. The go-immutable-radix package is better maintained. It is likely that a bit more memory will be used when using the immutable version, though discarded nodes are being reused in a pool. These changes happen when networks are added/removed or nodes come and go in a cluster, so we are still talking about a relatively low frequency event. The major changes compared to the old radix are when modifying (insert or delete) a tree, and those are pretty self-contained: we replace the entire immutable tree under a lock. Signed-off-by: Tibor Vass <teabee89@gmail.com>

tiborvass · 2022-12-01T01:54:47Z

Rebased

thaJeztah

LGTM, thanks!

tiborvass requested review from neersighted and thaJeztah November 18, 2022 22:16

tiborvass mentioned this pull request Nov 18, 2022

vendor: remove most "replace" rules and update github.com/armon/go-radix #44498

Merged

thaJeztah reviewed Nov 21, 2022

View reviewed changes

tiborvass force-pushed the immutable_radix branch from 97bfaba to eaa7449 Compare December 1, 2022 01:03

AkihiroSuda approved these changes Dec 6, 2022

View reviewed changes

thaJeztah added status/2-code-review area/networking Networking kind/refactor PR's that refactor, or clean-up code labels Dec 6, 2022

thaJeztah added this to the v-next milestone Dec 6, 2022

thaJeztah approved these changes Dec 6, 2022

View reviewed changes

thaJeztah merged commit cc1884d into moby:master Dec 6, 2022

thaJeztah mentioned this pull request May 7, 2025

libn/networkdb: fix data race in GetTableByNetwork #49937

Merged

Conversation

tiborvass commented Nov 18, 2022

Uh oh!

tiborvass commented Nov 18, 2022

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tiborvass commented Dec 1, 2022

Uh oh!

thaJeztah left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants