Use libnftables in dynamically linked binary by robmry · Pull Request #51033 · moby/moby

robmry · 2025-09-24T11:03:04Z

- What I did

When dockerd is dynamically linked, use cgo to call functions in libnftables instead of exec-ing the nft binary.

On my M2 macbook, using the library knocks about 20ms off each update, down to less than 1ms. Initialisation and teardown operations happen sequentially, so the time saving comes directly off the total.

Using libnftables brings nftables update performance in line with iptables - typical times ...

	iptables	nft	libnftables
docker network create --ipv6 b46	48ms	57ms	10ms
docker run --rm -ti --network b46 -p 8080:80 busybox (start)	95ms	185ms	109ms

Click for OTEL traces ...

Network create

iptables

nft binary

libnftables

Container start

iptables

nft binary

libnftables

- How I did it

Use cgo to call libnftables functions.

In packaging scripts/config, we'll need to add a build dependency on libnftables-dev and a installation dependency on libnftables.

- How to verify it

Unit tests use the library, integration tests (against the statically linked binary) use the nft tool.

- Human readable description for the release notes

- When dynamically linked, the Docker daemon now depends on libnftables.

corhere · 2025-09-24T16:45:14Z

daemon/libnetwork/drivers/bridge/internal/nftabler/nftabler.go

+	_ = nft.table4.Close()
+	_ = nft.table6.Close()
+	return nil


Suggested change

_ = nft.table4.Close()

_ = nft.table6.Close()

return nil

return errors.Join(nft.table4.Close(), nft.table6.Close())

corhere · 2025-09-24T17:08:54Z

daemon/libnetwork/internal/nftables/nftables_linux.go

 	MustFlush      bool

 	applyLock sync.Mutex
+	nftHandle any // applyLock must be held to access


I have an idea on how to make this field concretely typed.

Suggested change

nftHandle any // applyLock must be held to access

nftHandle nftHandle // applyLock must be held to access

// nft_cgo_linux.go type nftHandle = *C.struct_nft_ctx

// nft_exec_linux.go type nftHandle = struct{}

Good idea, thank you! Done.

Signed-off-by: Rob Murray <rob.murray@docker.com>

akerouanton

LGTM, but wondering if we could do without table.applyLock.

akerouanton · 2025-09-25T08:39:31Z

daemon/libnetwork/internal/nftables/nft_cgo_linux.go

+	defer span.End()
+
+	if t.nftHandle == nil {
+		handle, err := newNftHandle()


How costly is it to instantiate a new struct nft_ctx and allocate new out/err buffers? I'm wondering if we could avoid table.applyLock if each call to nftApply is using its own struct nft_ctx as this lock serializes operations on independent networks / sandboxes.

The applyLock isn't new ... it's there to protect the table while updates from a Modifier are applied to it, while an nftables command buffer is generated from the table, and during nftApply (in case the update fails and changes to the table need to be rolled back).

The Daemon has a single table in the host's netns (plus tables in each container netns for DNS) with rules for all networks/endpoints. The table in the host netns has nftables base chains that do a single vmap lookup to decide whether the packet needs further processing. If they do, packets are processed by short chains dealing with a specific network. The alternative would be a table per-network, then it'd be possible to make updates for different networks in parallel. But then, instead of a single vmap lookup, each packet (including non-Docker packets) would need to be matched against base chain rules in each of the per-network tables.

robmry · 2025-09-25T17:21:45Z

I'm still looking at the packaging changes needed for this ... please don't merge it yet.

With this tag, a dynamically linked binary will exec the nft tool instead of using cgo to call libnftables directly. Signed-off-by: Rob Murray <rob.murray@docker.com>

robmry · 2025-09-26T15:22:47Z

I'm still looking at the packaging changes needed for this ... please don't merge it yet.

I've added a commit that makes it possible to build a dynamically linked dockerd that still execs "nft" - we'll use that for RHEL builds for now (experimental nftables release). RHEL requires a subscription to get hold of the "nftables-devel" package, and we've been waiting on an arm64 license.

Packaging PR - docker/docker-ce-packaging#1256

robmry added this to the 29.0.0 milestone Sep 24, 2025

robmry self-assigned this Sep 24, 2025

robmry added kind/enhancement Enhancements are not bugs or new features but can improve usability or performance. area/networking Networking impact/changelog area/networking/firewalling Networking labels Sep 24, 2025

robmry force-pushed the use-libnftables branch 6 times, most recently from 3626da4 to 5b37749 Compare September 24, 2025 14:16

robmry marked this pull request as ready for review September 24, 2025 16:13

robmry requested review from akerouanton and corhere September 24, 2025 16:13

corhere approved these changes Sep 24, 2025

View reviewed changes

Use libnftables in dynamically linked binary

6db6de2

Signed-off-by: Rob Murray <rob.murray@docker.com>

robmry force-pushed the use-libnftables branch from 5b37749 to 6db6de2 Compare September 24, 2025 17:27

akerouanton approved these changes Sep 25, 2025

View reviewed changes

Add build tag "no_libnftables"

38fb0dd

With this tag, a dynamically linked binary will exec the nft tool instead of using cgo to call libnftables directly. Signed-off-by: Rob Murray <rob.murray@docker.com>

robmry mentioned this pull request Sep 26, 2025

Add nftables / libnftables dependencies docker/docker-ce-packaging#1256

Merged

robmry merged commit b26972f into moby:master Oct 3, 2025
342 of 348 checks passed

robmry mentioned this pull request Oct 15, 2025

Use libnftables #50502

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use libnftables in dynamically linked binary#51033

Use libnftables in dynamically linked binary#51033
robmry merged 2 commits intomoby:masterfrom
robmry:use-libnftables

robmry commented Sep 24, 2025 •

edited by vvoland

Loading

Uh oh!

corhere Sep 24, 2025

Uh oh!

robmry Sep 24, 2025

Uh oh!

corhere Sep 24, 2025

Uh oh!

robmry Sep 24, 2025

Uh oh!

akerouanton left a comment

Uh oh!

akerouanton Sep 25, 2025

Uh oh!

robmry Sep 25, 2025 •

edited

Loading

Uh oh!

robmry commented Sep 25, 2025

Uh oh!

robmry commented Sep 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	nftHandle any // applyLock must be held to access
	nftHandle nftHandle // applyLock must be held to access

Conversation

robmry commented Sep 24, 2025 • edited by vvoland Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Network create

iptables

nft binary

libnftables

Container start

iptables

nft binary

libnftables

Uh oh!

corhere Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

robmry Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

corhere Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

robmry Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

akerouanton left a comment

Choose a reason for hiding this comment

Uh oh!

akerouanton Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

robmry Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

robmry commented Sep 25, 2025

Uh oh!

robmry commented Sep 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

robmry commented Sep 24, 2025 •

edited by vvoland

Loading

robmry Sep 25, 2025 •

edited

Loading