Debugging: add builtin gdbstub component. by cfallin · Pull Request #12771 · bytecodealliance/wasmtime

cfallin · 2026-03-12T22:45:44Z

This adds a debug component that makes use of the debug-main world defined in #12756 and serves the gdbstub protocol, with Wasm extensions, compatible with LLDB.

This component is built and included inside the Wasmtime binary, and is loaded using the lower-level -D debugger=... debug-main option; the user doesn't need to specify the .wasm adapter component. Instead, the user simply runs wasmtime run -g <PORT> program.wasm ... and Wasmtime will load and prepare to run program.wasm as the debuggee, waiting for a gdbstub connection on the given TCP port before continuing.

The workflow is:

$ wasmtime run -g 1234 program.wasm
[ wasmtime starts and waits for connection ]

$ /opt/wasi-sdk/bin/lldb  # use LLDB from wasi-sdk release 32 or later
(lldb) process connect --plugin wasm connect://localhost:1234
Process 1 stopped
* thread #1, stop reason = signal SIGTRAP
    frame #0: 0x40000000000001cc
->  0x40000000000001cc: unreachable
    0x40000000000001cd: end
    0x40000000000001ce: local.get 0
    0x40000000000001d0: call   13
(lldb) si
Process 1 stopped
* thread #1, stop reason = instruction step into
    frame #0: 0x4000000000000184
->  0x4000000000000184: block
    0x4000000000000186: block
    0x4000000000000188: global.get 1
    0x400000000000018e: i32.const 3664
[ ... ]

This makes use of the gdbstub third-party crate, into which I've upstreamed support for the Wasm extensions in daniel5151/gdbstub#188, daniel5151/gdbstub#189, daniel5151/gdbstub#190, and daniel5151/gdbstub#192. (I'll add vets as part of this PR.)

cfallin · 2026-03-12T22:46:31Z

This is stacked on top of #12756 until that one lands; only the last commit is new.

I haven't added end-to-end tests that spawn/interact with LLDB yet; depending on how that goes I might be able to include that here or might defer to another PR if that's OK.

cfallin · 2026-03-13T19:24:57Z

Rebased out #12756; should be good to review now.

github-actions · 2026-03-13T21:51:52Z

Subscribe to Label Action

cc @fitzgen

Details

This issue or pull request has been labeled: "wizer"

Thus the following users have been cc'd because of the following labels:

fitzgen: wizer

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

crates/gdbstub-component/artifact/Cargo.toml

crates/gdbstub-component/artifact/build.rs

crates/gdbstub-component/Cargo.toml

src/commands/run.rs

alexcrichton · 2026-03-16T16:23:44Z

Also, to clarify, @cfallin what depth would you like me to review the gdbstub component code itself? I'm happy more-or-less not reviewing it at all in the sense that it's well-sequestered, low-risk, and we'll likely iterate a lot on it in-tree. If you'd prefer though I could give it a closer look in any particular areas of interest.

cfallin · 2026-03-16T17:20:09Z

Also, to clarify, @cfallin what depth would you like me to review the gdbstub component code itself? I'm happy more-or-less not reviewing it at all in the sense that it's well-sequestered, low-risk, and we'll likely iterate a lot on it in-tree. If you'd prefer though I could give it a closer look in any particular areas of interest.

I guess my default answer is "to whatever extent allows us to fulfill policy and be comfortable having this code in-repo" :-) I agree that since it's sandboxed, the bar could be lower than for core runtime code. I guess the spirit of our code-review policies is still that someone should give it a once-over -- but up to you how deep you take that!

crates/gdbstub-component/src/lib.rs

crates/gdbstub-component/src/target.rs

…g forward to first opcode. LLDB, when instructed to `break main`, looks at the DWARF metadata for `main` and finds its PC range, then sets a breakpoint at the first PC. This is reasonable behavior for native ISAs! That PC better be a real instruction! On Wasm, however, (i) toolchains typically emit the PC range as *including* the *locals count*, a leb128 value that precedes the first opcode and any types of locals; (ii) our gdbstub component that bridges LLDB to our debug APIs (bytecodealliance#12771) only supports *exact* PCs for breakpoints, so when presented with a PC that does not actually point to an opcode, setting the breakpoint is effectively a no-op. There will always be a difference of at least 1 byte between the start-of-function offset and first-opcode offset (for a leb128 of `0` for no locals), so a breakpoint "on" a function will never work. I initially prototyped a fix that adds a sequence point at the start of every function (which, again, is *guaranteed* to be distinct from the first opcode), and the branch is [here], but I didn't like the developer experience: this meant that when a breakpoint at a function start fired, LLDB had a weird interstitial state where no line-number applied. The behavior that would be closer in line with "native" debug expectations is that we add a bit of fuzzy-ish matching: setting a breakpoint at function start should break at the first opcode, even if that's a few (or many) bytes later. There are two options here: special-case function start, or generally change the semantics of our breakpoint API so that "add breakpoint at `pc`" means "add breakpoint at next opcode at or after `pc`". I opted for the latter in this PR because it's more consistent. The logic is a little subtle because we're effectively defining an n-to-1 mapping with this "snap-to-next" behavior, so we have to refcount each breakpoint (consider setting a breakpoint at function start *and* at the first opcode, then deleting them, one at a time). I believe the result is self-consistent, even if a little more complicated now. And, importantly, with bytecodealliance#12771 on top of this change, it produces the expected behavior for the (very simple!) debug script "`b main`; `continue`". [here]: https://github.com/cfallin/wasmtime/tree/breakpoint-at-func-start

…et PCs on traps. This was not exposed earlier by (i) lack of handling of trap events in the initial version of the gdbstub component in bytecodealliance#12771, and (ii) lack of asserting some value for the PC on the top frame in the debug-event test for traps. We got the PC for the last opcode in the function body previously because, with no debug tags on the trapping path that calls raise() (sunk to the bottom of the machine code body as cold code), we scanned backward for the last tag metadata and found that instead. Adding metadata according to the current source location when emitting traps fixes this for all trapping events.

…g forward to first opcode. (#12791) LLDB, when instructed to `break main`, looks at the DWARF metadata for `main` and finds its PC range, then sets a breakpoint at the first PC. This is reasonable behavior for native ISAs! That PC better be a real instruction! On Wasm, however, (i) toolchains typically emit the PC range as *including* the *locals count*, a leb128 value that precedes the first opcode and any types of locals; (ii) our gdbstub component that bridges LLDB to our debug APIs (#12771) only supports *exact* PCs for breakpoints, so when presented with a PC that does not actually point to an opcode, setting the breakpoint is effectively a no-op. There will always be a difference of at least 1 byte between the start-of-function offset and first-opcode offset (for a leb128 of `0` for no locals), so a breakpoint "on" a function will never work. I initially prototyped a fix that adds a sequence point at the start of every function (which, again, is *guaranteed* to be distinct from the first opcode), and the branch is [here], but I didn't like the developer experience: this meant that when a breakpoint at a function start fired, LLDB had a weird interstitial state where no line-number applied. The behavior that would be closer in line with "native" debug expectations is that we add a bit of fuzzy-ish matching: setting a breakpoint at function start should break at the first opcode, even if that's a few (or many) bytes later. There are two options here: special-case function start, or generally change the semantics of our breakpoint API so that "add breakpoint at `pc`" means "add breakpoint at next opcode at or after `pc`". I opted for the latter in this PR because it's more consistent. The logic is a little subtle because we're effectively defining an n-to-1 mapping with this "snap-to-next" behavior, so we have to refcount each breakpoint (consider setting a breakpoint at function start *and* at the first opcode, then deleting them, one at a time). I believe the result is self-consistent, even if a little more complicated now. And, importantly, with #12771 on top of this change, it produces the expected behavior for the (very simple!) debug script "`b main`; `continue`". [here]: https://github.com/cfallin/wasmtime/tree/breakpoint-at-func-start

…et PCs on traps. (#12802) This was not exposed earlier by (i) lack of handling of trap events in the initial version of the gdbstub component in #12771, and (ii) lack of asserting some value for the PC on the top frame in the debug-event test for traps. We got the PC for the last opcode in the function body previously because, with no debug tags on the trapping path that calls raise() (sunk to the bottom of the machine code body as cold code), we scanned backward for the last tag metadata and found that instead. Adding metadata according to the current source location when emitting traps fixes this for all trapping events.

This adds a debug component that makes use of the debug-main world defined in bytecodealliance#12756 and serves the gdbstub protocol, with Wasm extensions, compatible with LLDB. This component is built and included inside the Wasmtime binary, and is loaded using the lower-level `-D debugger=...` debug-main option; the user doesn't need to specify the `.wasm` adapter component. Instead, the user simply runs `wasmtime run -g <PORT> program.wasm ...` and Wasmtime will load and prepare to run `program.wasm` as the debuggee, waiting for a gdbstub connection on the given TCP port before continuing. The workflow is: ``` $ wasmtime run -g 1234 program.wasm [ wasmtime starts and waits for connection ] $ /opt/wasi-sdk/bin/lldb # use LLDB from wasi-sdk release 32 or later (lldb) process connect --plugin wasm connect://localhost:1234 Process 1 stopped * thread #1, stop reason = signal SIGTRAP frame #0: 0x40000000000001cc -> 0x40000000000001cc: unreachable 0x40000000000001cd: end 0x40000000000001ce: local.get 0 0x40000000000001d0: call 13 (lldb) si Process 1 stopped * thread #1, stop reason = instruction step into frame #0: 0x4000000000000184 -> 0x4000000000000184: block 0x4000000000000186: block 0x4000000000000188: global.get 1 0x400000000000018e: i32.const 3664 [ ... ] ``` This makes use of the `gdbstub` third-party crate, into which I've upstreamed support for the Wasm extensions in daniel5151/gdbstub#188, daniel5151/gdbstub#189, daniel5151/gdbstub#190, and daniel5151/gdbstub#192. (I'll add vets as part of this PR.)

alexcrichton

I'm realizing that I'm going to be gone for awhile after wasm.io and I don't want to leave this languishing. With the various comments I've left I think this is fine to land and iterate in-tree, and if you'd prefer feel free to defer anything to an issue and/or a follow-up PR.

…en isolated crates are used).

cfallin · 2026-03-23T21:56:33Z

Merged failed due to crate publish checks seeing that wasmtime-internal-gdbstub-component-artifact doesn't exist on crates.io yet; I'm working through the runbook here at the moment and am blocked in getting someone with the right access to accept the crate ownership.

…is published.

cfallin · 2026-03-23T22:59:35Z

It seems our publish script checks that crates compile as-published, so the compile_error! in the artifact crate when built in isolation without the component crate is no-go; for now altered build.rs to generate an empty array instead. Since the feature is off by default and our published release artifacts will be built in a way that includes the actual component, the risk of unexpected behavior seems small enough to me for now, but I'm happy to iterate on this if anyone has better ideas!

cfallin · 2026-03-23T23:34:01Z

Another merge queue failure: the check here for MSRV 1.91.0 fails because the gdbstub adapter uses wstd 0.6.6 which requires rustc 1.91.1. Do we technically still fit in the "N-2" policy if we require a patch release? If so should we bump to 1.91.1?

(This seems reasonable to me because in general someone stuck on 1.91 should be upgrading patch releases to fix bugs, but I'm curious if anyone has an objection)

cc @pchickey @alexcrichton @fitzgen

cfallin · 2026-03-23T23:41:33Z

Ah actually I think we're just out-of-date with our MSRV -- #12828 to bump.

cfallin requested review from a team as code owners March 12, 2026 22:45

cfallin requested review from dicej and removed request for a team March 12, 2026 22:45

cfallin requested review from alexcrichton and removed request for dicej March 12, 2026 22:46

cfallin force-pushed the gdbstub-component branch from d7959df to fc1f75a Compare March 12, 2026 22:47

cfallin mentioned this pull request Mar 12, 2026

Debugging: add the debug-main world. #12756

Merged

cfallin force-pushed the gdbstub-component branch 6 times, most recently from 2719201 to 71bd19d Compare March 13, 2026 08:01

cfallin mentioned this pull request Mar 13, 2026

LLDB in latest release cannot read symbols in guest binary WebAssembly/wasi-sdk#612

Closed

cfallin force-pushed the gdbstub-component branch 2 times, most recently from 34e9d51 to c0c1f02 Compare March 13, 2026 19:23

This was referenced Mar 13, 2026

Debugging: implement debug component support for wasmtime serve #12776

Closed

Debugging: tracking issue for MVP and post-MVP work #12777

Open

github-actions bot added the wizer Issues related to Wizer snapshotting, pre-initialization, and the `wasmtime wizer` subcommand label Mar 13, 2026

alexcrichton reviewed Mar 16, 2026

View reviewed changes

cfallin mentioned this pull request Mar 17, 2026

Debugging: allow breakpoints to be set at "function start" by slipping forward to first opcode. #12791

Merged

cfallin force-pushed the gdbstub-component branch from e73119a to 2c56975 Compare March 19, 2026 01:23

cfallin added 3 commits March 19, 2026 16:54

cargo vets.

51b4a10

Handle Trap events as well as breakpoints.

5aef60e

cfallin force-pushed the gdbstub-component branch from 2c56975 to 5aef60e Compare March 19, 2026 23:55

alexcrichton approved these changes Mar 20, 2026

View reviewed changes

cfallin added 4 commits March 23, 2026 13:37

Review feedback.

ec94dcf

Fix gdbstub artifact build to make it publishable (by disabling it wh…

f5a1517

…en isolated crates are used).

Review feedback.

7528625

fix published-crates list

b0625d5

cfallin enabled auto-merge March 23, 2026 21:16

cfallin added this pull request to the merge queue Mar 23, 2026

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 23, 2026

cfallin added this pull request to the merge queue Mar 23, 2026

For now, empty gdbstub data but no compile error when artifact crate …

63406fa

…is published.

cfallin removed this pull request from the merge queue due to a manual request Mar 23, 2026

cfallin enabled auto-merge March 23, 2026 22:57

cfallin added this pull request to the merge queue Mar 23, 2026

add some more Cargo metadata: version for artifact crate dep

2fd5716

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 23, 2026

cfallin added this pull request to the merge queue Mar 24, 2026

Merged via the queue into bytecodealliance:main with commit dbaaa92 Mar 24, 2026
46 checks passed

cfallin deleted the gdbstub-component branch March 24, 2026 18:50

Conversation

cfallin commented Mar 12, 2026

Uh oh!

cfallin commented Mar 12, 2026

Uh oh!

cfallin commented Mar 13, 2026

Uh oh!

github-actions bot commented Mar 13, 2026

Subscribe to Label Action

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexcrichton commented Mar 16, 2026

Uh oh!

cfallin commented Mar 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexcrichton left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cfallin commented Mar 23, 2026

Uh oh!

Uh oh!

cfallin commented Mar 23, 2026

Uh oh!

cfallin commented Mar 23, 2026

Uh oh!

Uh oh!

cfallin commented Mar 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants