`cranelift-frontend`: Fix stack maps and liveness for loops by fitzgen · Pull Request #9071 · bytecodealliance/wasmtime

fitzgen · 2024-08-02T19:36:27Z

Previously, we were not properly handling back edges. This manifested in values incorrectly being considered not-live inside loop bodies where they definitely were live. Consider the following example:

block0:
  v0 = needs stack map

block1:
  call foo(v0)
  call foo(v0)
  jump block1

We were previously considering v0 live only for the first call foo(v0) but not the second, because we mistakenly concluded that v0 would not be used again after that second call. While it won't be used again in this iteration of the loop, it will be used again in the next iteration of the loop.

Trevor and I initially tried implementing a clever trick suggested by Chris where, if we know the minimum post-order index of all of a block's transitive predecessors, we can continue to compute liveness in a single pass over the IR. We believed we could compute the minimum predecessor post-order index via dynamic programming. It turns out, however, that approach doesn't provide correct answers out of the box for certain kinds of irreducible control flow, only nearly correct answers, and would require an additional clever fix-up pass afterwards. We deemed this cleverness on cleverness unacceptable.

Instead, Trevor and I opted to implement a worklist algorithm where we process blocks to a fixed-point. This has the advantages of being obviously correct and producing more-precise results. It has the disadvantage of requiring multiple passes over the IR in the face of loops and back edges. Because this analysis is only used when needs-stack-map values are present (i.e. when the function uses GC values) and is otherwise skipped, this additional compile-time overhead is tolerable.

Previously, we were not properly handling back edges. This manifested in values incorrectly being considered not-live inside loop bodies where they definitely were live. Consider the following example: block0: v0 = needs stack map block1: call foo(v0) call foo(v0) jump block1 We were previously considering `v0` live only for the first `call foo(v0)` but not the second, because we mistakenly concluded that `v0` would not be used again after that second `call`. While it won't be used again in *this* iteration of the loop, it will be used again in the *next* iteration of the loop. Trevor and I initially tried implementing a clever trick suggested by Chris where, if we know the minimum post-order index of all of a block's transitive predecessors, we can continue to compute liveness in a single pass over the IR. We believed we could compute the minimum predecessor post-order index via dynamic programming. It turns out, however, that approach doesn't provide correct answers out of the box for certain kinds of irreducible control flow, only nearly correct answers, and would require an additional clever fix-up pass afterwards. We deemed this cleverness on cleverness unacceptable. Instead, Trevor and I opted to implement a worklist algorithm where we process blocks to a fixed-point. This has the advantages of being obviously correct and producing more-precise results. It has the disadvantage of requiring multiple passes over the IR in the face of loops and back edges. Because this analysis is only used when needs-stack-map values are present (i.e. when the function uses GC values) and is otherwise skipped, this additional compile-time overhead is tolerable. Co-Authored-By: Trevor Elliott <telliott@fastly.com>

cfallin

Looks great, thanks! Some minor requests for clarification in comments but otherwise good to go.

cranelift/frontend/src/frontend/safepoints.rs

fitzgen requested a review from a team as a code owner August 2, 2024 19:36

fitzgen requested review from elliottt and removed request for a team August 2, 2024 19:36

fitzgen force-pushed the stack-maps-and-liveness-and-loops branch from 13e56db to c145baf Compare August 2, 2024 19:36

fitzgen requested review from cfallin and removed request for elliottt August 2, 2024 19:46

cfallin approved these changes Aug 2, 2024

View reviewed changes

cranelift/frontend/src/frontend/safepoints.rs Show resolved Hide resolved

cranelift/frontend/src/frontend/safepoints.rs Outdated Show resolved Hide resolved

cranelift/frontend/src/frontend/safepoints.rs Outdated Show resolved Hide resolved

Add and update some comments based on review

f9f4543

fitzgen enabled auto-merge August 2, 2024 21:45

fitzgen added this pull request to the merge queue Aug 2, 2024

Merged via the queue into bytecodealliance:main with commit dbc503f Aug 2, 2024

fitzgen deleted the stack-maps-and-liveness-and-loops branch August 2, 2024 22:08

fitzgen mentioned this pull request Aug 6, 2024

Switch to new "user" stack maps and use i32 for GC refs in Wasmtime #9082

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`cranelift-frontend`: Fix stack maps and liveness for loops#9071

`cranelift-frontend`: Fix stack maps and liveness for loops#9071
fitzgen merged 2 commits intobytecodealliance:mainfrom
fitzgen:stack-maps-and-liveness-and-loops

fitzgen commented Aug 2, 2024

Uh oh!

cfallin left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

fitzgen commented Aug 2, 2024

Uh oh!

cfallin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants