Add preemption to escape pure-cpu busy-loops #3520
Conversation
Force-pushed from b806900 to b8cd997
Tor benchmark: https://github.com/shadow/benchmark/actions/runs/13932935805. (With this feature disabled, as per the default, to ensure it doesn't introduce unacceptable overhead when not actually used.)
I also started/queued a tor benchmark with preemption enabled and with 3 repetitions. Not that it will affect whether we merge this or anything, I'm just curious what the results look like. If it looks like it will take a long time, then we can cancel it.
Results: https://github.com/shadow/benchmark-results/blob/master/tor/2025-03-18-T20-32-33/plots/run_time.png Looks like a small but noticeable performance hit :(. 7318 s vs 7219 s, so about 1.3%. My prime suspect is the syscall handler, where I'm no longer skipping some code when `model-unblocked-syscall-latency` is disabled. I'll do some more experimentation tomorrow to try to confirm if this is the culprit. (I suppose I should also run multiple trials, but I think this is a big enough regression to probably not just be "unlucky")
stevenengler
left a comment
I didn't finish reviewing it yet (I haven't looked at most of the shim changes), so just leaving the comments I have so far.
I'll do some more experimentation tomorrow to try to confirm if this is the culprit. (I suppose I should also run multiple trials, but I think this is a big enough regression to probably not just be "unlucky")
If the benchmark I started is still running when you want to start another, please cancel mine, I don't want it to block anything.
Looks like a small but noticeable performance hit :(. 7318 s vs 7219 s, so about 1.3%.
My prime suspect is the syscall handler, where I'm no longer skipping some code when `model-unblocked-syscall-latency` is disabled
IMO I don't think the difference is big enough to prevent merging. But if it turns out that the performance regression can be eliminated by skipping that code path, I think it would be nice to have.
Haven't been able to reproduce the performance regression locally, possibly just because my system is too noisy. I made some tweaks to the hot path in the shim and am trying again with 3 trials: https://github.com/shadow/benchmark/actions/runs/13936903437
After that last change and running 3 trials, the performance difference looks to be in the noise. https://github.com/shadow/benchmark-results/blob/master/tor/2025-03-19-T06-28-41/plots/run_time.png Performance with the feature enabled is definitely slower (as expected), though not horrendous https://github.com/shadow/benchmark-results/blob/master/tor/2025-03-18-T22-45-17/plots/run_time.png. I think the only way to improve it substantially would be to use the per-thread
Also worth noting that, as expected, having preemption enabled doesn't appear to have changed any of the results inside the simulation https://github.com/shadow/benchmark-results/tree/master/tor/2025-03-18-T22-45-17. I expect there were probably no preemptions. When I've run the test suite locally (including the tor-minimal test) with the feature globally enabled, the only test besides the new cpu-busy-loop test that triggers it is the mmap test when initializing large chunks of memory, and even then only in debug builds.
stevenengler
left a comment
Looks good, I think this will be a nice feature to have.
(Just pasting the image into a comment so we have it even if we need to prune benchmark results from the repo in the future.)
This new feature is off by default, and enabled via the new experimental `--native-preemption-enabled` option. When enabled, the shim code uses a native timer to interrupt managed code that runs for longer than `--native-preemption-native-interval` (100 ms by default) without returning control to shadow, and moves simulated time forward by `--native-preemption-sim-interval` (10 ms by default). It is intended to be used to escape CPU-only busy loops. Fixes shadow#2066
* Hold the host shmem lock longer to avoid multiple locks and unlocks.
* Replace "fine grained" temporaries copied from shmem with temporaries to shmem itself, for clarity.
* Reset `unapplied_cpu_latency` to zero when rescheduling too, not just when moving time forward. I'm fairly certain this was a (usually mostly-innocuous) bug.
Accessing thread-local-storage is cheap, but not free, and this code is very much on the hot path.
