CPU Pinning#1063

Closed
rwails wants to merge 4 commits into shadow:dev from rwails:cpu-pinning

Conversation

@rwails
Collaborator

@rwails rwails commented Jan 1, 2021

This PR adds an optional flag that enables CPU pinning for Shadow workers and plugins. Workers are pinned uniformly across the logical CPUs available on the machine. Plugin threads are pinned to the same logical CPU as the worker they belong to. This behavior is enabled with the --cpu-pin=1 flag.

Pinning can yield a significant performance improvement. On syscall dense workloads, I am measuring a 20-50% improvement in performance (using --cpu-pin=1 and --preload-spin-max=0).

Ryan Wails added 3 commits December 30, 2020 14:52
thread. Each plugin thread adopts the worker's affinity value during
`thread_new` and `thread_continue` procedure calls.

Cleanup routine in affinity.c needs to be improved.

Need to add a command line switch to selectively enable pinning.
@codecov

codecov bot commented Jan 1, 2021

Codecov Report

Merging #1063 (ec2b418) into dev (a0538d8) will increase coverage by 0.09%.
The diff coverage is 59.70%.


@@            Coverage Diff             @@
##              dev    #1063      +/-   ##
==========================================
+ Coverage   54.96%   55.05%   +0.09%     
==========================================
  Files         127      129       +2     
  Lines       19235    19276      +41     
  Branches     4590     4601      +11     
==========================================
+ Hits        10572    10613      +41     
+ Misses       5896     5885      -11     
- Partials     2767     2778      +11     
Flag Coverage Δ
tests 55.05% <59.70%> (+0.09%) ⬆️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
src/main/host/affinity.c 23.07% <23.07%> (ø)
src/main/core/worker.c 83.41% <75.86%> (+0.15%) ⬆️
src/main/host/affinity.h 100.00% <100.00%> (ø)
src/main/host/thread.c 60.74% <100.00%> (+1.82%) ⬆️
.../main/core/scheduler/scheduler_policy_host_steal.c 81.67% <0.00%> (-0.39%) ⬇️
src/main/core/scheduler/scheduler.c 77.40% <0.00%> (-0.34%) ⬇️
src/main/core/slave.c 68.95% <0.00%> (+0.65%) ⬆️
src/support/logger/rust_bindings/src/lib.rs 32.90% <0.00%> (+1.02%) ⬆️
... and 1 more

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update a0538d8...e7ede00.

@rwails rwails linked an issue Jan 1, 2021 that may be closed by this pull request
16 tasks
@rwails rwails added the Priority: High Prioritized ahead of most other issues label Jan 1, 2021
@jtracey
Contributor

jtracey commented Jan 1, 2021

I've not done any experiments with Phantom, but on the current release version of Shadow, I've run experiments showing that pinning to the A sides of CPUs (effectively disabling hyperthreads) is noticeably faster than using all logical CPUs. Specifically, on some large experiments, setting the number of workers to the number of physical cores and pinning to the A sides was significantly faster than pinning to both A and B sides, whether the number of workers matched the physical or the logical core count. (The details will likely vary with the specific workload, and I was pinning the entire process rather than each thread, so migrations were still happening.) Pinning can be accomplished with numactl, but since A and B sides are frequently (though not always) interleaved numerically, a user who doesn't know to do that and runs Shadow with fewer workers than logical CPUs will likely end up with workers bunched up on the same physical cores when distributing via worker_thread_id % _ncpus.

It might also be worth keeping threads on the same socket when multiple are available (and, strictly speaking, binding to the memory of those NUMA nodes too, though I think Linux is smart enough to handle that step on its own?). I don't know if there's a standard way to get all this information via glibc/syscalls, but it's available in /proc/cpuinfo.

Member

@robgjansen robgjansen left a comment


I think we tested the case that @jtracey found works better for classic shadow when we were testing phantom (i.e., effectively disabling hyperthreading); could you summarize our conclusions there?

assert(_ncpus > 0 && pid >= 0);

// We can short-circuit if there's no work to do.
if (!_affinity_enabled || new_cpu_num == AFFINITY_UNINIT || new_cpu_num == old_cpu_num) {
Member


If new_cpu_num == AFFINITY_UNINIT then don't we want to "unpin" the process to allow the scheduler to run it anywhere?

Collaborator Author


I'm using AFFINITY_UNINIT to denote the case when affinity is unknown or has never been set (as opposed to the case when affinity is set to all CPUs). I think it'd be better to introduce a new value to unpin/"unset" affinity if we want that behavior.

}

if (!set_affinity_suceeded) {
warning("Could not set CPU affinity for PID %d to %d", (int)pid, new_cpu_num);
Member


If we use the cpu-pin option and we can't pin, should that be an error or a warning? IIRC, Jim thought error makes more sense, rather than continuing to run the simulation without pinning and potentially tricking the user that pinning is active when it isn't. I'm thinking that logging a warning makes sense but then refusing to run the experiment. That's more work to code because you'll have to propagate the error back, but does it give us the behavior we want?

Collaborator Author


I promoted the log message to critical so that the user has a higher likelihood of seeing it.

Crashing with an error makes me nervous because there are potentially thousands of affinity changes that could occur over the course of an experiment (during work stealing, for example). If a single one of those affinity changes doesn't succeed, I'm not sure the experiment should crash since it could otherwise complete successfully with negligible performance impact.

Regardless, if we want to make this a crash instead of a warning, I think it should be done at the call site and not inside this module.

Contributor


Crashing with an error makes me nervous because there are potentially thousands of affinity changes that could occur over the course of an experiment

Haven't looked at the code in this PR yet but when I made that comment I believe the proposed pinning was all done during startup. If something goes wrong during startup I think it makes sense to crash. I agree we want to avoid crashing in the middle of a simulation if we can.

#include <sched.h>
#include <sys/types.h>

enum { AFFINITY_UNINIT = -1 };
Member


[optional nit] A brief note describing this would be nice.

Collaborator Author


OK, I added a comment.

@rwails
Collaborator Author

rwails commented Jan 6, 2021

I think we tested the case that @jtracey found works better for classic shadow when we were testing phantom (i.e., effectively disabling hyperthreading); could you summarize our conclusions there?

Yeah, so I believe most of the performance improvement from pinning comes from reducing the cost of context switching (this only applies in the process-oriented dev branch). By pinning workers and plugins to the same core, we achieve very low-cost context switching.

I agree with Justin that there's more benefit to be gained by intelligently pinning workers to particular CPUs, for example by taking NUMA nodes into account. Some experiments we've run so far show that pinning to hyperthread cores pessimizes performance, but on the machines I'm running experiments on, the HT cores are all high-numbered, so setting -w to num_cores / 2 avoids pinning to shared cores. I'd like to leave intelligent pinning to a different PR, since it could get complicated if we're parsing /proc/cpuinfo or pulling in a dependency to examine the platform.

Thanks for the feedback @jtracey !

@rwails rwails requested a review from robgjansen January 6, 2021 01:59
}

cpu_set_t* cpu_set = NULL;
bool set_affinity_suceeded = false;


typo, 'suceeded' (I noticed the typo while reading the discussion :)

@robgjansen
Member

We decided to close this PR and open 2 new ones in order to make sure we have the pinning feature in both master (classic shadow) and dev (phantom) branches. The first PR will add the feature to master, then we'll merge master to dev, and the second PR will make the changes needed to do the pinning in dev.

@sporksmith
Contributor

sporksmith commented Jan 11, 2021

We decided to close this PR and open 2 new ones in order to make sure we have the pinning feature in both master (classic shadow) and dev (phantom) branches. The first PR will add the feature to master, then we'll merge master to dev, and the second PR will make the changes needed to do the pinning in dev.

Ok, removing myself as a reviewer on this one, then :)

@sporksmith sporksmith removed their request for review January 11, 2021 21:56
@rwails rwails closed this Feb 3, 2021
@robgjansen robgjansen added the Tag: Performance Related to improving shadow's run-time label Mar 3, 2021

Labels

Priority: High Prioritized ahead of most other issues Tag: Performance Related to improving shadow's run-time


Development

Successfully merging this pull request may close these issues.

Investigate performance issues in phantom

5 participants