Spinning in the scheduler

In #2866 we found that if the worker threads spin while waiting for a new task rather than block, the runtime performance on the benchmark runner improves significantly. We should investigate this further, but should also test on systems other than the benchmark runner to make sure that the simulation performance doesn't become worse on other hardware platforms.

I've tried a few different spinning configurations:

## 1: Spin 100 times before blocking

Results: https://github.com/shadow/benchmark-results/tree/master/tgen/2023-04-13-T21-16-06
Shadow: dd651d82caaa3de43e5af5db16b18b1a6684d60b (diff: 0fe3be4ee57615bf91f7b30e223f923298f7ccc3...dd651d82caaa3de43e5af5db16b18b1a6684d60b)

A rust mutex spins 100 times when locking, so I wanted to try that here. But in a tgen simulation, there's no performance improvement:

![1681517713_grim](https://user-images.githubusercontent.com/3708797/232173141-0d3299f7-fe39-44e2-a853-46895f40a579.png)

## 2: Spin `u64::MAX` times before blocking

Results: https://github.com/shadow/benchmark-results/tree/master/tgen/2023-04-14-T02-21-54
Shadow: d0c2f9619e1967cbb36b5956d8ea87379ccb6190 (diff: 0fe3be4ee57615bf91f7b30e223f923298f7ccc3...d0c2f9619e1967cbb36b5956d8ea87379ccb6190)

Using `u64::MAX` to spin indefinitely has a large performance improvement (the same improvement we saw in #2866):

![1681517845_grim](https://user-images.githubusercontent.com/3708797/232173451-540d3423-fbdc-431e-be2c-b1939d9cd9a4.png)

## 3: Spin `u64::MAX` times before blocking and using `std::hint::spin_loop()`

Results: https://github.com/shadow/benchmark-results/tree/master/tgen/2023-04-14-T15-10-23
Shadow: 2017ac16be874dc12f2719a32fa15c2de57ba9bc (diff: 0fe3be4ee57615bf91f7b30e223f923298f7ccc3...2017ac16be874dc12f2719a32fa15c2de57ba9bc)

Like the previous version we spin `u64::MAX` times, but tell the CPU that it's a spin loop using `std::hint::spin_loop()`. We see the same performance improvement, but this is maybe better for energy efficiency:

![1681517993_grim](https://user-images.githubusercontent.com/3708797/232173580-5ed64f66-05f8-45d5-9a34-1c0ba7f81cff.png)

## 4: Spin `u64::MAX` times before blocking and using `std::thread::yield_now()`

Results: https://github.com/shadow/benchmark-results/tree/master/tgen/2023-04-14-T17-39-40
Shadow: 1cc31a0963483652c27663bacfa809b60b9a538a (diff: 0fe3be4ee57615bf91f7b30e223f923298f7ccc3...1cc31a0963483652c27663bacfa809b60b9a538a)

Like the previous version we spin `u64::MAX` times, but use `std::thread::yield_now()` instead of `std::hint::spin_loop()`. We see an even bigger performance improvement.

![1681518172_grim](https://user-images.githubusercontent.com/3708797/232173733-93ed08cd-af77-4ea4-8dc9-bc3b2e33eeef.png)

## 5: Don't spin, but use a futex_wait timeout of 1 us

Results: https://github.com/shadow/benchmark-results/tree/master/tgen/2023-04-16-T03-05-38
Shadow: 6414048ba3089a4129a01eaf4c30ea8e8cc423df (diff: 0fe3be4ee57615bf91f7b30e223f923298f7ccc3...6414048ba3089a4129a01eaf4c30ea8e8cc423df)

Had a small performance improvement, but not nearly as much as other approaches.

![1681669353_grim](https://user-images.githubusercontent.com/3708797/232333672-e45a491c-ade9-48ea-8048-42c119e2db56.png)

On the benchmark runner, it seems that spinning indefinitely with `std::thread::yield_now()` has the best performance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spinning in the scheduler #2877

1: Spin 100 times before blocking

2: Spin `u64::MAX` times before blocking

3: Spin `u64::MAX` times before blocking and using `std::hint::spin_loop()`

4: Spin `u64::MAX` times before blocking and using `std::thread::yield_now()`

5: Don't spin, but use a futex_wait timeout of 1 us

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Spinning in the scheduler #2877

Description

1: Spin 100 times before blocking

2: Spin u64::MAX times before blocking

3: Spin u64::MAX times before blocking and using std::hint::spin_loop()

4: Spin u64::MAX times before blocking and using std::thread::yield_now()

5: Don't spin, but use a futex_wait timeout of 1 us

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

2: Spin `u64::MAX` times before blocking

3: Spin `u64::MAX` times before blocking and using `std::hint::spin_loop()`

4: Spin `u64::MAX` times before blocking and using `std::thread::yield_now()`