Tune thread count for binning/grouping from scratch #3641
Merged
After #3638 reduced some of the baseline cost, switching to higher thread counts earlier (i.e. at smaller row counts) pays off according to my local benchmarks (grouping two columns into 2M bins).
Results chart (blue: before #3638, orange: #3638, green: this PR)

I included numpy.bincount for comparison. It does something much simpler, but it nicely illustrates that our baseline cost, which is constant in the number of rows, is still quite substantial and significantly affects timings up to roughly 10M rows. I will have another look at this as a follow-up task.
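For reference, a minimal sketch of the kind of single-threaded numpy.bincount baseline mentioned above, on synthetic data; the row count, bin indices, and timing harness here are illustrative assumptions, and the library-side grouping call from my actual benchmark is not reproduced.

```python
import time

import numpy as np

n_rows = 10_000_000   # illustrative row count
n_bins = 2_000_000    # 2M bins, as in the benchmark description

rng = np.random.default_rng(42)
codes = rng.integers(0, n_bins, size=n_rows)  # pre-computed bin indices

t0 = time.perf_counter()
counts = np.bincount(codes, minlength=n_bins)  # simple single-threaded reference
t1 = time.perf_counter()
print(f"numpy.bincount over {n_rows:,} rows: {t1 - t0:.3f} s")
```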