make it possible to run multiple containers in parallel #168
erikbern merged 4 commits into new-run-june-2020
Conversation
I would be interested in the results of 8x annoy in parallel vs. a single-threaded version on an 8 vCPU EC2 instance. I don't know how much other load the EC2 instances have, but 8x parallel AVX2 distance computations might show some serious performance drops in query time.
I don't think this change should have any impact on speed, right? Each container is limited to 1 CPU, so they will each get their own CPU (as long as the parallelism is lower than the number of CPUs, of course). That being said, I just noticed running …
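For context, Docker can pin a container to a single CPU with its `--cpuset-cpus` flag (and cap memory with `--memory`). A minimal sketch of building such a command; the image name and memory limit are illustrative, not the actual ann-benchmarks configuration:

```python
def pinned_run_command(image, cpu_index, mem_limit="4g"):
    """Build a `docker run` invocation pinned to a single CPU.

    `--cpuset-cpus` and `--memory` are real Docker flags; the image
    name and memory limit here are made-up placeholders.
    """
    return [
        "docker", "run", "--rm",
        "--cpuset-cpus", str(cpu_index),  # restrict the container to one CPU
        "--memory", mem_limit,
        image,
    ]

cmd = pinned_run_command("ann-benchmarks-annoy", 3)
# launching it would be e.g. subprocess.run(cmd)
```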
I don't know which CPUs are used in the current EC2 generation, but you can always run into problems with shared cache lines and clock downscaling if there are too many heavy SIMD instructions. It's best to benchmark that :-)
It's true you might see some more cache thrashing with higher parallelism, but it feels like it's worth it in order to bring the total runtime down by 4-8x, and it should affect all the algorithms approximately equally anyway. Maybe we can also increase the number of runs to more than 2: I think something like 98% of all time is spent building the index as opposed to running the queries, so it's unlikely that more than one algorithm is running queries at any point in time, and running the queries 3-5 times would warm up the cache on that CPU. I'm very tempted to do this, as it would bring the runtime down from, say, a month to a week. It would make it easier/cheaper to re-run the benchmarks more often, with a marginal impact on the results.
6b28948 to 610ab43
Rewrote this to actually farm the work out to each CPU. I think this should work. I wiped the results for MNIST and I'm rerunning it now. Just looking at the output it looks much faster. If this works out well then I'm tempted to re-run the glove benchmarks using this as well (but maybe using, say, 3 or 4 processes, not 7).
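The scheduling idea can be sketched as a shared queue of job definitions drained by `parallelism` workers, each owning one CPU index. The actual runner launches subprocesses that start CPU-pinned containers; plain threads and placeholder job names are used here only to show the pattern:

```python
import queue
import threading

def run_all(definitions, parallelism):
    """Drain a shared job queue with one worker per CPU index.

    CPU indexes run 1..parallelism, leaving CPU 0 for the coordinator.
    Returns (cpu_index, job) pairs; the real runner would instead start
    a Docker container pinned to `cpu_index` for each job.
    """
    jobs = queue.Queue()
    for d in definitions:
        jobs.put(d)

    done = []
    lock = threading.Lock()

    def worker(cpu_index):
        while True:
            try:
                job = jobs.get_nowait()
            except queue.Empty:
                return  # queue drained, worker exits
            with lock:
                done.append((cpu_index, job))

    threads = [threading.Thread(target=worker, args=(i + 1,))
               for i in range(parallelism)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return done

results = run_all(["annoy", "hnswlib", "faiss", "bruteforce"], parallelism=3)
```

Because workers pull from one queue, a slow algorithm ties up only its own CPU while the others keep draining jobs.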
I can also run the full benchmark on a smaller random dataset and see if there's any difference in the results with/without this change.
As I said above, I think annoy on MNIST might already show a performance issue if there is one. Maybe compare single-threaded / 4 parallel processes / 8 parallel processes / 16 parallel processes (the latter just as a sanity check, to see that it performs much worse).
Sure, I can do that. It assigns CPUs 1...args.parallelism, so I can't do more than 15 on a c5.4xlarge. I'm currently running with 7. Once it's done, let me try with 1 and see what the difference is.
... and here's the benchmark with parallelism = 1 (it took almost 24h to run, as I expected). There is a small difference, but it seems to be no more than 5-10%, and more importantly it doesn't change the relative ranking (it seems to affect all algorithms equally). My thinking is to merge this, increase the number of runs from 2 to 5, and then run all benchmarks with parallelism 3-5 in order to finish everything much faster.
Merging this for now and kicking off a glove build


This should do the trick :)
Running it on MNIST right now to see if it works