
Use parallel for_each to load all ports #694

Closed
Thomas1664 wants to merge 48 commits into microsoft:main from Thomas1664:load-all-ports


Conversation

@Thomas1664 (Contributor) commented Sep 3, 2022

This PR adds the ability to use parallel algorithms by enabling them in CMakeLists.txt. GCC needs OpenMP to use this feature (docs). Fortunately, OpenMP is preinstalled, and we can easily find it with CMake.

I implemented the parallel version of std::for_each for UNIX myself, because Clang doesn't support parallel algorithms at all and GCC requires run-time dependencies.

x-add-version --all

I was able to get a 2x performance improvement [12 s before vs. 6 s after; GCC Debug]. I had similar results on Windows release and debug builds. I also tested whether there are any downsides when using this command without --all, and found that for_each is slightly faster than the range-based for loop that was used previously.

@Thomas1664 Thomas1664 marked this pull request as ready for review September 3, 2022 19:33
@Thomas1664 Thomas1664 marked this pull request as draft September 7, 2022 22:33
@BillyONeal (Member) left a comment


In addition to the data race I made a comment about, I highly doubt that update_version_db_file and update_baseline_version are safe to call concurrently.

I am additionally concerned about adding runtime dependencies on Linux (OpenMP). We are a bootstrapping tool in a lot of situations, so we have to be careful there. If that dependency were ever a separate apt package, for example, we probably would not want to take it.

That isn't to say that I dislike the parallel algorithms library, or that I dislike what this is doing. On the contrary, I think this concept is a good improvement in general, and certainly on Windows, where it doesn't introduce any additional deployment problems, I'm all for it. (If anything, I'm biased to say use more parallel algos, as the original author of most of MSVC++'s parallel algorithms implementation. :)) It's just that I think we need to value broad compatibility over performance improvements like this, given the position that we are in. We ripped out the std::filesystem dependency for similar reasons.

Paths forward here if you want to keep contributing in a similar area:

  • Parallel for_each isn't that complex an algorithm to do for this, particularly when we know the 'elements' are going to be relatively expensive / do disk I/O like this. You could build such a thing on top of pthreads which would eliminate the extra dependencies concern on other platforms.
  • Our installation process currently could benefit a lot from being pipelined. For example, building the next thing doesn't have to wait for building binary cache packages and/or copying installed bits for the current thing.

Thanks for your contribution and I hope it works out!

@ras0219-msft (Collaborator)

Our installation process currently could benefit a lot from being pipelined. For example, building the next thing doesn't have to wait for building binary cache packages and/or copying installed bits for the current thing.

A quick note to make the implicit explicit: we do have potential race conditions between multiple installs (manipulations within the downloads folder). The obvious improvement here is to start the next build while {packing up, uploading to binary caches, installing files}, but we currently cannot start another build while the current build is in progress.

@Thomas1664 (Contributor, Author)

A quick note to make the implicit explicit: we do have potential race conditions between multiple installs (manipulations within the downloads folder). The obvious improvement here is to start the next build while {packing up, uploading to binary caches, installing files}, but we currently cannot start another build while the current build is in progress.

We can only do binary caching in parallel, because post-build lint needs to happen before the next build starts. Parallel downloads are impossible because downloads are triggered from inside CMake, and there might be file conflicts.

If we were to install ports in parallel, we would have to keep the console output consistent, maybe by adding [port-name] in front of each line.

In general, this does not require parallel algorithms but is more a problem for std::async.

@Thomas1664 (Contributor, Author)

  • Parallel for_each isn't that complex an algorithm to do for this, particularly when we know the 'elements' are going to be relatively expensive / do disk I/O like this. You could build such a thing on top of pthreads which would eliminate the extra dependencies concern on other platforms.

I'd prefer std::async because it's C++ and doesn't require additional dependencies.

{
std::mutex mtx;

auto work = [&](const std::string& port_name) {
Collaborator

From this large block, I believe there are two operations worth parallelizing: Paragraphs::try_load_port and the formatting check. Everything else done in this function is a rounding error -- and worse, is user-visible I/O (printing messages, terminating the program).

I'd prefer to see these two operations individually parallelized.

  1. They would be safer & simpler to use
  2. They could be reused elsewhere in the program
  3. Achieves 99.9% of the performance of wrapping the rest of this logic
  4. Avoids printing messages from multiple threads, which makes the final output significantly more correct (all messages will be deterministically printed in the correct order, they will all be printed, and they will never be "chopped off" due to early termination)

Contributor (Author)

@ras0219-msft The thing is that try_load_port() itself may call Checks::check_exit(). Given that we sprinkle Checks::check_exit everywhere in our code, there is nearly nothing that we can parallelize!

IMO we should replace this with throwing an exception instead, print the error message properly by wrapping everything in main() inside a try ... catch, and call Checks::check_exit from the catch block.

@ras0219-msft (Collaborator) commented Sep 28, 2022

Then as part of this, try_load_port() should be fixed to never call Checks::check_exit() and always return its errors via the existing return type of Expected<>.

Interleaving console output is still not ok -- users can and will get torn messages and nondeterministic behavior.

Contributor (Author)

@ras0219-msft The formatting check can't be parallelized because it calls too many functions that may exit. Please also have a look at microsoft/vcpkg#27152 .

@Thomas1664 Thomas1664 marked this pull request as draft October 3, 2022 13:05
@Thomas1664 (Contributor, Author)

Hi @BillyONeal, @ras0219-msft,

As mentioned above, we can't use parallel algorithms if the functions that are called from different threads may exit. Unfortunately, exiting from a function is widespread across the codebase and even happens in basic components like Filesystem or Json. Even if I replaced all of those exits with ExpectedS, future changes further down the call stack might add calls to exit. Therefore, I don't think it's safe to use any kind of parallelization until we have a policy on when it is OK to exit the program.

@BillyONeal (Member)

It seems unlikely to me that we are going to invest in this direction anytime soon.

4 participants