Optimize short-circuitable folds in `MCP` by sim642 · Pull Request #570 · goblint/analyzer

sim642 · 2022-01-26T16:07:13Z

This PR is an attempt to optimize the following MCP domain/context/access functions:

equal
compare
leq
is_top
is_bot
may_race

The previous implementations used folds, which inevitably had to iterate over the entire list each time. Even though the folding functions themselves used the short-circuiting binary operators, this didn't short-circuit much. Moreover, all the inefficient List.assoc calls were still happening just to pass the corresponding arguments to the folding function (which in the short-circuiting case didn't even use them).
These assocs are a pain because as the fold goes down the list, each corresponding assoc needs to go through more of the other list. Overall, this means O(n²) complexity.

This PR implements alternative short-circuitable folds for MCP, which abort the list iteration via local exception to short-circuit. It not only avoids the unnecessary list tail iteration, but also all the assoc calls that would unnecessarily happen there.

This is quite relevant because while profiling slow race warning performance that @vesalvojdani noticed, it turns out that 30% of race warning time is just spent on MCPAccess.A.compare while manipulating sets of accesses.
I'm speculating that this might even have general performance improvement since path-sensitivity also uses sets of MCP.D values.

TODO

Benchmark.

src/analyses/mCPRegistry.ml

michael-schwarz · 2022-01-27T08:02:04Z

If it is really the calls to assoc that are biting us here, one could also think about at least reducing the complexity of assoc_dom calls by introducing some dedicated data structure for it that makes accesses O(1).

As the list of all available analyses is typically small, does not change during analysis, and numbered starting at zero, one could think about generating a (module Bla.S) option array where the index corresponds to the number of the analysis?

sim642 · 2022-01-27T08:28:53Z

If it is really the calls to assoc that are biting us here, one could also think about at least reducing the complexity of assoc_dom calls by introducing some dedicated data structure for it that makes accesses O(1).

As the list of all available analyses is typically small, does not change during analysis, and numbered starting at zero, one could think about generating a (module Bla.S) option array where the index corresponds to the number of the analysis?

I've been thinking about this as well, but also the fact that the whole assoc list representation in MCP might need overhauling, for example to use Map instead. Another possibility would be to ensure that analyses in those lists have well-defined and consistent order (e.g. sorted), so they could simply be zipped. I'll open a new issue about this.

sim642 · 2022-01-27T14:01:40Z

On sv-benchmarks there's not much of a difference, maybe ~2% improvement:

Full results table

But there also isn't big multithreaded benchmarks there, so the differences aren't as pronounced as in #571.

sim642 · 2022-01-28T10:29:15Z

I would merge this after #578 because there are major conflicts between the two.

sim642 · 2022-01-31T14:21:57Z

I now rebased this on top of #578 and also simplified the implementations: instead of using exceptions to escape the fold, I'm now just using for_all* functions on lists, which also stop iterating on first false.

I also benchmarked this new version and it's much more impressive than the previous version.

This compared to #578

There's an additional ~10% speedup:

#578 and this compared to master

Together the two give ~27% speedup:

Full results table

michael-schwarz · 2022-01-31T17:25:52Z

Nice! I'm always amazed at how much time we seem to waste in non-obvious places.

sim642 added the performance Analysis time, memory usage label Jan 26, 2022

michael-schwarz reviewed Jan 27, 2022

View reviewed changes

src/analyses/mCPRegistry.ml Outdated Show resolved Hide resolved

sim642 mentioned this pull request Jan 27, 2022

Configuration of access collecting precision and performance #571

Closed

sim642 added the pr-dependency Depends or builds on another PR, which should be merged before label Jan 28, 2022

sim642 added 3 commits January 31, 2022 14:00

Optimize MCP short-circuitable folds with for_all

7467323

Optimize MCP compare

89f1acc

Remove unused MCP helper methods

958e2a0

sim642 force-pushed the mcp-short-circuit branch from 05f3885 to 958e2a0 Compare January 31, 2022 14:14

sim642 changed the base branch from master to mcp-no-assoc January 31, 2022 14:14

michael-schwarz approved these changes Jan 31, 2022

View reviewed changes

Base automatically changed from mcp-no-assoc to master February 1, 2022 08:46

sim642 removed the pr-dependency Depends or builds on another PR, which should be merged before label Feb 1, 2022

sim642 merged commit f01914a into master Feb 1, 2022

sim642 deleted the mcp-short-circuit branch February 1, 2022 08:48

sim642 added this to the v2.0.0 milestone Aug 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize short-circuitable folds in `MCP`#570

Optimize short-circuitable folds in `MCP`#570
sim642 merged 3 commits intomasterfrom
mcp-short-circuit

sim642 commented Jan 26, 2022 •

edited

Loading

Uh oh!

Uh oh!

michael-schwarz commented Jan 27, 2022 •

edited

Loading

Uh oh!

sim642 commented Jan 27, 2022

Uh oh!

sim642 commented Jan 27, 2022

Uh oh!

sim642 commented Jan 28, 2022

Uh oh!

sim642 commented Jan 31, 2022

Uh oh!

michael-schwarz commented Jan 31, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sim642 commented Jan 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TODO

Uh oh!

Uh oh!

michael-schwarz commented Jan 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sim642 commented Jan 27, 2022

Uh oh!

sim642 commented Jan 27, 2022

Uh oh!

sim642 commented Jan 28, 2022

Uh oh!

sim642 commented Jan 31, 2022

This compared to #578

#578 and this compared to master

Uh oh!

michael-schwarz commented Jan 31, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sim642 commented Jan 26, 2022 •

edited

Loading

michael-schwarz commented Jan 27, 2022 •

edited

Loading