Extract base analysis flag to separate analysis, refactor thread spawning by sim642 · Pull Request #130 · goblint/analyzer

sim642 · 2020-11-10T15:02:04Z

This is my ongoing effort to extract the flag (containing a "boolean" of single-/multithreaded mode and current thread ID) from base analysis to a separate analysis.

Additionally, I'm trying to refactor thread spawning, which multiple analyses currently try to do in parallel in special in a very similar way. The idea is to add additional transfer functions (currently threadenter and threadspawn) which make other analyses aware of a thread spawned by base analysis without needing to duplicate the logic.

This should make it easier to handle thread joins (pthread_join) and in simpler cases go back to singlethreaded mode, avoiding races that come from the end of main thread after other threads have been joined.

There is mtflag analysis but it's lacking compared to what base does with the flag.

…tions This should allow reducing the duplication of ctx.spawn logic in different analyses.

…rinc)

# Conflicts: # src/analyses/base.ml

…t affected

Every analysis should specify it itself like otherstate previously

After splitting part_access, this is unnecessary.

threadspawn result is joined anyway.

src/analyses/threadFlag.ml

src/analyses/uninit.ml

src/analyses/poly.ml

michael-schwarz · 2020-11-18T08:34:23Z

It is definitely a lot nicer to have these things separate now, good job!

sim642 · 2020-11-18T10:10:34Z

I think separately from this PR, we should probably at some point cleanup the regression tests that specify which specific analyses to run explicitly to:
* use the default ones, and add any additionally needed ones

* and only specify exactly which ones to run where necessary
This will make future diffs smaller.

Absolutely! I was thinking the same when i had to fix the merge conflicts in 100 regression test PARAMs.

sim642 · 2020-11-18T10:28:12Z

The idea is to add additional transfer functions (currently threadenter and threadspawn) which make other analyses aware of a thread spawned by base analysis without needing to duplicate the logic.

Could you elaborate a bit on when these transfer functions would be called specifically?
And also how does this relate to the otherstate which was somehow related to this if I see correctly?

The new threadenter is basically the old otherstate on steroids: it gets much more information as arguments than just the function varinfo, notably the access to ctx.local and ctx.ask. As before, this transfer function is used when a new thread's initial state is to be calculated after ctx.spawn has been done somewhere.

The new threadspawn is a bit like combine (I originally had it named threadcombine but it was misleading) but it combines the new thread's initial state (after all analyses have done their threadenter to get that) with the current thread's state. This has two main goals:

The thread analysis is made aware of a new thread that was spawned from the current thread, thus it can add that to its local created threads set. Previously the thread analysis had to duplicate and match base analysis logic to know when that happens. Now only base calls ctx.spawn and via these transfer functions everyone else is aware.
In Handle thread joins #137 the base analysis can ask for the created thread ID (which is calculated by threadid analysis' threadenter) in order to set the local pthread_t variable to point to that.

The biggest peculiarity with threadspawn (or multiple) is that it happens in parallel with a normal transfer function and those are joined, like how ctx.split works. This is because if base's special does a ctx.spawn, then thread's special can't be made aware of the fact that something spawned (unless we use ctx.postsub I guess, but that's so ugly). Also, since in some kernel cases base's assign may do ctx.spawn, this parallel threadspawn is nicely compatible with threads being spawned in every weird place or condition.

If only we had somewhere to nicely document this sort of things...

# Conflicts: # src/analyses/region.ml

sim642 · 2020-11-26T16:34:46Z

Here's a screenshot of this PRs differences on SoftwareSystems:

Notably ldv-consumption/32_7a_cilled_linux-3.8-rc1-32_7a-sound--core--snd-rawmidi.ko-ldv_main0_sequence_infinite_withcheck_stateful.cil.out.i turns from true to unknown which worries me.

sim642 · 2020-11-27T10:01:00Z

ldv-consumption/32_7a_cilled_linux-3.8-rc1-32_7a-sound--core--snd-rawmidi.ko-ldv_main0_sequence_infinite_withcheck_stateful.cil.out.i

Something quite funny is happening that this turns unknown. Apparently we end up in multithreaded mode which obviously makes us less precise. It's a SoftwareSystems program and doesn't use any threading though, so what gives? Apparently we're spawning a function through a function pointer given to an special unknown function (in this case the standard free). The program isn't doing something stupid like freeing a function pointer though but instead freeing a struct which happens to contain function pointers. The unknown function spawning logic uses collect_funargs to find all reachable function pointers from that and that's how snd_rawmidi_input_event_work ends up being spawned by free.

Why didn't this happen before? Probably because thanks to mallocWrapper the freed blob is more precise (a struct containing a function pointer, not top).

Why didn't this happen immediately when mallocWrapper was added but just now with thread flag changes? I didn't bother looking into it that deep but my guess is that the thread spawning logic, which previously was duplicated in multiple analyses, had weaker implementations in some (not using reachability for unknown specials). Now that this happens centrally in base where the logic is the most powerful, it now arises.

Obviously this is silly behavior, so I see two ways to avoid this from happening:

Add a new config option to control whether unknowns specials spawn. This can be enabled by default, keeping our current logic, but disabled for SV-COMP because there programs should be complete and not call unknown functions which may spawn threads.
EDIT: Turns out our defaults already define the option exp.unknown_funs_spawn but it's not used anywhere... That's an easy fix!
Make our LibraryFunctions invalidate actions mechanism more fine-grained. Right now free is specified as writesAll because it mutates the pointed memory but that doesn't mean it may spawn threads from it. Even the unknown specials spawning logic has a concerned comment:
```
 (* why do we only spawn arguments that are written?? *)
```
We'd probably be better off having a another invalidation action to specify which arguments of which special functions may spawn.

It already existed but was completely unused.

sim642 · 2020-11-27T11:16:09Z

The pull_request CI checks now seem to be failing because the added regression test 29/21 in master doesn't enable threadid and threadflag analyses, which would be a simple enough fix. Actually adding threadid is enough to fix it.

But it's weird because base analysis doesn't depend on threadid at all. The crash is

Fatal error: exception Failure("BUG: Empty set of start variables; may happen if enter_func of any analysis returns an empty list.")

What makes is weird that adding analyses fixes the start variables, not removing problematic analyses.

…sound--core--snd-rawmidi.ko-ldv_main0_sequence_infinite_withcheck_stateful.cil.out.i

Now without threadid enabled, morphstate gets called with bot, that reduce removes. This is unintentional.

sim642 added 10 commits November 6, 2020 17:04

Try to extract Flag from base analysis to separate analysis

f846362

There is mtflag analysis but it's lacking compared to what base does with the flag.

Add baseflag to regression test activated analyses

c149b1b

Fix baseFlag to not create threads on LAP_Se_CreateProcess

09a15d9

Fix base publish_all to force multithreaded flag as before

404a234

Reimplement kernel thread spawn on assign in baseFlag

3c763d3

Replace otherstate with threadenter (and threadcombine) transfer func…

8e66f4b

…tions This should allow reducing the duplication of ctx.spawn logic in different analyses.

Replace otherstate bot hack with empty startstate

95af64d

Add baseflag to svcomp21

a3c984f

Remove unnecessary ThreadCreate handling from most analyses (except a…

9d05d7e

…rinc)

Merge branch 'master' into thread/flag

2c024ca

# Conflicts: # src/analyses/base.ml

sim642 added the cleanup Refactoring, clean-up label Nov 10, 2020

sim642 added 19 commits November 12, 2020 11:11

Change threadcombine default to bot so other transfer functions aren'…

706975c

…t affected

Change threadEscape threadenter to match old forkfun

5624968

Remove threadenter from Analyses.DefaultSpec

86291ca

Every analysis should specify it itself like otherstate previously

Remove exp.ignored_threads

2618d25

Remove outdated and unused mtflag analysis

a17900f

Rename baseflag -> threadflag

17f3c58

Copy threadflag analysis to threadid analysis

15b3544

Split threadflag to threadid and threadflag

1609279

Split part_access between threadid and threadflag

ea65dfa

Remove BaseDomain.Flag

8f14388

Move commented out IsPublic query from Base to ThreadFlag

94f5e18

Move is_multi from Base to ThreadFlag

a5b00c1

Remove unnecessary ThreadCreate handling from thread analysis

6f58f72

Change threadcombine argument from entered domain to entered ctx

5590011

Use fctx in thread analysis

1214259

Rename threadcombine -> threadspawn

9310172

Fix threadflag HTML output

c1450a1

Remove threadflag dependency on threadid

88a4080

After splitting part_access, this is unnecessary.

Remove unnecessary join from threadflag threadspawn

577b082

threadspawn result is joined anyway.

michael-schwarz reviewed Nov 18, 2020

View reviewed changes

src/analyses/threadFlag.ml Show resolved Hide resolved

michael-schwarz reviewed Nov 18, 2020

View reviewed changes

src/analyses/uninit.ml Show resolved Hide resolved

michael-schwarz reviewed Nov 18, 2020

View reviewed changes

src/analyses/poly.ml Show resolved Hide resolved

sim642 added 11 commits November 18, 2020 12:45

Add missing threadspawn to Poly

a15f4c2

Fix LevelSliceLifter threadenter to use start_level

f7136ba

Remove otherstate and ctx.spawn comments

e55cdf9

Change ThreadFlag.is_multi to be ask-only

01e0246

Change ThreadId.get_current to be ask-only

12356d5

Merge branch 'master' into thread/flag

8d0b259

Fix 30/03 by adding threadid and threadflag analyses

c87e619

Remove ARINC tasks from base

89a6df6

Merge branch 'master' into thread/flag

e1527a5

# Conflicts: # src/analyses/region.ml

Merge branch 'master' into thread/flag

09d3770

Merge branch 'master' into thread/flag

7661068

Use option exp.unknown_funs_spawn (issue #130)

b7c8991

It already existed but was completely unused.

sim642 added 2 commits November 27, 2020 13:35

Fix g2html crash on ldv-consumption/32_7a_cilled_linux-3.8-rc1-32_7a-…

c2c0365

…sound--core--snd-rawmidi.ko-ldv_main0_sequence_infinite_withcheck_stateful.cil.out.i

Fix PathSensitive morphstate on bot

598f24c

Now without threadid enabled, morphstate gets called with bot, that reduce removes. This is unintentional.

sim642 merged commit d43ad0e into master Nov 27, 2020

sim642 deleted the thread/flag branch November 27, 2020 12:19

sim642 added a commit that referenced this pull request Nov 27, 2020

Disable unknown_funs_spawn in svcomp21 (issue #130)

ca83bc4

michael-schwarz mentioned this pull request Dec 3, 2020

Fixpoints not being reached #153

Closed

6 tasks

sim642 mentioned this pull request Dec 16, 2020

Cleanup: Only have one query for must-equality between expressions #156

Merged

michael-schwarz mentioned this pull request Apr 20, 2021

Fix unsoundness when thread escape analysis is disabled #191

Merged

sim642 mentioned this pull request May 11, 2021

Fix failing arinc semaphore regression. #26

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract base analysis flag to separate analysis, refactor thread spawning#130

Extract base analysis flag to separate analysis, refactor thread spawning#130
sim642 merged 50 commits intomasterfrom
thread/flag

sim642 commented Nov 10, 2020 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

michael-schwarz commented Nov 18, 2020

Uh oh!

sim642 commented Nov 18, 2020

Uh oh!

sim642 commented Nov 18, 2020

Uh oh!

sim642 commented Nov 26, 2020

Uh oh!

sim642 commented Nov 27, 2020 •

edited

Loading

Uh oh!

sim642 commented Nov 27, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sim642 commented Nov 10, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

michael-schwarz commented Nov 18, 2020

Uh oh!

sim642 commented Nov 18, 2020

Uh oh!

sim642 commented Nov 18, 2020

Uh oh!

sim642 commented Nov 26, 2020

Uh oh!

sim642 commented Nov 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ldv-consumption/32_7a_cilled_linux-3.8-rc1-32_7a-sound--core--snd-rawmidi.ko-ldv_main0_sequence_infinite_withcheck_stateful.cil.out.i

Uh oh!

sim642 commented Nov 27, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sim642 commented Nov 10, 2020 •

edited

Loading

sim642 commented Nov 27, 2020 •

edited

Loading