Fix AFL show-map test by ncik-roberts · Pull Request #1789 · oxcaml/oxcaml

ncik-roberts · 2023-08-31T15:57:34Z

Fix an AFL instrumentation test, which is failing locally and skipped in CI. I believe it's failing for an uninteresting reason, so this PR is just a band-aid to get it passing again.

Review: @mshinwell says that @stedolan is familiar with this test — could you take a look?

What is this test anyway?

The AFL testsuite consists of:

some top-level startup code that's shared among all tests
a series of tests (really just functions of type unit -> unit)

A run of a test consists of:

Running the startup code
(I) Gathering afl-show-map output for a single invocation of the test function
(II) Gathering afl-show-map output for two serial invocations of the test function
Checking that the afl-show-map output exactly doubles from (I) to (II).

I'm not very familiar with afl-show-map but it looks like it prints some stats about which basic blocks are explored by the run of the instrumented program. The intuition is: if you run the same (deterministic) code twice, if a basic block is explored once in (I), it should be explored twice in (II).

What test is failing and why?

The laziness test is failing:

(* Top-level startup code *)
let already_forced = lazy (ref 42)
let _ = Lazy.force already_forced

(* Test function *)
let laziness () = opaque @@
  let _ = Lazy.force already_forced in
  Gc.major ()

The reason the test fails is that the count of basic blocks explored for the laziness does not exactly double from 1 invocation to 2 invocations.

For 1 invocation:

026649:1
051424:1

For 2 invocations:

I suspect that the first call to Gc.major () is doing something "different enough" to later calls (maybe just more work?), and that's why 051424 is hit in the first call and 053443 and 040923 are hit in the second call. A magic trace suggests that the first call is doing a lot more work in caml_empty_minor_heap, probably collecting the other garbage generated by top-level startup code. Indeed, if I call Gc.minor () once at top-level before laziness runs, then the output doubles as expected:

$ afl-showmap -q -o /dev/stdout -- ./test 1 7
025003:1
038699:1
042577:1
$ afl-showmap -q -o /dev/stdout -- ./test 2 7
025003:2
038699:2
042577:2

stedolan · 2023-09-12T10:15:23Z

At first glance, it looks like this is actually detecting a real issue, so I'd prefer to either fix it or leave it failing rather than change the test in this way. It's important that afl instrumentation output does not depend on when GC runs, and it looks like GC is actually affecting it here.

ncik-roberts · 2023-09-12T13:17:45Z

OK, I'll bisect at some point.

ncik-roberts · 2023-09-12T18:48:58Z

Closing in favor of #1824

Fix AFL test

44261c8

ncik-roberts requested review from antalsz, ccasin, goldfirere, gretay-js, lpw25, lukemaurer, mshinwell, poechsel, riaqn, stedolan and xclerc as code owners August 31, 2023 15:57

This was referenced Sep 12, 2023

Add AFL support for Flambda 2 #1196

Merged

Fix AFL test in flambda2 #1824

Merged

ncik-roberts closed this Sep 12, 2023

ncik-roberts deleted the fix-afl-test branch September 12, 2023 18:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix AFL show-map test#1789

Fix AFL show-map test#1789
ncik-roberts wants to merge 1 commit intomainfrom
fix-afl-test

ncik-roberts commented Aug 31, 2023 •

edited

Loading

Uh oh!

stedolan commented Sep 12, 2023

Uh oh!

ncik-roberts commented Sep 12, 2023

Uh oh!

ncik-roberts commented Sep 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ncik-roberts commented Aug 31, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What is this test anyway?

What test is failing and why?

Uh oh!

stedolan commented Sep 12, 2023

Uh oh!

ncik-roberts commented Sep 12, 2023

Uh oh!

ncik-roberts commented Sep 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ncik-roberts commented Aug 31, 2023 •

edited

Loading