Faster Weak.blit by aalekseyev · Pull Request #9259 · ocaml/ocaml

aalekseyev · 2020-01-23T18:37:40Z

This is a fix for #9258.

This PR makes caml_ephemeron_blit_key faster by no longer scanning the ephemeron keys that are not involved in the operation.

Implementation details

We introduce caml_ephe_clean_partial, a version of caml_ephe_clean that takes a range of keys to clean.

One difference from caml_ephe_clean is that, if we don't scan all the keys, we may end up with release_data = false even though a key outside the scanned range is dead. There are two potential concerns here:

the assertion at the end of the function might fail
it's not obviously correct to not release the data

I claim that not releasing the data is OK because that's what happens when you call caml_ephemeron_get_key and caml_ephemeron_set_key.

I've adjusted the assertion accordingly.
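To make the release_data point concrete, here is a toy model in C. It is only a sketch under assumptions: the struct, the enum, and toy_clean_partial are hypothetical names that do not exist in the runtime, keys are indexed from 0, and "release the data when a dead key is seen in the scanned range" is my reading of the behaviour described above rather than a copy of the real code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy model only: none of these names exist in the OCaml runtime. */
enum { TOY_NKEYS = 8 };
enum toy_key { TOY_NONE, TOY_LIVE, TOY_DEAD };

struct toy_ephe {
  bool has_data;                 /* stands in for "data != caml_ephe_none" */
  enum toy_key keys[TOY_NKEYS];  /* keys are indexed from 0 in this model */
};

/* Clean the keys in the half-open range [first, past_last): erase dead
   keys, and release the data if at least one dead key was seen in that
   range. Returns the model's release_data flag. */
static bool toy_clean_partial(struct toy_ephe *e, size_t first, size_t past_last)
{
  bool release_data = false;
  assert(first <= past_last && past_last <= TOY_NKEYS);
  for (size_t i = first; i < past_last; i++) {
    if (e->keys[i] == TOY_DEAD) {
      e->keys[i] = TOY_NONE;
      release_data = true;
    }
  }
  if (release_data) e->has_data = false;
  return release_data;
}
```

In this model, scanning a range that misses the only dead key leaves both that key and the data in place, and a later scan of the whole range releases them. That is the situation the adjusted assertion has to tolerate.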

Testing

The benchmark in #9258 takes ~0.1s with this patch instead of >10s without.

I have to admit that, even though I have run the test suite, I have not seen the (old version of the) assertion fail. There might not be sufficient test coverage to hit this case.

gasche · 2020-01-23T21:26:35Z

(cc potential reviewers: @damiendoligez @bobot @jhjourdan @stedolan @kayceesrk)

bobot · 2020-01-24T10:33:37Z

Just to rephrase and complete, to check that our understandings are in sync:

It is an optimization for the clean phase (I'm surprised that we spend so much time in this phase, do we know why?)
The clean phase is incremental, but we don't know which ephemerons have been cleaned (the dead values are removed)
The goal of the clean phase is that, even if not all the dead values have been removed from the ephemeron yet, we behave as if they had been.
So each argument of blit could be uncleaned or already cleaned

So:

if dst (ard) has already been cleaned, we should not add unclean keys: but the only keys added are the ones blitted, and they are checked by the new partial clean in src (ars).
if dst is unclean: we should not remove all the unclean keys without cleaning the data; but the only keys removed in dst are checked by the new partial clean.

I agree with the idea of the optimization, and in fact it seems that we could avoid the partial clean of dst if the data of dst is caml_ephe_none (weak array case).

bobot · 2020-01-24T10:39:47Z

runtime/caml/weak.h

-  hd = Hd_val (v);
-  size = Wosize_hd (hd);
-  for (i = 2; i < size; i++){
+  for (i = offset; i < offset + count; i++){


I would prefer to remove the + and -2 in caml_ephe_partial and caml_ephe_clean by using after_last_offset instead of count in this function. That variable name is bad, though.

That's reasonable. How about offset_start and offset_end? End of range being the position after the last element seems like an established enough convention in C.

(pushed this change)

Another nitpick: could you add the ASSERT (EDIT: CAMLassert) for the requirement 2 <= offset_start <= offset_end <= Wosize(Hd_val...), if it holds?

Added that assertion. It had definitely better be true.
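For readers skimming this thread, the requested precondition can be sketched as a standalone check. This is an illustration only: toy_range_ok and toy_range_len are hypothetical helpers, wosize stands for the block size in words, and the constant 2 is taken from the original loop `for (i = 2; i < size; i++)`.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical helper: true when [offset_start, offset_end) is a valid
   half-open range of key slots in a block of wosize words, where the
   first key lives at index 2. */
static bool toy_range_ok(size_t offset_start, size_t offset_end, size_t wosize)
{
  return 2 <= offset_start
      && offset_start <= offset_end
      && offset_end <= wosize;
}

/* Number of slots covered by the half-open range; no +1 or -1 needed. */
static size_t toy_range_len(size_t offset_start, size_t offset_end)
{
  return offset_end - offset_start;
}
```

With the end offset being one past the last element, the empty range (offset_start == offset_end) is valid, and the full clean corresponds to the range from 2 to wosize.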

aalekseyev · 2020-01-24T10:52:21Z

I agree on every point.

I don't know why the clean phase takes so much time (or what fraction of time it's supposed to take, anyway). Maybe in this benchmark it's extreme because most (all?) of the objects allocated on the major heap are ephemerons, but we've seen a noticeable performance hit in a real application where I think <5% of the heap consists of ephemerons. Anyway, whatever the proportion is, if it's constant, we end up with an asymptotically quadratic cost for elementwise blitting.
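A back-of-the-envelope model of the quadratic cost, as a sketch only. It assumes the worst case where every blit happens during the clean phase, and that a full clean scans all n keys of both the source and the destination; both function names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Key slots scanned when n elements are moved with n single-element
   blits, if each blit cleans ALL n keys of both arrays (assumed worst
   case: every blit happens during the clean phase). */
static size_t toy_scans_full_clean(size_t n)
{
  return n * (2 * n);
}

/* Same workload when each blit cleans only the one source key and the
   one destination key it touches. */
static size_t toy_scans_partial_clean(size_t n)
{
  return n * 2;
}
```

For n = 1000 the model gives 2,000,000 scanned slots with the full clean versus 2,000 with the partial clean. If only a constant fraction of the blits fall in the clean phase, the first figure shrinks by that constant but stays quadratic in n.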

> it seems that we could avoid the partial clean of dst if the data of dst is caml_ephe_none

That sounds mostly right. I don't know whether it's safe/desirable to call do_set with an unclean destination, though. Do you think this is safe and worth doing?

bobot · 2020-01-24T11:47:43Z

> Do you think this is safe and worth doing?

I think it is safe, because the only thing checked on the old key is if it is young. And a key can't be young and unclean.

Is it worth doing?

The patch is a one-liner, but I have no experimental comparison.

aalekseyev · 2020-01-24T12:18:04Z

I pushed the optimization. It doesn't seem to affect the benchmark I have, presumably because the destination is always filled with nones and scanning those is very cheap (compared to caml_page_table_lookup).

Still, cleaning the source array constitutes just over 10% of the cost of Weak.blit (which itself, by the way, is only ~20% of the overall benchmark time now) so saving some cleaning could well be non-negligible in some cases.

aalekseyev · 2020-01-24T12:32:34Z

Wait, actually I don't agree with your safety argument. A key that's unclean and being overwritten is in fact guaranteed to visit the if (!(Is_block (old) && Is_young (old))){ branch in do_set and, therefore, to reach add_to_ephe_ref_table, with consequences that are beyond my understanding.

Given that it seems tricky to make a safety argument and the win is not great, I weakly prefer that we undo the optimization.

What do you think? (or maybe I miscounted the number of negations :-D)

aalekseyev · 2020-01-24T12:37:36Z

Oh, maybe the answer is that this branch is visited in both cases (because caml_ephe_none is not a young block either)?

bobot · 2020-01-24T12:37:42Z

The set is already done when add_to_ephe_ref_table (Caml_state->ephe_ref_table, ar, offset); is run, so old is not in the picture anymore (and in fact the key is not stored in the table). The value old is only given to Is_block and Is_young.

aalekseyev · 2020-01-24T12:40:16Z

Yeah, I do agree now. Thank you for clarifying.
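The point the exchange above converges on can be checked with a tiny truth-table model. This is a sketch: struct toy_value and toy_takes_branch are hypothetical, and the function mirrors only the condition quoted in the thread, `!(Is_block (old) && Is_young (old))`, not the rest of do_set.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for a runtime value: just the two properties
   that the quoted condition inspects on the old key. */
struct toy_value {
  bool is_block;
  bool is_young;
};

/* Mirrors only the quoted test on the old key:
   if (!(Is_block (old) && Is_young (old))) { add_to_ephe_ref_table (...); } */
static bool toy_takes_branch(struct toy_value old)
{
  return !(old.is_block && old.is_young);
}
```

Any old key that is not a young block takes the branch. That covers caml_ephe_none (which, per the thread, is not a young block) as well as an unclean key (which, per the thread, cannot be young), so the branch is taken in both cases. Since old is only passed to Is_block and Is_young, its cleanliness does not influence anything else.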

damiendoligez

Looks good to me. Approving on behalf of @bobot and myself.

aalekseyev · 2020-01-28T15:08:11Z

I adjusted the changelog and collapsed the commits into two: one that introduces caml_ephe_clean_partial and another that implements the optimization proposed by @bobot. Please let me know if I should do anything else before this PR can be merged.

Faster Weak.blit

aalekseyev force-pushed the faster-Weak.blit branch 2 times, most recently from 2aed5ac to 19a0d91 · January 23, 2020 18:52

bobot approved these changes Jan 24, 2020

damiendoligez approved these changes Jan 28, 2020

damiendoligez self-assigned this Jan 28, 2020

aalekseyev added 2 commits January 28, 2020 15:04

Make Weak.blit and Ephemeron.blit_key faster

e447fb1

Small optimization: don't clean the keys about to be overwritten

5903aa0

aalekseyev force-pushed the faster-Weak.blit branch from 3d8d8e9 to 5903aa0 · January 28, 2020 15:05

aalekseyev added 3 commits January 29, 2020 13:30

Merge branch 'trunk' into faster-Weak.blit

315620d

Merge branch 'trunk' into faster-Weak.blit

629585d

Merge branch 'trunk' into faster-Weak.blit

59ca2a6

lpw25 merged commit b807931 into ocaml:trunk Feb 14, 2020

stedolan pushed a commit to janestreet/ocaml that referenced this pull request Mar 17, 2020

ocaml#9259 from aalekseyev/faster-Weak.blit (cherry-pick b807931)

50ee36c

Faster Weak.blit

bobot mentioned this pull request Apr 19, 2020

Functions from weak are breaking all marking invariants on ephemerons #9424

Merged

2 tasks

mshinwell pushed 22 commits to mshinwell/ocaml that referenced this pull request. Each one reads:

ocaml#9259 from aalekseyev/faster-Weak.blit (cherry-pick b807931)

Faster Weak.blit

Dates and commit hashes:

Jul 16, 2020 · 0995a51
Jul 20, 2020 · 1afcf1b
Jul 20, 2020 · 9ce6148
Jul 21, 2020 · 1b65a12
Jul 21, 2020 · c271000
Jul 30, 2020 · 2cd4d34
Jul 30, 2020 · cddf2b5
Aug 3, 2020 · 8d0bd32
Aug 4, 2020 · 00030b1
Aug 5, 2020 · 0fe033e
Aug 7, 2020 · 2eef1ed
Aug 10, 2020 · 89fa7b2
Aug 10, 2020 · cbe48fa
Aug 17, 2020 · 2889e6e
Aug 18, 2020 · 8c6df34
Aug 19, 2020 · 17f1096
Aug 20, 2020 · 71dacb2
Aug 28, 2020 · 1cba260
Sep 2, 2020 · dbf084a
Sep 2, 2020 · a04cf96
Sep 2, 2020 · b4adddb
Sep 7, 2020 · cb99991

bobot mentioned this pull request Mar 3, 2025

Allow values reachable from ephemeron keys to be collected by minor GC #13643

Merged
