Unbox across static handlers by alainfrisch · Pull Request #8735 · ocaml/ocaml

alainfrisch · 2019-06-14T13:21:33Z

This is a version of #2162, but built on top of the new (proposed) treatment of unboxing at the Cmm level (from #2165).

The first two commits are those from #2165. Only the third commit should be reviewer here, and this PR should not be merged before #2165.

asmcomp/cmm.ml

alainfrisch · 2019-09-13T15:55:33Z

@lthls : would you like to review this one as well?

lthls

I approve this pull request, both its objective and its implementation. There are a few remarks that I can make, tough:

If I understand correctly, after this gets merged we'll have catch handlers with non-val register types for the first time on trunk. I think this is already handled correctly, but I'll ask a few people around me who have experienced with register types if there's anything to watch for.
The notify_catch field of the environment is a bit unintuitive. Knowing what it tries to achieve, I understand how it works, but it could use a few lines of comments explaining what it's used for. I think that the natural solution would be to have the notify_catch field contain an unboxed_number_kind IntMap.t, but this would mean that the environment contains information that needs to be returned back to the caller, changing the signature of the transl* functions and updating the code everywhere. I see the proposed version here as a way around that problem, but it could do with a bit more documentation.
I'd be curious to know how often the No_result case occurs in transl_catch, and on which kind of programs. There's a number of optimisations that could be done if it happens, but I'd expect that either the optimisations are already done elsewhere or that this case never occurs in practice.

alainfrisch · 2019-10-01T12:21:53Z

Thanks @lthls. I've added some words about the notify_catch callback technique.

I'll ask a few people around me who have experienced with register types if there's anything to watch for.

I think all the cmm and post-cmm backend is ready to deal with non-val types (also for functions arguments, which is what I used in an experimental PR about unboxed calling conventions), but indeed, this hasn't received wide coverage. Did you more feedback from other people?

I'd be curious to know how often the No_result case occurs in transl_catch, and on which kind of programs.

I haven't been able to produce this situation (corresponding to a dead catch handler), and possibly this is excluded by other passes, but it would seem weird to reject the case at this point.

lthls · 2019-10-01T12:36:36Z

I think all the cmm and post-cmm backend is ready to deal with non-val types (also for functions arguments, which is what I used in an experimental PR about unboxed calling conventions), but indeed, this hasn't received wide coverage. Did you more feedback from other people?

Yes, they confirmed that they didn't find any bugs linked to this change in their experiments.

I haven't been able to produce this situation (corresponding to a dead catch handler), and possibly this is excluded by other passes, but it would seem weird to reject the case at this point.

I wasn't thinking about rejecting the case, I would have simply removed the handler since it's unreachable. If you haven't encountered such a situation in practice, it's probably not worth worrying about.

Thanks for addressing my concerns.

In the light of the recently merged #8584, do you want to take the opportunity to show some of the improvements that one can expect ? Your original pull request (#2162) showed some of the cases where it would trigger, maybe you could build a micro-benchmark based on these.

alainfrisch · 2019-10-01T12:43:39Z

Sorry, I should indeed have put a link to #2165 (comment) which still applies (just checked).

alainfrisch · 2019-10-01T12:44:31Z

(That's a 40% speedup on the micro-benchmark.)

alainfrisch · 2019-10-01T15:41:45Z

Based on @lthls 's approval and the availability of the micro-benchmark above, I'll merge this soon if nobody complains.

mshinwell · 2019-10-01T16:03:51Z

@alainfrisch Have you obtained any benchmark numbers from larger programs with this patch? It seems a significant enough change to be worth doing so.

As an aside, I think the existing register typing for unboxed integers is dubious, as they are assigned type Int. However this could in theory be joined with Val to yield Val, which would be wrong for e.g. a boxed int64. I suspect such a join never happens for a register holding a value that arose from unboxing, but this should probably be fixed nonetheless. My previous PR on improved register typing, which has been taken over by @Gbury (in conjunction with work on register type inference, as discussed at the last dev meeting), should be sufficient.

alainfrisch · 2019-10-01T16:15:00Z

@alainfrisch Have you obtained any benchmark numbers from larger programs with this patch? It seems a significant enough change to be worth doing so.

No visible change on some larger payloads I tried, but we wrote our critical numerical code knowing how unboxing works currently, and carefully rewrote parts to workaround current limitations. This ends up with some code like:

  let a = ref 0. and b = 0. in
  if cond then (a := ...; b := ...) else (a:= ...; b := ...)

which could be avoided by a simple unboxed float tuple binding with this PR.

I suspect such a join never happens for a register holding a value that arose from unboxing, but this should probably be fixed nonetheless.

If there is a problem, I think it already exists with normal unboxing of let-bound identifiers. Here we unbox catch arguments if they are boxed numbers for sure, considering all call sites; and for the "continuation" (i.e. the handler code), it is as if we inserted a local let-bound identifier to hold the unboxed value (let argi_unboxed = unbox argi in ...).

mshinwell · 2019-10-01T16:19:16Z

I think the typing problem isn't specific to this PR, agreed.

alainfrisch · 2019-10-01T16:22:42Z

@mshinwell : are you ok with merging?

mshinwell · 2019-10-01T16:23:48Z

I think as long as you're satisfied with the potential performance changes, it's fine.

alainfrisch · 2019-10-02T13:12:54Z

Closed/reopened to relaunch Appveyor CI, which took significantly longer than usual (and one job was killed because of it); no problem on Travis, so I suspect a transient issue on Appveyor, but let's be sure before merging.

alainfrisch · 2019-10-03T08:52:35Z

Ok, AppVeyor finished each job in < 1 hour, but job time seems to be significantly higher than on master; I think it's worth investigating before merging.

alainfrisch · 2019-10-03T11:22:17Z

False alarm, the latest AppVeyor build is a fast as usual (also confirmed by checking AppVeyor logs that slow builds were also slow on the bytecode parts).

alainfrisch force-pushed the unbox_across_static_handlers branch from 46427c7 to fa94371 Compare June 14, 2019 13:28

alainfrisch marked this pull request as ready for review June 14, 2019 13:29

alainfrisch requested a review from mshinwell June 14, 2019 13:29

alainfrisch mentioned this pull request Jun 14, 2019

Decide unboxing of let-bound expressions based on their Cmm translation #2165

Merged

lthls reviewed Jun 14, 2019

View reviewed changes

asmcomp/cmm.ml Outdated Show resolved Hide resolved

alainfrisch force-pushed the unbox_across_static_handlers branch from fa94371 to b011ea9 Compare June 14, 2019 16:11

Unbox across static handlers

6ab4a64

alainfrisch force-pushed the unbox_across_static_handlers branch from dd36b56 to 6ab4a64 Compare September 17, 2019 10:30

lthls approved these changes Sep 17, 2019

View reviewed changes

Document the callback technique

4aa339b

alainfrisch closed this Oct 2, 2019

alainfrisch reopened this Oct 2, 2019

alainfrisch closed this Oct 3, 2019

alainfrisch reopened this Oct 3, 2019

alainfrisch merged commit 552858d into ocaml:trunk Oct 3, 2019

lthls mentioned this pull request Oct 3, 2019

Split cmmgen into generic cmm helpers and clambda-specific transformations #1963

Merged

gretay-js pushed a commit to janestreet/ocaml that referenced this pull request Nov 19, 2019

Unbox across static handlers (ocaml#8735)

7e09ebc

smuenzel-js mentioned this pull request Feb 21, 2020

Some "Changes" in 4.09 weren't included in 4.09 #9323

Merged

gasche mentioned this pull request Apr 26, 2021

[patch] Avoid boxing float/int32/int64 when doing direct call #5894

Closed

lthls mentioned this pull request Jan 30, 2025

Track type of variables bound by as #13763

Merged

Conversation

alainfrisch commented Jun 14, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

alainfrisch commented Sep 13, 2019

Uh oh!

lthls left a comment

Choose a reason for hiding this comment

Uh oh!

alainfrisch commented Oct 1, 2019

Uh oh!

lthls commented Oct 1, 2019

Uh oh!

alainfrisch commented Oct 1, 2019

Uh oh!

alainfrisch commented Oct 1, 2019

Uh oh!

alainfrisch commented Oct 1, 2019

Uh oh!

mshinwell commented Oct 1, 2019

Uh oh!

alainfrisch commented Oct 1, 2019

Uh oh!

mshinwell commented Oct 1, 2019

Uh oh!

alainfrisch commented Oct 1, 2019

Uh oh!

mshinwell commented Oct 1, 2019

Uh oh!

alainfrisch commented Oct 2, 2019

Uh oh!

alainfrisch commented Oct 3, 2019

Uh oh!

alainfrisch commented Oct 3, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

alainfrisch commented Jun 14, 2019 •

edited

Loading