Fix MPR7253, exit behaviour in case of raising at_exit functions. by dbuenzli · Pull Request #685 · ocaml/ocaml

dbuenzli · 2016-07-12T17:07:47Z

Rather than building a closure that captures the list of [at_exit]
functions to call at [exit] we explicitly store them in a mutable
list in LIFO order. We change the [Pervasives.do_at_exit] function
so that it sequentially pops one from the list before executing it.

If the popped function raises, the general uncaught exception mecanism
of [caml_main] kicks in. This eventually leads to the execution of
[Printexc.handle_uncaught_exception'] which reports the uncaught
exception and then calls [Pervasives.do_at_exit] to continue calling
the remaining [at_exit] functions. The latter call is guarded to catch
and report further exception raised by [at_exit] function in a loop,
so that each uncaught expression in [at_exit] functions gets reported.

This changes the semantics of the handler given to
Printexc.set_uncaught_exception_handler. It used to be that handler
would be called after all [at_exit] functions would have been called.
This is no longer the case, it is called before any [at_exit] function
that didn't get executed yet when the uncaught exception was raised.

Rather than building a closure that captures the list of [at_exit] functions to call at [exit] we explicitly store them in a mutable list in LIFO order. We change the [Pervasives.do_at_exit] function so that it sequentially pops one from the list before executing it. If the popped function raises, the general uncaught exception mecanism of [caml_main] kicks in. This eventually leads to the execution of [Printexc.handle_uncaught_exception'] which reports the uncaught exception and then calls [Pervasives.do_at_exit] to continue calling the remaining [at_exit] functions. The latter call is guarded to catch and report further exception raised by [at_exit] function in a loop, so that each uncaught expression in [at_exit] functions gets reported. This changes the semantics of the handler given to Printexc.set_uncaught_exception_handler. It used to be that handler would be called after all [at_exit] functions would have been called. This is no longer the case, it is called before any [at_exit] function that didn't get executed yet when the uncaught exception was raised.

dbuenzli · 2016-07-12T17:14:38Z

stdlib/printexc.ml

          print_raw_backtrace stderr raw_backtrace;
-          eprintf "Fatal error in uncaught exception handler: exception %s\n"
-            (to_string exn');
-          print_raw_backtrace stderr raw_backtrace';


N.B. what you see here is a diff artefact I'm not changing the structure of this see the removed None case above.

alainfrisch · 2016-07-12T20:53:20Z

Contrary to what I believed (and commented on the other PR), one can already have Pervasives.exit gives back control (through an exception), and this new PR does not change that. I'm wondering if this shouldn't be addressed as well. Couldn't one simply run the pending at_exit functions when exit is called (popping them one by one from the list ref before they are executed), and if a nested call to exit occurs, this would simply trigger a new local loop (at the top of the stack)? If any at_exit function raises an exception, it is displayed by calling the printer in Printexc, but this does not stop the loop.

(There could also be a global mechanism to remember about the first or last non-0 code passed to exit to support the case of nested exits.)

dbuenzli · 2016-07-12T21:12:46Z

Contrary to what I believed (and commented on the other PR), one can already have Pervasives.exit gives back control (through an exception), and this new PR does not change that. I'm wondering if this shouldn't be addressed as well.

Indeed and somehow I agree exit should be a sink that never returns.

If any at_exit function raises an exception, it is displayed by calling the printer in Printexc, but this does not stop the loop.

The only problem is that we can't access Printexc's function in Pervasives due to the dependency chain. This is what this comment said and I tried to circumvent via this "solution".

Now I didn't dig too far into Printexc to see if this this can maybe be reversed (of course at the moment the call made by Printexc.handle_uncaught_exception to Pervasives.do_at_exit prevents this, but this may not be needed if we handle the loop in Pervasives).

Will dig further...

alainfrisch · 2016-07-12T21:29:09Z

The "custom uncaught exception handlers" take a raw_backtrace argument, and the default one rely on backtrace functions from Printexc. It does not seem realistic to move all that to Pervasives.

But there is already a "for system use only" section in Pervasives (also exposing do_at_exit), and it could be ok to add a forward reference to Printexc there to break the cycle. (The code in Pervasives must behave correctly if Printexc is not linked in, though.)

Alternatively, one could collect all exceptions raised by at_exit hooks in a list ref exposed by Pervasives to the runtime system and let the runtime system call the uncaught exception printer registered by Printexc (although we would probably not get the stack trace in such cases).

xavierleroy · 2017-10-19T15:08:42Z

The issue is still open (e.g. in the original Mantis report), so I'm reading this discussion again. I cannot get an idea of which semantics (singular or plural) @dbuenzli , @alainfrisch , and the rest of us expect for an uncaught exception raised out of an at_exit function. Not the semantics currently implemented, that's for sure, but which one(s) instead? Please describe the semantics in terms of behaviors, not in terms of implementation of at_exit. That will come later.

dbuenzli · 2017-10-19T15:27:08Z

From the documentation bits of this PR I think we want this:

If the registered function raises an uncaught exception it is given to the handler setup by {!Printexc.set_uncaught_exception_handler}.

dbuenzli · 2017-10-19T15:37:25Z

However, as noted, this changes the semantics of {!Printexc.set_uncaught_exception_handler} which somehow seems to have been designed under the assumption that it will be called with a single unhandled exception from the program. Not necessarily in fact since the comment doesn't say when it is given to the uncaught the exception handler.

xavierleroy · 2017-10-19T15:51:47Z

If the registered function raises an uncaught exception it is given to the handler setup by {!Printexc.set_uncaught_exception_handler}.

This doesn't say what happens when the handler calls exit itself, although I guess you intend this to execute the remaining at_exit finalizers (those that haven't been executed yet), right? Still, I can't convince myself this is the obvious thing to do. Why not, say, ignore the exceptions raised by at_exit functions? (Maybe not a great idea, but just an example of some other behavior.)

dbuenzli · 2017-10-19T16:06:49Z

This doesn't say what happens when the handler calls exit itself,

Ah ok you also wanted that story aswell. This is MPR7178 and discussed in GPR675 in particular for me this comment.

So to sum up the full story for me would be for an at_exit handler f:

The registered function f is wrapped by at_exit with a catch all exception handler, if f raises an exception it is eventually passed to Printexc.set_uncaught_exception_handler
If the registered function f calls exit, Invalid_argument is raised (and thus catched by the protecting catch all handler).
Possibly, introduce a function exit_at_exit : int -> unit that allows to set the ret code in an at_exit handler with the semantics that the last exit_at_exit executed with non-zero exit code defines the retcode of the program.

xavierleroy · 2017-10-19T17:26:30Z

Thanks a lot @dbuenzli for having summarized the whole story. I'm still unsure where we go from there: is your semantics the one to be pursued or shall we look for something else?

dbuenzli · 2017-10-19T18:35:32Z

An easier one would be to have 1.+2. without the Printexc. business this would at least solve the issues in some way.

While I'm not so fond about systems who silently swallow errors as if nothing happened, one could argue that writing an exit handler requires care and thus mandate the programmer do the Printexc business herself in her fs --- which she perfectly can.

Regarding 3. the actual use cases (one is here) may be too rare to justify it.

dbuenzli reviewed Jul 12, 2016
View reviewed changes

dbuenzli closed this Jul 12, 2016

xavierleroy mentioned this pull request May 17, 2018

MPR#7796: avoid running at_exit functions twice #1783

Closed

dbuenzli deleted the mpr-7253-fix-raising-at-exit branch November 7, 2018 14:04

dbuenzli mentioned this pull request Nov 30, 2018

protect: treat as an error the case where two exceptions are being raised #2118

Merged

vicuna mentioned this pull request Mar 14, 2019

at_exit functions get called twice if a callback raises and prevents earlier handlers to execute. #7253

Closed

dbuenzli mentioned this pull request Oct 28, 2019

Thread.exit considered harmful #9071

Closed

dbuenzli mentioned this pull request Apr 27, 2022

Exception raising Stdlib.at_exit callback breaks Stdlib.exit #11221

Open

stedolan pushed a commit to stedolan/ocaml that referenced this pull request Sep 21, 2022

Remove 32-bit target support from To_cmm (ocaml#685)

4d2f22b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix MPR7253, exit behaviour in case of raising at_exit functions.#685

Fix MPR7253, exit behaviour in case of raising at_exit functions.#685
dbuenzli wants to merge 1 commit intoocaml:trunkfrom
dbuenzli:mpr-7253-fix-raising-at-exit

dbuenzli commented Jul 12, 2016 •

edited

Loading

Uh oh!

dbuenzli Jul 12, 2016

Uh oh!

alainfrisch commented Jul 12, 2016

Uh oh!

dbuenzli commented Jul 12, 2016

Uh oh!

alainfrisch commented Jul 12, 2016

Uh oh!

xavierleroy commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017 •

edited

Loading

Uh oh!

xavierleroy commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017 •

edited

Loading

Uh oh!

xavierleroy commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

dbuenzli commented Jul 12, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dbuenzli Jul 12, 2016

Choose a reason for hiding this comment

Uh oh!

alainfrisch commented Jul 12, 2016

Uh oh!

dbuenzli commented Jul 12, 2016

Uh oh!

alainfrisch commented Jul 12, 2016

Uh oh!

xavierleroy commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xavierleroy commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xavierleroy commented Oct 19, 2017

Uh oh!

dbuenzli commented Oct 19, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dbuenzli commented Jul 12, 2016 •

edited

Loading

dbuenzli commented Oct 19, 2017 •

edited

Loading

dbuenzli commented Oct 19, 2017 •

edited

Loading