Fix stack-overflow in scheduler#3
Merged
avsm merged 1 commit intoocaml-multicore:mainfrom Mar 11, 2021
Merged
Conversation
There's no need to call the scheduler again manually after `continue`. The coroutine will return to the scheduler anyway when done.
Collaborator
Author
|
@avsm : I'm not entirely sure how this is supposed to work. Does this look right to you? |
Contributor
|
This looks right; I'd planned to make complete_rw_req and the schedule function mutually recursive anyway, since there needs to be a direct control flow line back to the scheduler in this case. We could do with a test case that spawns a few billion fibres to test the stack safety... |
talex5
added a commit
to talex5/eio
that referenced
this pull request
Mar 12, 2021
Before, the continuations all returned `unit`. This makes it easy to get confused about what it means (as in ocaml-multicore#1 and ocaml-multicore#3). Using a distinct type for this should prevent such errors in future. The error in ocaml-multicore#1 now results in: Error: This expression has type effect but an expression was expected of type [ `Exit_scheduler ] The error in ocaml-multicore#3 now results in: Error: This expression has type [ `Exit_scheduler ] but an expression was expected of type unit because it is in the left-hand side of a sequence
talex5
added a commit
to talex5/eio
that referenced
this pull request
Nov 11, 2021
Before, the continuations all returned `unit`. This makes it easy to get confused about what it means (as in ocaml-multicore#1 and ocaml-multicore#3). Using a distinct type for this should prevent such errors in future. The error in ocaml-multicore#1 now results in: Error: This expression has type effect but an expression was expected of type [ `Exit_scheduler ] The error in ocaml-multicore#3 now results in: Error: This expression has type [ `Exit_scheduler ] but an expression was expected of type unit because it is in the left-hand side of a sequence
talex5
added a commit
to talex5/eio
that referenced
this pull request
Jul 27, 2023
The `execve` action allocated the arrays in the forked child process.
However, in a multi-threaded program we might have forked while another
thread had the malloc lock. In that case, the child would wait forever
because it inherited the locked mutex but not the thread that would
unlock it. e.g.
#0 futex_wait (private=0, expected=2, futex_word=0xffff9509cb10 <main_arena>) at ../sysdeps/nptl/futex-internal.h:146
ocaml-multicore#1 __GI___lll_lock_wait_private (futex=futex@entry=0xffff9509cb10 <main_arena>) at ./nptl/lowlevellock.c:34
ocaml-multicore#2 0x0000ffff94f8e780 in __libc_calloc (n=<optimized out>, elem_size=<optimized out>) at ./malloc/malloc.c:3650
ocaml-multicore#3 0x0000aaaac67cfa68 in make_string_array (errors=errors@entry=37, v_array=281472912006504) at fork_action.c:47
ocaml-multicore#4 0x0000aaaac67cfaf4 in action_execve (errors=37, v_config=281472912003024) at fork_action.c:61
ocaml-multicore#5 0x0000aaaac67cf93c in eio_unix_run_fork_actions (errors=errors@entry=37, v_actions=281472912002960) at fork_action.c:19
talex5
added a commit
to talex5/eio
that referenced
this pull request
Jul 28, 2023
The `execve` action allocated the arrays in the forked child process.
However, in a multi-threaded program we might have forked while another
thread had the malloc lock. In that case, the child would wait forever
because it inherited the locked mutex but not the thread that would
unlock it. e.g.
#0 futex_wait (private=0, expected=2, futex_word=0xffff9509cb10 <main_arena>) at ../sysdeps/nptl/futex-internal.h:146
ocaml-multicore#1 __GI___lll_lock_wait_private (futex=futex@entry=0xffff9509cb10 <main_arena>) at ./nptl/lowlevellock.c:34
ocaml-multicore#2 0x0000ffff94f8e780 in __libc_calloc (n=<optimized out>, elem_size=<optimized out>) at ./malloc/malloc.c:3650
ocaml-multicore#3 0x0000aaaac67cfa68 in make_string_array (errors=errors@entry=37, v_array=281472912006504) at fork_action.c:47
ocaml-multicore#4 0x0000aaaac67cfaf4 in action_execve (errors=37, v_config=281472912003024) at fork_action.c:61
ocaml-multicore#5 0x0000aaaac67cf93c in eio_unix_run_fork_actions (errors=errors@entry=37, v_actions=281472912002960) at fork_action.c:19
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
There's no need to call the scheduler again manually after
continue. The coroutine will return to the scheduler anyway when done.