-
Notifications
You must be signed in to change notification settings - Fork 7.4k
[Bug] Threaded actor stress test invokes SIGSEGV #19748
Copy link
Copy link
Closed
Labels
bugSomething that is supposed to be working; but isn'tSomething that is supposed to be working; but isn'ttriageNeeds triage (eg: priority, bug/not-bug, and owning component)Needs triage (eg: priority, bug/not-bug, and owning component)
Milestone
Description
Search before asking
- I searched the issues and found no similar issues.
Ray Component
Ray Core
What happened + What you expected to happen
When I ran the local threaded actor tests with 10 max_concurrency, this segfaults were observed.
We should fix it.
(Child pid=12953) *** SIGSEGV received at time=1635241081 ***
(Child pid=12953) PC: @ 0x10e1ffdac (unknown) ray::core::CoreWorkerProcess::SetCurrentThreadWorkerId()
(Child pid=12953) @ 0x10e986a64 (unknown) absl::lts_20210324::WriteFailureInfo()
(Child pid=12953) @ 0x10e9867cb (unknown) absl::lts_20210324::AbslFailureSignalHandler()
(Child pid=12953) @ 0x7fff204a3d7d (unknown) _sigtramp
(Child pid=12953) @ 0x7fff20286877 (unknown) small_malloc_should_clear
(Child pid=12953) @ 0x10e2073b1 (unknown) ray::core::CoreWorker::ExecuteTask()
(Child pid=12953) [failure_signal_handler.cc : 334] RAW: Signal 11 raised at PC=0x10e1ffdac while already in AbslFailureSignalHandler()
(Child pid=12953) @ 0x10e273bb0 (unknown) std::__1::__function::__func<>::operator()()
(Child pid=12953) @ 0x10e2eec83 (unknown) std::__1::__function::__func<>::operator()()
(Child pid=12953) @ 0x10e2764c1 (unknown) ray::core::InboundRequest::Accept()
(Child pid=12953) @ 0x10e2eb0fa (unknown) boost::asio::detail::executor_op<>::do_complete()
(Child pid=12953) @ 0x10e971866 (unknown) boost::asio::detail::scheduler::do_run_one()
(Child pid=12953) @ 0x10e967f01 (unknown) boost::asio::detail::scheduler::run()
(Child pid=12953) @ 0x10e980a9b (unknown) boost::asio::detail::posix_thread::func<>::run()
(Child pid=12953) @ 0x10e96e650 (unknown) boost_asio_detail_posix_thread_function
(Child pid=12953) @ 0x7fff2045e8fc (unknown) _pthread_start
(Child pid=12953) @ 0x7fff2045a443 (unknown) thread_start
Versions / Dependencies
master
Reproduction script
Run the unit test in this PR with max_concurrency=10
Anything else
No response
Are you willing to submit a PR?
- Yes I am willing to submit a PR!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
bugSomething that is supposed to be working; but isn'tSomething that is supposed to be working; but isn'ttriageNeeds triage (eg: priority, bug/not-bug, and owning component)Needs triage (eg: priority, bug/not-bug, and owning component)