[Bug] Threaded actor stress test invokes SIGSEGV #19748

@rkooo567

Search before asking

  • I searched the issues and found no similar issues.

Ray Component

Ray Core

What happened + What you expected to happen

When I ran the local threaded actor tests with max_concurrency=10, the following segfault was observed.

We should fix it.

(Child pid=12953) *** SIGSEGV received at time=1635241081 ***
(Child pid=12953) PC: @        0x10e1ffdac  (unknown)  ray::core::CoreWorkerProcess::SetCurrentThreadWorkerId()
(Child pid=12953)     @        0x10e986a64  (unknown)  absl::lts_20210324::WriteFailureInfo()
(Child pid=12953)     @        0x10e9867cb  (unknown)  absl::lts_20210324::AbslFailureSignalHandler()
(Child pid=12953)     @     0x7fff204a3d7d  (unknown)  _sigtramp
(Child pid=12953)     @     0x7fff20286877  (unknown)  small_malloc_should_clear
(Child pid=12953)     @        0x10e2073b1  (unknown)  ray::core::CoreWorker::ExecuteTask()
(Child pid=12953) [failure_signal_handler.cc : 334] RAW: Signal 11 raised at PC=0x10e1ffdac while already in AbslFailureSignalHandler()
(Child pid=12953)     @        0x10e273bb0  (unknown)  std::__1::__function::__func<>::operator()()
(Child pid=12953)     @        0x10e2eec83  (unknown)  std::__1::__function::__func<>::operator()()
(Child pid=12953)     @        0x10e2764c1  (unknown)  ray::core::InboundRequest::Accept()
(Child pid=12953)     @        0x10e2eb0fa  (unknown)  boost::asio::detail::executor_op<>::do_complete()
(Child pid=12953)     @        0x10e971866  (unknown)  boost::asio::detail::scheduler::do_run_one()
(Child pid=12953)     @        0x10e967f01  (unknown)  boost::asio::detail::scheduler::run()
(Child pid=12953)     @        0x10e980a9b  (unknown)  boost::asio::detail::posix_thread::func<>::run()
(Child pid=12953)     @        0x10e96e650  (unknown)  boost_asio_detail_posix_thread_function
(Child pid=12953)     @     0x7fff2045e8fc  (unknown)  _pthread_start
(Child pid=12953)     @     0x7fff2045a443  (unknown)  thread_start

Versions / Dependencies

master

Reproduction script

Run the unit test in PR #19746 with max_concurrency=10.
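
The actual test lives in the linked PR and is not reproduced here. As a rough, hypothetical sketch of what a threaded actor stress test at this concurrency level might look like: the names `StressTarget`, `run_locally`, and `run_with_ray` are made up for illustration and are not taken from the PR. Only `ray.init`, `ray.remote`, `.options(max_concurrency=...)`, and `ray.get` are Ray calls; everything else is plain Python.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

MAX_CONCURRENCY = 10  # the value quoted in this report


class StressTarget:
    """Hypothetical actor body: a counter guarded by a lock."""

    def __init__(self):
        self._lock = threading.Lock()
        self._count = 0

    def bump(self):
        # A threaded actor may run this method on several threads at once,
        # so the shared counter needs its own synchronization.
        with self._lock:
            self._count += 1
            return self._count

    def count(self):
        with self._lock:
            return self._count


def run_locally(num_calls=1000, max_concurrency=MAX_CONCURRENCY):
    """Exercise StressTarget from a plain thread pool (no Ray involved)."""
    target = StressTarget()
    with ThreadPoolExecutor(max_workers=max_concurrency) as pool:
        list(pool.map(lambda _: target.bump(), range(num_calls)))
    return target.count()


def run_with_ray(num_calls=1000, max_concurrency=MAX_CONCURRENCY):
    """Same workload as a Ray threaded actor; requires Ray to be installed."""
    import ray  # imported lazily so the rest of the sketch runs without Ray

    ray.init(ignore_reinit_error=True)
    actor_cls = ray.remote(StressTarget).options(max_concurrency=max_concurrency)
    actor = actor_cls.remote()
    # Submit many concurrent calls, then wait for all of them to finish.
    ray.get([actor.bump.remote() for _ in range(num_calls)])
    return ray.get(actor.count.remote())
```

Calling `run_locally()` checks the workload itself without Ray. `run_with_ray()` submits the same calls to a threaded actor, which is the kind of load under which the crash above was seen. Whether this particular sketch triggers the segfault is not known.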

Anything else

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Labels

  • bug: Something that is supposed to be working; but isn't
  • triage: Needs triage (eg: priority, bug/not-bug, and owning component)