Coredump when releasing PageStorage snapshot #2249

@JaySon-Huang

It seems that running sdb_data with default parameters and adding a scheduler in PD (scheduler add shuffle-region-scheduler) makes TiFlash coredump randomly.

This may be similar to #1429.

The full logs were truncated and only the error log is left. Each crash has a stack like this:

2021.06.22 04:22:51.940237 [ 767428 ] <Error> BaseDaemon: (from thread 90) Terminate called after throwing an instance of std::system_error
what(): Resource deadlock avoided
Stack trace:

0. bin/tiflash/tiflash(StackTrace::StackTrace()+0x15) [0x364de75]
1. bin/tiflash/tiflash() [0x3709b8e]
2. bin/tiflash/tiflash(__cxxabiv1::__terminate(void (*)())+0x5) [0x8457c45]
3. bin/tiflash/tiflash(__cxa_call_terminate+0x38) [0x8458d48]
4. bin/tiflash/tiflash(__gxx_personality_v0+0x2e7) [0x84583f7]
5. bin/tiflash/tiflash() [0x84ee972]
6. bin/tiflash/tiflash() [0x84ef18e]
7. bin/tiflash/tiflash(__cxa_throw+0x35) [0x8459425]
8. bin/tiflash/tiflash(std::__throw_system_error(int)+0x7d) [0x846661d]
9. bin/tiflash/tiflash(std::_Sp_counted_ptr_inplace<DB::MVCC::VersionSetWithDelta<DB::PageEntriesForDelta, DB::PageEntriesView, DB::PageEntriesEdit, DB::DeltaVersionEditAcceptor>::Snapshot, std::allocator<DB::MVCC::VersionSetWithDelta<DB::PageEntriesForDelta, DB::PageEntriesView, DB::PageEntriesEdit, DB::DeltaVersionEditAcceptor>::Snapshot>, (__gnu_cxx::_Lock_policy)2>::_M_dispose()+0x584)
2021.06.22 04:22:51.940270 [ 767428 ] <Error> BaseDaemon: ########################################
2021.06.22 04:22:51.940335 [ 767428 ] <Error> BaseDaemon: (from thread 90) Received signal Aborted (6).
...
2021.06.22 09:26:09.464698 [ 3310080 ] <Error> BaseDaemon: (from thread 78) Terminate called after throwing an instance of std::system_error
what(): Resource deadlock avoided
Stack trace:

0. bin/tiflash/tiflash(StackTrace::StackTrace()+0x15) [0x364de75]
1. bin/tiflash/tiflash() [0x3709b8e]
2. bin/tiflash/tiflash(__cxxabiv1::__terminate(void (*)())+0x5) [0x8457c45]
3. bin/tiflash/tiflash(__cxa_call_terminate+0x38) [0x8458d48]
4. bin/tiflash/tiflash(__gxx_personality_v0+0x2e7) [0x84583f7]
5. bin/tiflash/tiflash() [0x84ee972]
6. bin/tiflash/tiflash() [0x84ef18e]
7. bin/tiflash/tiflash(__cxa_throw+0x35) [0x8459425]
8. bin/tiflash/tiflash(std::__throw_system_error(int)+0x7d) [0x846661d]
9. bin/tiflash/tiflash(std::_Sp_counted_ptr_inplace<DB::MVCC::VersionSetWithDelta<DB::PageEntriesForDelta, DB::PageEntriesView, DB::PageEntriesEdit, DB::DeltaVersionEditAcceptor>::Snapshot, std::allocator<DB::MVCC::VersionSetWithDelta<DB::PageEntriesForDelta, DB::PageEntriesView, DB::PageEntriesEdit, DB::DeltaVersionEditAcceptor>::Snapshot>, (__gnu_cxx::_Lock_policy)2>::_M_dispose()+0x584)
2021.06.22 09:26:09.478354 [ 3310080 ] <Error> BaseDaemon: ########################################
2021.06.22 09:26:09.485173 [ 3310080 ] <Error> BaseDaemon: (from thread 78) Received signal Aborted (6).
...
2021.06.22 10:11:17.095425 [ 416940 ] <Error> BaseDaemon: (from thread 128) Terminate called after throwing an instance of std::system_error
what(): Resource deadlock avoided
Stack trace:

0. bin/tiflash/tiflash(StackTrace::StackTrace()+0x15) [0x364de75]
1. bin/tiflash/tiflash() [0x3709b8e]
2. bin/tiflash/tiflash(__cxxabiv1::__terminate(void (*)())+0x5) [0x8457c45]
3. bin/tiflash/tiflash(__cxa_call_terminate+0x38) [0x8458d48]
4. bin/tiflash/tiflash(__gxx_personality_v0+0x2e7) [0x84583f7]
5. bin/tiflash/tiflash() [0x84ee972]
6. bin/tiflash/tiflash() [0x84ef18e]
7. bin/tiflash/tiflash(__cxa_throw+0x35) [0x8459425]
8. bin/tiflash/tiflash(std::__throw_system_error(int)+0x7d) [0x846661d]
9. bin/tiflash/tiflash(std::_Sp_counted_ptr_inplace<DB::MVCC::VersionSetWithDelta<DB::PageEntriesForDelta, DB::PageEntriesView, DB::PageEntriesEdit, DB::DeltaVersionEditAcceptor>::Snapshot, std::allocator<DB::MVCC::VersionSetWithDelta<DB::PageEntriesForDelta, DB::PageEntriesView, DB::PageEntriesEdit, DB::DeltaVersionEditAcceptor>::Snapshot>, (__gnu_cxx::_Lock_policy)2>::_M_dispose()+0x584)
2021.06.22 10:11:17.095467 [ 416940 ] <Error> BaseDaemon: ########################################
2021.06.22 10:11:17.095515 [ 416940 ] <Error> BaseDaemon: (from thread 128) Received signal Aborted (6).
