-
Notifications
You must be signed in to change notification settings - Fork 8.3k
[CI crash] ReplicatedMergeTreePartCheckThread crash during execution #94662
Copy link
Copy link
Closed
Labels
Description
Stack trace details
The sipHash64(st.trace_full) is 14342350456720407598
The trace is from the master or release branch: True
The query for CIDB to compare the trace with the known one:
WITH
(
SELECT groupArrayDistinct(cleanStackTrace(trace_full) AS trace) FROM default.stack_traces
WHERE sipHash64(trace) IN (14342350456720407598, {ANOTHER_TRACE_HASH}) -- FIXME: replace with the known hash
) AS traces,
1.97 AS alpha,
stack_frame_weights AS (
WITH
(
SELECT count()
FROM default.stack_traces
FINAL
) AS total,
2.0 AS beta,
3.7 AS gamma
SELECT
arrayJoin(cleanStackTrace(trace_full)) AS frame,
countDistinct(trace_full) AS count,
log(total / count) AS IDF,
sigmoid(beta * (IDF - gamma)) AS weight
FROM default.stack_traces
FINAL
GROUP BY frame
),
(SELECT groupArray(weight) AS w, groupArray(frame) AS f FROM stack_frame_weights) AS weights,
(trace -> arrayMap((_frame, pos) -> (pow(pos, -alpha) * arrayFirst(w, f -> (f = _frame), weights.w, weights.f)), trace, arrayEnumerate(trace))) AS get_trace_weights,
(arr -> arrayStringConcat(arr, '\n')) AS joinArr
SELECT arraySimilarity(traces[1], traces[2], get_trace_weights(traces[1]) AS weights1, get_trace_weights(traces[2]) AS weights2) AS similarity,
arrayLevenshteinDistanceWeighted(traces[1], traces[2], weights1, weights2),
joinArr(traces[1]), joinArr(traces[2]), joinArr(weights1), joinArr(weights2)
The following new stack trace from CI Logs system.crash_log found:
DB::handle_error_code(String const&, std::basic_string_view<char, std::char_traits<char>>, int, bool, std::vector<void*, std::allocator<void*>> const&)
DB::ReplicatedMergeTreePartCheckThread::run()
DB::BackgroundSchedulePoolTaskInfo::execute(DB::BackgroundSchedulePool&)
DB::BackgroundSchedulePool::threadFunction()
ThreadPoolImpl<std::thread>::ThreadFromThreadPool::worker()
void* std::__thread_proxy[$ABI]<std::tuple<std::unique_ptr<std::__thread_struct, std::default_delete<std::__thread_struct>>, void (ThreadPoolImpl<std::thread>::ThreadFromThreadPool::*)(), ThreadPoolImpl<std::thread>::ThreadFromThreadPool*>>(void*)
Possible causes:
- Invalid or corrupted part data during check
- Race condition in part check logic
- Incorrect state handling in ReplicatedMergeTreePartCheckThread
- Uninitialized or invalid pointers in part metadata
- Improper synchronization between threads in BackgroundSchedulePool
The stack trace appeared in the following checks:
Reactions are currently unavailable