fix: Write reports on concurrent crashes by denrase · Pull Request #7340 · getsentry/sentry-cocoa

denrase · 2026-02-02T16:23:09Z

📜 Description

Track which thread is handling the crash via g_crashingThread
Same thread re-entering = recrash (allowed)
Different thread = block for 2s, then continue
Use C11 atomics (<stdatomic.h>)
Rename sentrycrashcm_notifyFatalExceptionCaptured to sentrycrashcm_notifyFatalException to move it closer to KSCrash naming.
Merged notify + suspend into sentrycrashcm_notifyFatalException to prevent deadlock (concurrency check must run before any threads are suspended)

This eliminates the ENOENT race by ensuring only one thread ever enters crash handling at a time.
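
The thread-aware guard described above can be sketched roughly as follows. This is a minimal illustration, not the SDK's implementation: apart from g_crashingThread, every name here (CrashEntry, classifyCrashEntry, NO_THREAD) is hypothetical, and the thread is represented by an opaque non-zero uintptr_t token rather than a real pthread_t.

```c
#include <stdatomic.h>
#include <stdint.h>

// Hypothetical result type: how a thread entering crash handling is classified.
typedef enum {
    CRASH_ENTRY_FIRST,      // this thread claimed crash handling
    CRASH_ENTRY_RECRASH,    // the thread already handling the crash crashed again
    CRASH_ENTRY_CONCURRENT  // a different thread is already handling a crash
} CrashEntry;

// Assumption: 0 is never a valid thread token.
#define NO_THREAD ((uintptr_t)0)

// Token of the thread that currently owns crash handling, or NO_THREAD.
static _Atomic uintptr_t g_crashingThread = NO_THREAD;

// Tries to claim crash handling for `self` with a single atomic
// compare-and-swap, so at most one thread can ever be the owner.
static CrashEntry
classifyCrashEntry(uintptr_t self)
{
    uintptr_t expected = NO_THREAD;
    if (atomic_compare_exchange_strong(&g_crashingThread, &expected, self)) {
        return CRASH_ENTRY_FIRST;
    }
    // The swap failed, so `expected` now holds the current owner's token.
    return expected == self ? CRASH_ENTRY_RECRASH : CRASH_ENTRY_CONCURRENT;
}
```

In this sketch a caller that receives CRASH_ENTRY_CONCURRENT would be the one that blocks (2s in the PR's description) instead of touching the report, while CRASH_ENTRY_RECRASH would follow the existing recrash path.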

Aligned with KSCrash upstream (KSCrashMonitor.c):

💡 Motivation and Context

When multiple threads crash simultaneously, the second crash was incorrectly treated as a "recrash" (crash during crash handling) because g_handlingFatalException had no thread awareness.
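
To illustrate why a guard without thread awareness misclassifies concurrent crashes, here is a small sketch of a flag-only check. It is an assumption about the general shape of such a guard, not the SDK's previous code; flagOnlyIsRecrash is a hypothetical helper, and the flag is modelled as an atomic_bool purely for the example.

```c
#include <stdatomic.h>
#include <stdbool.h>

// A single flag records only that some thread is handling a crash,
// not which one.
static atomic_bool g_handlingFatalException = false;

// Returns true when the caller is treated as a recrash.
// The first caller sets the flag and gets false; every later caller gets
// true, whether it is the same thread crashing again or a different thread
// crashing at the same time.
static bool
flagOnlyIsRecrash(void)
{
    return atomic_exchange(&g_handlingFatalException, true);
}
```

Because the second caller always sees true, a crash on another thread takes the recrash path and may try to read a report the first thread has not written yet.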

Fixes #3296

💚 How did you test it?

📝 Checklist

You have to check all boxes before merging:

I added tests to verify the changes.
No new PII added or SDK only sends newly added PII if sendDefaultPII is enabled.
I updated the docs if needed.
I updated the wizard if needed.
Review from the native team if needed.
No breaking change or entry added to the changelog.
No breaking change for hybrid SDKs or communicated to hybrid SDKs.

Concurrent crashes from different threads were incorrectly detected as recrash, causing ENOENT errors when the recrash logic tried to read a crash report that hadn't been written yet. Add thread-aware atomic guard to ensure only one thread handles a crash. Same-thread recrash is still detected correctly; concurrent crashes from other threads are discarded. Fixes #3296

linear · 2026-02-02T16:23:13Z

COCOA-14 Sentry fails to write report to disk with concurrent fatal crashes

github-actions · 2026-02-02T16:23:34Z

Semver Impact of This PR

🟢 Patch (bug fixes)

📋 Changelog Preview

This is how your changes will appear in the changelog.
Entries from this PR are highlighted with a left border (blockquote style).

New Features ✨

(visionOS) Enable MetricKit Integration for visionOS by denrase in #7466

Bug Fixes 🐛

Write reports on concurrent crashes by denrase in #7340

Revert TelemetryScopeApplier gating of user attributes on sendDefaultPII by philprime in #7437

Internal Changes 🔧

Deps

Bump actions/checkout from 6.0.1 to 6.0.2 by dependabot in #7463
Bump getsentry/craft from 2.21.2 to 2.21.4 by dependabot in #7462
Bump getsentry/craft/.github/workflows/changelog-preview.yml from 2.21.2 to 2.21.4 by dependabot in #7461
Bump fastlane from 2.232.0 to 2.232.1 by dependabot in #7460
Bump fastlane-plugin-sentry from 2.0.0 to 2.1.0 by dependabot in #7459
Bump faraday from 1.10.4 to 1.10.5 by dependabot in #7446

🤖 This preview updates automatically when you update the PR.

denrase · 2026-02-02T16:24:19Z

Please look closely at the monitor changes (SentryCrashMonitor_MachException.c, SentryCrashMonitor_Signal.c, SentryCrashMonitor_CPPException.cpp, SentryCrashMonitor_NSException.m). This is all sensitive async-signal-safe code, which I'm not familiar with, and we don't seem to have extensive test coverage here. So please give me feedback on whether bailing out in this concurrent crash case is the correct thing to do.

codecov · 2026-02-02T16:34:22Z

Codecov Report

❌ Patch coverage is 89.65517% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.324%. Comparing base (c899804) to head (366847f).
⚠️ Report is 1 commits behind head on main.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
...ording/Monitors/SentryCrashMonitor_MachException.c	0.000%	1 Missing ⚠️
...ecording/Monitors/SentryCrashMonitor_NSException.m	0.000%	1 Missing ⚠️
...ash/Recording/Monitors/SentryCrashMonitor_Signal.c	0.000%	1 Missing ⚠️

Additional details and impacted files

@@              Coverage Diff              @@
##              main     #7340       +/-   ##
=============================================
- Coverage   85.324%   85.324%   -0.001%     
=============================================
  Files          480       480               
  Lines        28620     28632       +12     
  Branches     12371     12398       +27     
=============================================
+ Hits         24420     24430       +10     
- Misses        4150      4154        +4     
+ Partials        50        48        -2

Files with missing lines	Coverage Δ
...entryCrash/Recording/Monitors/SentryCrashMonitor.c	`84.000% <100.000%> (+3.672%)`	⬆️
...rding/Monitors/SentryCrashMonitor_CPPException.cpp	`77.876% <100.000%> (-0.195%)`	⬇️
Sources/SentryCrash/Recording/SentryCrashC.c	`77.777% <100.000%> (+0.277%)`	⬆️
...ording/Monitors/SentryCrashMonitor_MachException.c	`36.286% <0.000%> (+0.152%)`	⬆️
...ecording/Monitors/SentryCrashMonitor_NSException.m	`31.914% <0.000%> (+0.664%)`	⬆️
...ash/Recording/Monitors/SentryCrashMonitor_Signal.c	`61.261% <0.000%> (+0.546%)`	⬆️

... and 5 files with indirect coverage changes

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

denrase · 2026-02-02T16:34:37Z

Reproduced:

@IBAction func asyncCrash(_ sender: Any) {
    for i in 1...6 {
        DispatchQueue.global().async {
            fatalError("Concurrent crash #\(i)")
        }
    }
}

Capture:
https://denrase.sentry.io/issues/7236198917/?project=4508007036551168&query=is%3Aunresolved&referrer=issue-stream

philipphofmann · 2026-02-05T13:00:35Z

Sorry that it took so long. It just took me 2 minutes to give input 🤦

So first, I would double-check KSCrash. They have made plenty of fixes over time, and they might have a similar fix. And second, we need a review from a native dev. We could ping Mischan, but I think he might be sick this week.

Thanks for looking into this @denrase 😃

denrase · 2026-02-09T15:29:47Z

@philipphofmann Ok, I aligned this with the KSCrash solution, where they suspend for 2 seconds when a concurrent thread also crashed. Also updated naming. This fixes our issue with incorrectly detecting a re-crash and also aligns us closer with KSCrash. Of course we still need a native team review here. 🙇

itaybre

LGTM, just some small comments

itaybre · 2026-02-09T19:36:28Z

(sorry for the duplicated comments, github was having issues)

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

github-actions · 2026-02-10T15:11:11Z

Performance metrics 🚀

	Plain	With Sentry	Diff
Startup time	1225.62 ms	1258.84 ms	33.22 ms
Size	24.14 KiB	1.11 MiB	1.09 MiB

Baseline results on branch: main

Startup times

Revision	Plain	With Sentry	Diff
`50e7b3e`	1221.54 ms	1250.81 ms	29.27 ms
`ffe0649`	1213.35 ms	1248.64 ms	35.29 ms
`a1a9260`	1192.15 ms	1229.80 ms	37.64 ms
`58a9225`	1211.40 ms	1238.88 ms	27.48 ms
`e8cc4e7`	1216.94 ms	1246.62 ms	29.68 ms
`0e4b033`	1218.59 ms	1248.51 ms	29.92 ms
`41b4993`	1215.15 ms	1248.14 ms	32.99 ms
`88c11f3`	1226.18 ms	1262.62 ms	36.44 ms
`5ca545a`	1219.06 ms	1244.59 ms	25.53 ms
`e92ab66`	1228.86 ms	1258.43 ms	29.57 ms

App size

Revision	Plain	With Sentry	Diff
`50e7b3e`	24.14 KiB	1.04 MiB	1.02 MiB
`ffe0649`	24.14 KiB	1.06 MiB	1.04 MiB
`a1a9260`	24.14 KiB	1.08 MiB	1.06 MiB
`58a9225`	24.14 KiB	1.06 MiB	1.04 MiB
`e8cc4e7`	24.14 KiB	1.11 MiB	1.09 MiB
`0e4b033`	24.14 KiB	1.11 MiB	1.09 MiB
`41b4993`	24.14 KiB	1.06 MiB	1.04 MiB
`88c11f3`	24.14 KiB	1.11 MiB	1.09 MiB
`5ca545a`	24.14 KiB	1.06 MiB	1.04 MiB
`e92ab66`	24.14 KiB	1.10 MiB	1.08 MiB

Previous results on branch: repro/cocoa-14-concurrent-fatal-crash

Startup times

Revision	Plain	With Sentry	Diff
`a4250c1`	1224.16 ms	1253.79 ms	29.62 ms

App size

Revision	Plain	With Sentry	Diff
`a4250c1`	24.14 KiB	1.10 MiB	1.08 MiB

denrase · 2026-02-16T13:36:42Z

@supervacuus I am once again asking for your review. 🙇

supervacuus

This looks good. The thing that surprises me the most, though, is that all monitors that invoke sentrycrashcm_notifyFatalException() have very different behavior:

Fatal signal handlers preempt the thread that crashed (and can preempt themselves), so they can re-enter not only from other threads but also from the same thread.
Mach exceptions are handled on a port listener thread and would only re-enter from another thread if the handler dishes out the retrieved exception to worker threads.
The cxa_throw hook only re-enters from other threads (the edge case is throwing from cxa_throw, of course, which is certainly less defined than raising a signal from a signal handler).

(I don't know enough about the NSException mechanism to categorize)

So, while guarding against reentrancy and async-signal-safety is certainly not a bad idea in all of the above, the exposure to certain issues is very different. This is primarily something that I would inline document. For instance, wasHandlingFatalException would never appear in Mach exception handling (unless that handler defers to worker threads), and an assigned thread wouldn't be the "crashing thread" but the crash handler thread. The code, as written in sentrycrashcm_notifyFatalException(), appears to be tuned to POSIX signal handling.

denrase · 2026-02-17T15:02:37Z

@supervacuus I have added additional comments derived from your feedback, do you think this is good enough?

supervacuus · 2026-02-17T15:12:41Z

@supervacuus I have added additional comments derived from your feedback, do you think this is good enough?

I think so. It is now clear that this is primarily a first preventative measure, using the worst-case monitor (the signal handler) as a baseline in the face of concurrent crashes, and is much better than how it was handled before. I know inline docs can get old quickly, but it certainly helps to orient whoever works on this next.

denrase changed the title ~~fix: Distinguish concurrent crashes from recrash~~ fix: Write reports on concurrent crashes (first wins) Feb 2, 2026

denrase added 2 commits February 2, 2026 17:26

add cl entry

010a947

Merge branch 'main' into repro/cocoa-14-concurrent-fatal-crash

6a6417a

denrase added 8 commits February 9, 2026 13:07

Merge branch 'main' into repro/cocoa-14-concurrent-fatal-crash

07c382e

sleep 2 s if we have a concurrent crash while handling a crash

8ca34fe

Merge branch 'main' into repro/cocoa-14-concurrent-fatal-crash

4181bfb

move start into async call

4e12e78

suspend env after notifying, align naming with KSCrash

5bd268c

Merge branch 'main' into repro/cocoa-14-concurrent-fatal-crash

5a58ca0

re-inline replyToMachExceptionMessage

d0d4e91

don’t early rertun so we don’t ahve duplicated raise path

2cace55

denrase changed the title ~~fix: Write reports on concurrent crashes (first wins)~~ fix: Write reports on concurrent crashes Feb 9, 2026

update cl

392972c

denrase added the ready-to-merge label ("Use this label to trigger all PR workflows") Feb 9, 2026

denrase marked this pull request as ready for review February 9, 2026 15:30

denrase requested review from itaybre, noahsmartin, philipphofmann and philprime as code owners February 9, 2026 15:30

sentry Bot reviewed Feb 9, 2026

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor.c

cursor Bot reviewed Feb 9, 2026

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor.c

itaybre reviewed Feb 9, 2026

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor_MachException.c Outdated

itaybre reviewed Feb 9, 2026

itaybre approved these changes Feb 9, 2026

philprime reviewed Feb 10, 2026

Comment thread Sources/Sentry/include/SentryCrashMonitor.h

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor.c

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor.c

denrase added 3 commits February 10, 2026 14:30

Merge branch 'main' into repro/cocoa-14-concurrent-fatal-crash

39b5849

store g_crashingThread sooner to avoid misidentifies same-thread recrash

248c58e

update comment

d501fa4

cursor Bot reviewed Feb 10, 2026

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor_MachException.c Outdated

denrase added 2 commits February 10, 2026 15:09

Restore Set g_isHandlingCrash = true; before calling sentrycrashcm_notifyFatalException

cf5d269

fix typo in changelog

7242496

denrase requested a review from philprime February 16, 2026 13:34

supervacuus reviewed Feb 17, 2026

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor.c

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor.c

Comment thread Sources/SentryCrash/Recording/Monitors/SentryCrashMonitor.c

denrase added 5 commits February 17, 2026 15:16

Merge branch 'main' into repro/cocoa-14-concurrent-fatal-crash

d0526da

document that const pthread_t self = pthread_self(); is not async-sgnal safe

08c5d1c

document _Atomic usage

5c25a32

add comments/context to handlers

25c5592

add more comments from mischan

366847f

denrase requested a review from supervacuus February 17, 2026 15:03

denrase merged commit bcabca0 into main Feb 17, 2026
206 of 209 checks passed

denrase deleted the repro/cocoa-14-concurrent-fatal-crash branch February 17, 2026 16:19

Copilot AI mentioned this pull request Feb 20, 2026

fix: Resolve data race crash in monitorCachedData #7423

Merged

7 tasks
