Conversation
```js
// For whatever reason the observer was returning duplicate navigation
// entries (the other entry types were not duplicated).
const newPerformanceEntries = dedupePerformanceEntries(
  replay.performanceEvents,
```
🤔 I don't remember if we were seeing duplicate entries across calls to the observer handler
Hmm, that test is sus; I wondered why it only fails in the esm bundle test and not in cjs as well.

Yeah, could be flaky.
@billyvg I was too eager with my claim, and my first implementation was wrong. I had discarded the fact that replay used to persist the entries (I assume in case of a clear-buffer event), meaning we cannot just dedupe the list in a single pass like I did. I refactored this to respect the original implementation by treating both lists as queues (reading from each based on startTime), which merges the two lists in O(m+n) time. I will try to simplify the implementation and add comments, as IMO it's not the most readable piece of code and might not be worth using if we want to optimize for readability.
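The queue-style merge described above can be sketched roughly like this. This is a simplified, hypothetical stand-in: `Entry` and `mergeByStartTime` are illustrative names, not the SDK's actual types, and the real `dedupePerformanceEntries` also handles deduplication, which is omitted here.

```typescript
// Simplified stand-in for the browser's PerformanceEntry (illustrative only).
interface Entry {
  name: string;
  startTime: number;
}

// Both input lists are already sorted by startTime, so we treat them as
// queues: repeatedly take the entry with the smaller startTime from the
// front of either list. This merges the two lists in O(m + n) time.
function mergeByStartTime(a: Entry[], b: Entry[]): Entry[] {
  const merged: Entry[] = [];
  let i = 0;
  let j = 0;
  while (i < a.length && j < b.length) {
    if (a[i].startTime <= b[j].startTime) {
      merged.push(a[i++]);
    } else {
      merged.push(b[j++]);
    }
  }
  // One queue is exhausted; append whatever remains in the other.
  return merged.concat(a.slice(i), b.slice(j));
}
```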
This issue has gone three weeks without activity. In another week, I will close it. But! If you comment or otherwise update it, I will reset the clock, and if you remove the label, I will leave it alone. "A weed is but an unloved flower." ― Ella Wheeler Wilcox 🥀
dedupePerformanceEntries showed up in a couple of browser profiles, so I took the liberty of optimizing it. Since perf buffer registries can grow large in the case of SPAs, the execution time will grow with them and could become a bottleneck (this is still the case after my change).
The optimization relies on the fact that entries are sorted by startTime. Because the list is sorted, any duplicates of an entry must sit in the narrow range of entries sharing its startTime, so we can dedupe in O(n) rather than O(n²) time: instead of scanning the entire list for duplicates of each entry, we only need to check that small adjacent range.
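A minimal sketch of that idea, assuming a duplicate means an entry matching on entryType, startTime, and duration (the SDK's real comparison may differ; `dedupeSorted` and the simplified `Entry` shape are hypothetical):

```typescript
// Simplified stand-in for PerformanceEntry (illustrative only).
interface Entry {
  entryType: string;
  startTime: number;
  duration: number;
}

// Single pass over a list sorted by startTime. Any duplicate of an entry
// must share its startTime, so we only look back through the already-kept
// entries while startTime still matches, instead of scanning the whole
// list for every element (O(n) with small windows vs O(n^2)).
function dedupeSorted(entries: Entry[]): Entry[] {
  const deduped: Entry[] = [];
  for (const current of entries) {
    let isDuplicate = false;
    // Walk backwards only through the window of equal startTimes.
    for (
      let j = deduped.length - 1;
      j >= 0 && deduped[j].startTime === current.startTime;
      j--
    ) {
      if (
        deduped[j].entryType === current.entryType &&
        deduped[j].duration === current.duration
      ) {
        isDuplicate = true;
        break;
      }
    }
    if (!isDuplicate) {
      deduped.push(current);
    }
  }
  return deduped;
}
```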
A possible further optimization would be to query performance only for getEntriesByType('navigation') and dedupe just those entries (and likewise for LCP), which would let us skip iterating over entry types we are not detecting duplicates for anyway.
I ran a quick benchmark and confirmed the new dedupe is ~85% faster (on an input of ~200 perf entries).
Side note: since it seems that some perf registries are unbounded, it might make sense to add some safeguards against very large lists, or to prioritize certain entries over others.