feat(attribute-distributions): parallelize stats query #104113

shruthilayaj · 2025-11-27T21:48:48Z

Parallelize the stats query by explicitly passing a list of attributes to fetch.
Hoping to make the stats and ranked endpoints faster this way.

sentry · 2025-12-03T20:18:42Z

src/sentry/api/endpoints/organization_trace_item_stats.py

+            attrs_response = snuba_rpc.attribute_names_rpc(attrs_request)
+
+        # Chunk attributes and run stats query in parallel
+        chunked_attributes: dict[int, list[AttributeKey]] = defaultdict(list[AttributeKey])


Bug: defaultdict is incorrectly instantiated with list[AttributeKey] (a type hint) instead of a callable list, causing a TypeError on first access.
_{Severity: CRITICAL | Confidence: High}

🔍 Detailed Analysis

The defaultdict constructor is incorrectly instantiated with list[AttributeKey]. This is a type hint (a types.GenericAlias object in Python 3.9+), not a callable function. When a missing key is accessed in chunked_attributes, defaultdict attempts to call list[AttributeKey](), leading to a TypeError: 'types.GenericAlias' object is not callable. This will cause an immediate server crash in OrganizationTraceItemsStatsEndpoint.get() and OrganizationTraceItemsAttributesRankedEndpoint.get() when processing attribute distributions.

💡 Suggested Fix

Change defaultdict(list[AttributeKey]) to defaultdict(list). The type hint dict[int, list[AttributeKey]] is correct and should remain.

🤖 Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid. Location: src/sentry/api/endpoints/organization_trace_item_stats.py#L97 Potential issue: The `defaultdict` constructor is incorrectly instantiated with `list[AttributeKey]`. This is a type hint (a `types.GenericAlias` object in Python 3.9+), not a callable function. When a missing key is accessed in `chunked_attributes`, `defaultdict` attempts to call `list[AttributeKey]()`, leading to a `TypeError: 'types.GenericAlias' object is not callable`. This will cause an immediate server crash in `OrganizationTraceItemsStatsEndpoint.get()` and `OrganizationTraceItemsAttributesRankedEndpoint.get()` when processing attribute distributions.

_{Did we get this right? 👍 / 👎 to inform future reviews.}
_{Reference ID: 5300574}

sentry · 2025-12-03T20:27:09Z

src/sentry/api/endpoints/organization_trace_item_stats.py

+                for stats in result:
+                    for stats_type, data in stats.items():
+                        stats_results[stats_type]["data"].update(data["data"])


Bug: The organization_trace_item_stats.py endpoint's response format changed, breaking API compatibility.
_{Severity: CRITICAL | Confidence: High}

🔍 Detailed Analysis

The organization_trace_item_stats.py endpoint's response format has changed from a dictionary structure, {"data": {stat_type: {...}}}, to a list of single-key dictionaries, {"data": [{stat_type: {...}}, ...]}. This transformation is a breaking API change that will cause clients expecting the original dictionary format to fail.

💡 Suggested Fix

Revert the response format in organization_trace_item_stats.py line 135 to return Response({"data": stats_results}) instead of Response({"data": [{k: v} for k, v in stats_results.items()]}).

🤖 Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid. Location: src/sentry/api/endpoints/organization_trace_item_stats.py#L131-L133 Potential issue: The `organization_trace_item_stats.py` endpoint's response format has changed from a dictionary structure, `{"data": {stat_type: {...}}}`, to a list of single-key dictionaries, `{"data": [{stat_type: {...}}, ...]}`. This transformation is a breaking API change that will cause clients expecting the original dictionary format to fail.

_{Did we get this right? 👍 / 👎 to inform future reviews.}
_{Reference ID: 5302423}

cursor · 2025-12-03T20:31:42Z

src/sentry/api/endpoints/organization_trace_item_attributes_ranked.py

+                meta=attrs_meta,
+                limit=max_attributes,
+                type=attr_type,
+                intersecting_attributes_filter=cohort_2,


Bug: Suspect cohort attributes missed due to baseline-only attribute filtering

The TraceItemAttributeNamesRequest uses intersecting_attributes_filter=cohort_2 (the baseline cohort), which means only attributes present in the baseline are fetched and analyzed. These attributes are then used to query both the suspect cohort (cohort_1) and baseline cohort (cohort_2). Attributes unique to the suspect cohort are completely missed, which defeats the purpose of comparing cohorts to find differentiating attributes. The old code didn't pre-filter attributes, so it analyzed all attributes from both cohorts independently.

Abdkhan14

Lgtm, I would look at that seer comment to make sure we don't break the FE

shruthilayaj · 2025-12-03T20:50:43Z

Lgtm, I would look at that seer comment to make sure we don't break the FE

stats results would have returned list[dict[str, Any]] originally

feat: parallelize stats query

b4accdb

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Nov 27, 2025

vercel bot deployed to Preview November 27, 2025 21:52 View deployment

also parallelize ranked endpoint

5b6b8fa

shruthilayaj force-pushed the shruthi/feat/parallelize-trace-item-stats branch from 44c5aa5 to 5b6b8fa Compare December 1, 2025 16:39

vercel bot deployed to Preview December 1, 2025 16:43 View deployment

merge conflicts

3ccc2e8

shruthilayaj force-pushed the shruthi/feat/parallelize-trace-item-stats branch from 508a20b to 3ccc2e8 Compare December 3, 2025 18:59

vercel bot deployed to Preview December 3, 2025 19:03 View deployment

typing issues

02db9ee

shruthilayaj marked this pull request as ready for review December 3, 2025 20:15

shruthilayaj requested review from a team as code owners December 3, 2025 20:15

shruthilayaj force-pushed the shruthi/feat/parallelize-trace-item-stats branch from 5a5b131 to 02db9ee Compare December 3, 2025 20:16

sentry bot reviewed Dec 3, 2025

View reviewed changes

vercel bot deployed to Preview December 3, 2025 20:19 View deployment

typing

cee4d15

shruthilayaj force-pushed the shruthi/feat/parallelize-trace-item-stats branch from 0242c8b to cee4d15 Compare December 3, 2025 20:23

❄️ re-freeze requirements

69a3e85

sentry bot reviewed Dec 3, 2025

View reviewed changes

vercel bot deployed to Preview December 3, 2025 20:27 View deployment

cursor bot reviewed Dec 3, 2025

View reviewed changes

Abdkhan14 approved these changes Dec 3, 2025

View reviewed changes

shruthilayaj merged commit a0836a8 into master Dec 3, 2025
68 checks passed

shruthilayaj deleted the shruthi/feat/parallelize-trace-item-stats branch December 3, 2025 20:53

github-actions bot locked and limited conversation to collaborators Dec 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat(attribute-distributions): parallelize stats query #104113

feat(attribute-distributions): parallelize stats query #104113

Uh oh!

shruthilayaj commented Nov 27, 2025 •

edited

Loading

Uh oh!

sentry bot Dec 3, 2025

Uh oh!

sentry bot Dec 3, 2025

Uh oh!

cursor bot Dec 3, 2025

Uh oh!

Abdkhan14 left a comment

Uh oh!

shruthilayaj commented Dec 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

feat(attribute-distributions): parallelize stats query #104113

feat(attribute-distributions): parallelize stats query #104113

Uh oh!

Conversation

shruthilayaj commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sentry bot Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

sentry bot Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

cursor bot Dec 3, 2025

Choose a reason for hiding this comment

Bug: Suspect cohort attributes missed due to baseline-only attribute filtering

Uh oh!

Abdkhan14 left a comment

Choose a reason for hiding this comment

Uh oh!

shruthilayaj commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shruthilayaj commented Nov 27, 2025 •

edited

Loading

shruthilayaj commented Dec 3, 2025 •

edited

Loading