[ty] Split `ScopedPlaceId` into `ScopedSymbolId` and `ScopedMemberId` by MichaReiser · Pull Request #19497 · astral-sh/ruff

MichaReiser · 2025-07-22T20:28:57Z

Summary

This PR changes the place structs in the semantic index module to enums over a symbol or a member:

ScopedPlaceId: Is now an enum over ScopedSymbolId and ScopedMemberId
PlaceExpr: Is now an enum over Symbol and Member
PlaceExprRef: Is now an enum over &Symbol and &Member
PlaceTable: now separately tracks symbols and members
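As a rough illustration of the shape described in the list above, here is a minimal sketch. The type names `ScopedPlaceId`, `ScopedSymbolId`, `ScopedMemberId` and `PlaceTable` come from the PR description; every field, method (`add_symbol`, `add_member`, `name`, `as_symbol`, `as_member`) and the use of `Vec<String>` are simplified stand-ins assumed for this example, not the actual definitions in `ty_python_semantic`.

```rust
// Illustrative sketch only: simplified stand-ins, not the real ty definitions.

/// Index of a symbol (e.g. `x`) in a scope's symbol table.
#[derive(Copy, Clone, Debug, PartialEq, Eq, Hash)]
pub struct ScopedSymbolId(u32);

/// Index of a member (e.g. `x.a` or `x[0]`) in a scope's member table.
#[derive(Copy, Clone, Debug, PartialEq, Eq, Hash)]
pub struct ScopedMemberId(u32);

/// A place is either a symbol or a member; callers must say which one they handle.
#[derive(Copy, Clone, Debug, PartialEq, Eq, Hash)]
pub enum ScopedPlaceId {
    Symbol(ScopedSymbolId),
    Member(ScopedMemberId),
}

impl ScopedPlaceId {
    /// Hypothetical helper: returns the symbol id if this place is a symbol.
    pub fn as_symbol(self) -> Option<ScopedSymbolId> {
        match self {
            ScopedPlaceId::Symbol(id) => Some(id),
            ScopedPlaceId::Member(_) => None,
        }
    }

    /// Hypothetical helper: returns the member id if this place is a member.
    pub fn as_member(self) -> Option<ScopedMemberId> {
        match self {
            ScopedPlaceId::Member(id) => Some(id),
            ScopedPlaceId::Symbol(_) => None,
        }
    }
}

/// Tracks symbols and members in two separate tables (plain strings here).
#[derive(Debug, Default)]
pub struct PlaceTable {
    symbols: Vec<String>,
    members: Vec<String>,
}

impl PlaceTable {
    pub fn add_symbol(&mut self, name: &str) -> ScopedSymbolId {
        let id = ScopedSymbolId(u32::try_from(self.symbols.len()).expect("too many symbols"));
        self.symbols.push(name.to_string());
        id
    }

    pub fn add_member(&mut self, path: &str) -> ScopedMemberId {
        let id = ScopedMemberId(u32::try_from(self.members.len()).expect("too many members"));
        self.members.push(path.to_string());
        id
    }

    /// Looks up the textual form of a place by dispatching on its kind.
    pub fn name(&self, id: ScopedPlaceId) -> &str {
        match id {
            ScopedPlaceId::Symbol(ScopedSymbolId(i)) => &self.symbols[i as usize],
            ScopedPlaceId::Member(ScopedMemberId(i)) => &self.members[i as usize],
        }
    }
}
```

Code that only cares about symbols can take a `ScopedSymbolId` directly, so its signature documents that it never sees members.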

This idea was initially mentioned in #19470. The hope was that splitting Symbol and Member would help to reduce memory usage because Symbol is only 32 bytes (compared to PlaceExpr, which is 40 bytes).

However, memory usage is roughly unchanged after running ty on a large repository (any saving must be less than 100MB, if there is one at all). Splitting symbols and members also introduces some extra memory overhead because every IndexVec over ScopedPlaceId now needs to be two separate vectors.
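The overhead of the two-vector layout can be sketched as follows. This is a simplified model that uses plain `Vec` in place of `IndexVec`; `PlaceId` and `PerPlace` are hypothetical names introduced only for this example.

```rust
use std::ops::Index;

// Illustrative sketch only: `PlaceId` and `PerPlace` are hypothetical stand-ins.

/// Simplified place id: a symbol index or a member index.
#[derive(Copy, Clone, Debug, PartialEq, Eq)]
pub enum PlaceId {
    Symbol(u32),
    Member(u32),
}

/// Per-place side table. Because symbols and members are numbered
/// independently, one logical table needs two backing vectors, each with
/// its own pointer, length and capacity.
#[derive(Debug, Default)]
pub struct PerPlace<T> {
    pub symbols: Vec<T>,
    pub members: Vec<T>,
}

impl<T> Index<PlaceId> for PerPlace<T> {
    type Output = T;

    /// Dispatches to the vector that matches the kind of place.
    fn index(&self, id: PlaceId) -> &T {
        match id {
            PlaceId::Symbol(i) => &self.symbols[i as usize],
            PlaceId::Member(i) => &self.members[i as usize],
        }
    }
}
```

In this model the fixed cost of the table is two `Vec` headers instead of one, which is the kind of per-table overhead the paragraph above refers to.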

Performance also appears to be mostly neutral.

The main benefit I now see in this refactor is that it clarifies some usages of places. Before, it was often unclear whether some code iterates over symbols or members or both. In fact, I spent the majority of my time during this refactor trying to understand which of the two it is. The split also helped me better understand which flags are only supported on symbols/members and which flags are supported by both.

Now, whether this distinction makes sense long term heavily depends on whether we expect that many places that currently are only over symbols will need to handle members the same way in the future.

Overall: I don't have a strong opinion on this. I do think it helps clarify some code but it does come at a cost.

Code increase

This PR adds a fair amount of new code. However, most of it is just the boilerplate from having separate Symbol/Member, SymbolTable/MemberTable/PlaceTable and SymbolTableBuilder/MemberTableBuilder/PlaceTableBuilder. Most of that code is fairly simple (many getters).

crates/ty_python_semantic/src/semantic_index/member.rs

github-actions · 2025-07-24T05:48:14Z

`mypy_primer` results

No ecosystem changes detected ✅
No memory usage changes detected ✅

github-actions · 2025-07-24T09:53:34Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

MichaReiser · 2025-07-24T09:54:13Z

crates/ty_python_semantic/src/semantic_index/member.rs

I'll write some more docs if we decide to go forward with this PR. The current docs are carried over from PlaceExpr.

MichaReiser · 2025-07-24T09:55:19Z

crates/ty_python_semantic/src/semantic_index/place.rs


-/// Reference to a node that introduces a new scope.
-#[derive(Copy, Clone, Debug)]
-pub(crate) enum NodeWithScopeRef<'a> {


I moved everything scope-related to scope.rs. That code doesn't need to live in place.rs.

MichaReiser · 2025-07-24T11:44:53Z

crates/ty_python_semantic/src/semantic_index/member.rs

+    symbol_name: Name,
+    segments: SmallVec<[MemberSegment; 1]>,


I think we can optimize this further by:

Storing a single Name which is the entire path of the member

In the SmallVec: Store the kind of each segment and where it starts as a u32.

This doesn't change the size of Member, but it reduces the size of the stored segments because it doesn't require a length and capacity for each segment.

This should be sufficient because all we need is a unique key to hash, a way to tell whether it has the form a.b, and a Display implementation.
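A minimal sketch of the layout proposed above, under several assumptions: `String` and `Vec` stand in for `Name` and `SmallVec`, and `CompactMember`, `SegmentKind`, `Segment` and all method names are hypothetical, chosen only for illustration. The path stores the raw text of the symbol and of every segment back to back; each segment records only its kind and the offset where its text starts.

```rust
use std::fmt;

// Illustrative sketch only: all names here are hypothetical stand-ins.

#[derive(Copy, Clone, Debug, PartialEq, Eq, Hash)]
pub enum SegmentKind {
    Attribute,       // `.name`
    IntSubscript,    // `[0]`
    StringSubscript, // `["key"]`
}

/// Kind plus start offset into the shared path; no per-segment length or capacity.
#[derive(Copy, Clone, Debug, PartialEq, Eq, Hash)]
struct Segment {
    kind: SegmentKind,
    start: u32,
}

#[derive(Clone, Debug, PartialEq, Eq, Hash)]
pub struct CompactMember {
    /// Symbol name followed by the raw text of each segment, concatenated.
    path: String,
    segments: Vec<Segment>,
}

impl CompactMember {
    pub fn new(symbol: &str) -> Self {
        Self { path: symbol.to_string(), segments: Vec::new() }
    }

    /// Appends a segment; its text begins at the current end of the path.
    pub fn push(mut self, kind: SegmentKind, text: &str) -> Self {
        let start = u32::try_from(self.path.len()).expect("path fits in u32");
        self.segments.push(Segment { kind, start });
        self.path.push_str(text);
        self
    }

    /// The symbol name is everything before the first segment.
    pub fn symbol_name(&self) -> &str {
        let end = self.segments.first().map_or(self.path.len(), |s| s.start as usize);
        &self.path[..end]
    }

    /// True for members of the form `a.b` (exactly one attribute segment).
    pub fn is_simple_attribute(&self) -> bool {
        matches!(self.segments.as_slice(), [Segment { kind: SegmentKind::Attribute, .. }])
    }
}

impl fmt::Display for CompactMember {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.write_str(self.symbol_name())?;
        for (i, seg) in self.segments.iter().enumerate() {
            // A segment's text ends where the next one starts, or at the end of the path.
            let end = self.segments.get(i + 1).map_or(self.path.len(), |n| n.start as usize);
            let text = &self.path[seg.start as usize..end];
            match seg.kind {
                SegmentKind::Attribute => write!(f, ".{text}")?,
                SegmentKind::IntSubscript => write!(f, "[{text}]")?,
                SegmentKind::StringSubscript => write!(f, "[\"{text}\"]")?,
            }
        }
        Ok(())
    }
}
```

Hashing both the path and the segment offsets keeps the key unique even when two different splits produce the same concatenated text.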

carljm

I think this is a strong improvement in clarity; if it's neutral in performance, I'm +1 to go ahead with it. Thank you!

crates/ty_python_semantic/src/semantic_index/scope.rs

crates/ty_python_semantic/src/semantic_index/member.rs

crates/ty_python_semantic/src/semantic_index/use_def.rs

dhruvmanila · 2025-07-25T04:07:28Z

crates/ty_python_semantic/src/semantic_index/builder.rs

+                    if self.is_method_of_class().is_some() {
+                        if let PlaceExpr::Member(member) = &mut place_expr {
+                            if member.is_instance_attribute_candidate() {


nit: it might be useful to reduce the nesting here by chaining the calls

It's not clear to me how I would do this without let chains
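For reference, one way to flatten this kind of nesting without let chains is a `match` arm with a guard. The sketch below uses toy types: `Member`, `PlaceExpr`, the `marked` field and the `in_method` flag are hypothetical stand-ins for the builder state in the snippet above, and whether this reads better than the nested form in the real builder is a judgment call.

```rust
// Illustrative sketch only: toy stand-ins for the types used in builder.rs.

pub struct Member {
    pub is_candidate: bool,
    pub marked: bool,
}

impl Member {
    pub fn is_instance_attribute_candidate(&self) -> bool {
        self.is_candidate
    }
}

pub enum PlaceExpr {
    Symbol(String),
    Member(Member),
}

/// Nested form, mirroring the structure of the snippet under review.
pub fn mark_nested(place_expr: &mut PlaceExpr, in_method: bool) {
    if in_method {
        if let PlaceExpr::Member(member) = place_expr {
            if member.is_instance_attribute_candidate() {
                member.marked = true;
            }
        }
    }
}

/// Flattened form: the pattern binds the member and the guard carries
/// both boolean conditions, so only one level of nesting remains.
pub fn mark_flat(place_expr: &mut PlaceExpr, in_method: bool) {
    match place_expr {
        PlaceExpr::Member(member) if in_method && member.is_instance_attribute_candidate() => {
            member.marked = true;
        }
        _ => {}
    }
}
```

Both functions behave the same for every input; the guard form trades two `if` levels for a catch-all `_ => {}` arm.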

sharkdp

Thank you!

A minor naming quibble: "member" refers to something like self.x (which seems consistent with how we use that term elsewhere, as an attribute on something), but also to a subscript expression like xs[0]. But no need to change anything.

MichaReiser · 2025-07-25T11:30:16Z

A minor naming quibble: "member" refers to something like self.x (which seems consistent with how we use that term elsewhere, as an attribute on something), but also to a subscript expression like xs[0]. But no need to change anything.

That's fair. Not sure what else to use. I considered Attribute but that is confusing because only x.a is an attribute access. That's why I landed on Member because, to me, that includes all sorts of member access (but maybe that's just me coming from the JS ecosystem, where all those expressions are called member expressions).

MichaReiser · 2025-07-25T11:43:58Z

A possible alternative is Subscript. It might be confusing too, but x.a is just a special way of writing the subscript. I'm happy to make that rename in a separate PR if we decide on that name.

carljm · 2025-07-25T15:25:47Z

Yes, the naming issue occurred to me in review also, but I didn't mention it because I didn't have any concrete suggestion that I liked. I don't know of a term that clearly means "either attribute or subscript", and "member" (borrowed from JS) seems not unreasonable (though I would also initially assume it to mean just an attribute).

The clearest options would be the verbose ones, such as AttributeOrSubscript.

dcreager · 2025-07-25T20:56:27Z

either attribute or subscript

subscribute 😁

AlexWaygood added the ty Multi-file analysis & type inference label Jul 22, 2025

MichaReiser force-pushed the micha/place-refactor branch 2 times, most recently from 4d1afec to 3616804 Compare July 22, 2025 21:23

oconnor663 reviewed Jul 23, 2025

MichaReiser force-pushed the micha/place-refactor branch from a610975 to 2ed78c7 Compare July 24, 2025 05:44

MichaReiser commented Jul 24, 2025

MichaReiser marked this pull request as ready for review July 24, 2025 09:55

MichaReiser requested review from AlexWaygood, carljm, dcreager and sharkdp as code owners July 24, 2025 09:55

AlexWaygood removed their request for review July 24, 2025 10:01

MichaReiser commented Jul 24, 2025

carljm approved these changes Jul 25, 2025

carljm mentioned this pull request Jul 25, 2025

[ty] improve lazy scope place lookup #19321

Merged

dhruvmanila reviewed Jul 25, 2025

dhruvmanila added the internal An internal refactor or improvement label Jul 25, 2025

MichaReiser added 9 commits July 25, 2025 09:59

[ty] In progress, split place

7b895e5

It compiles

1f69f90

Fix most bugs

f677047

Fix del statement bug

8d1b8dd

Don't reverse segments

7bc0ffb

Fix last bug

c5be7ea

Clippy and fmt

5dc7446

Shrink symbols

390f926

Use map for associated sub members

4adb94a

MichaReiser added 6 commits July 25, 2025 10:01

Store name on Member as an explicit field

090e9c2

Revert map use

179172f

Fix docs

905af65

Undo rename

a789382

Docs, restrict visibility

2e89b8c

Incorporate lazy scopes

a56bb11

MichaReiser force-pushed the micha/place-refactor branch from cb7bc75 to a56bb11 Compare July 25, 2025 08:37

sharkdp approved these changes Jul 25, 2025

Docs

f14697f

MichaReiser force-pushed the micha/place-refactor branch from 4abc97b to f14697f Compare July 25, 2025 11:32

Code review feedback

452d7a2

MichaReiser force-pushed the micha/place-refactor branch from e391e8b to 452d7a2 Compare July 25, 2025 11:38

MichaReiser merged commit b033fb6 into main Jul 25, 2025
37 checks passed

MichaReiser deleted the micha/place-refactor branch July 25, 2025 11:54

AlexWaygood pushed a commit that referenced this pull request Jul 25, 2025

[ty] Split ScopedPlaceId into ScopedSymbolId and ScopedMemberId (…

db9978e

…#19497)

7 participants
