fix composite keys order by sfc-gh-jcieslak · Pull Request #669 · snowflakedb/snowflake-sqlalchemy

sfc-gh-jcieslak · 2026-03-12T16:10:13Z

What GitHub issue is this PR addressing? Make sure that there is an accompanying issue to your PR.

Fixes SNOW-922868: Preserve the sequence order of foreign keys when dealing with composite keys. #450
References SNOW-922868 Preserve the sequence order of foreign keys when dealing with composite keys #449
Fill out the following pre-review checklist:
- I am adding a new automated test(s) to verify correctness of my new code
- I am adding new logging messages
- I am adding new credentials
- I am adding a new dependency
Please describe how your code solves the related issue.

Adding sequence key to column queries so that in case of composite keys, they're being returned in order, which matters in cases like Alembic migrations (also following other dialects' behavior).

sfc-gh-mraba · 2026-03-26T13:29:01Z

+            ]
+
+            # Sort referred columns by key sequence
+            v["referred_columns"] = [col for _, col in sorted(v["referred_columns"])]


There are a few repetitions of this sorting.
Given the comment on lexicographic sort this could be replaced with helper

@staticmethod def _sorted_by_key_sequence(pairs): """Sort (key_sequence, col_name) pairs and return col_names in order.""" return [col for _, col in sorted(pairs, key=lambda x: int(x[0]))]

Good point, added helper

sfc-gh-mraba · 2026-03-26T13:31:02Z

+            constraint["column_names"] = [
+                col for _, col in sorted(constraint["column_names"])
+            ]


row._mapping["key_sequence"] is likely returned as a string or Decimal by the Snowflake
Python connector. Sorting tuples of (str, str) is lexicographic, which breaks for
sequences beyond 9:

"1", "10", "2" ← string sort (wrong) 1, 2, 10 ← int sort (correct)

This is a latent bug that will only surface on tables with 10+ column composite keys —
uncommon, but definitely possible in data-warehouse schemas.

Good point, added NamedTuple which casts to int and helper works on this NamedTuple to ensure we are sorting on ints rather than strings

sfc-gh-mraba · 2026-03-26T13:31:54Z


        ans = defaultdict(list)
        for constraint in unique_constraints.values():
+            # Sort constrained columns by key sequence


These are sorted by 'sequence' or 'column_names'?

Adjusted within helper to make it more readable; it should be sorted by key_sequence

sfc-gh-asawicki · 2026-04-01T11:42:32Z

+        eq_(fks[0]["referred_columns"], ["col_a", "col_b", "col_c"])
+        eq_(fks[0]["referred_table"], "test_keys_fk_parent_decl")
+    finally:
+        child.drop(engine_testaccount)


question: what if child.drop fails (e.g. throwing the exception)? parent.drop wouldn't be executed them, right? SHouldn't we use metadata.drop_all(engine_testaccount) instead in all such tests (example:

snowflake-sqlalchemy/tests/test_core.py

Line 559 in fc2e8fa

metadata.drop_all(engine_testaccount)

)? (I see that it's also implemented like this in other tests, eg

snowflake-sqlalchemy/tests/test_core.py

Line 326 in fc2e8fa

addresses.drop(engine_testaccount)

)

sfc-gh-asawicki · 2026-04-01T11:43:17Z

+from sqlalchemy.testing.assertions import eq_
+
+
+def test_composite_fk_reflects_key_order(engine_testaccount):


nit: these tests are pretty repeatable (they share similar setup, execution, and validation), I could see them as parametrized tests (it would make it easier to add more cases and focus on use cases). There are multiple options to implement paramterized tests, so maybe we could discuss later on.

sfc-gh-mraba · 2026-04-15T12:52:47Z

+    @staticmethod
+    def _sort_columns_by_key_sequence(columns: list[_KeyedColumn]) -> list[str]:
+        """Sort columns by key_sequence and return column names."""
+        columns.sort(key=lambda c: c.key_sequence)


.sort() mutates the caller's list. The method signature and docstring both look like a pure function that returns a transformed list, but it also has an invisible side effect. The callers are fine today because they immediately reassign the result but this could silently corrupt state if the list were ever passed from somewhere that also holds a reference. Use sorted() instead:

return [c.column_name for c in sorted(columns, key=lambda c: c.key_sequence)]

This is also more idiomatic Python for "return a sorted copy."

sfc-gh-mraba · 2026-04-15T12:55:30Z

test_composite_fk_reflects_key_order and test_composite_fk_when_parent_pk_order_differs_from_columns both test multi-column FK reflection. The second one adds a twist (PK column order differs from table column order), which is the most important edge case. Consider merging the two, or making it clear in the first test's name that it only covers the "same order" case.

Shortened by parametrizing

sfc-gh-mraba · 2026-04-15T13:13:07Z

+class _KeyedColumn(NamedTuple):
+    key_sequence: int
+    column_name: str
+
+    @classmethod
+    def new(cls, key_sequence, column_name: str) -> "_KeyedColumn":
+        return cls(int(key_sequence), column_name)


I suggest to remove the factory entirely and use as

_KeyedColumn(int(row._mapping["key_sequence"]), self.normalize_name(...))

The int() cast is now visible at the call site — it's explicit that the DB cursor returns a string and we're converting it. That's not something worth hiding.

Fewer moving parts. The factory saves no lines; every call site with .new(x, y) is the same length as (int(x), y).

The NamedTuple constructor signature already documents the types (key_sequence: int). A reader who sees int(row._mapping["key_sequence"]) immediately understands the intent.

The .new() name is also misleading — it looks like it might do validation or have richer logic, but it's just a coercion wrapper.

The only scenario where a factory would be justified is if key_sequence required non-trivial parsing (e.g., stripping units, handling nulls, mapping enums). A bare
int() cast doesn't clear that bar.

sfc-gh-jcieslak marked this pull request as ready for review March 25, 2026 15:22

sfc-gh-jcieslak requested a review from a team as a code owner March 25, 2026 15:22

sfc-gh-jcieslak added 3 commits March 25, 2026 15:23

Adjust composite key order

304cac5

Add test

b3bbf23

Add and adjust tests

7ae539b

sfc-gh-jcieslak force-pushed the jcieslak/fix-composite-keys-order branch from ccf8dff to 7ae539b Compare March 25, 2026 15:23

Fix core test

25ab751

sfc-gh-jcieslak mentioned this pull request Mar 26, 2026

mraba/reflection-optimisation: skip schema resolution for single table #656

Merged

4 tasks

sfc-gh-mraba reviewed Mar 26, 2026

View reviewed changes

sfc-gh-jcieslak added 3 commits March 31, 2026 12:26

Adjust after review

63a4b08

Adjust after test failures

f2318f2

Rename

0310584

sfc-gh-asawicki approved these changes Apr 1, 2026

View reviewed changes

sfc-gh-mraba added a commit that referenced this pull request Apr 15, 2026

mraba/reflection-optimisation: optimise for changes in #669

a70bba6

sfc-gh-jcieslak added 2 commits April 15, 2026 14:20

changes after review

fa992bf

Adjust test cleanup

589437d

sfc-gh-jcieslak force-pushed the jcieslak/fix-composite-keys-order branch from 31841ab to 589437d Compare April 15, 2026 12:32

sfc-gh-mraba reviewed Apr 15, 2026

View reviewed changes

sfc-gh-jcieslak added 3 commits April 16, 2026 09:34

Adjust after review

e56da3c

Adjust after review

3304a50

Fix after review

03c6678

sfc-gh-mraba approved these changes Apr 20, 2026

View reviewed changes

sfc-gh-jcieslak merged commit c1b7d72 into main Apr 20, 2026
63 checks passed

sfc-gh-jcieslak deleted the jcieslak/fix-composite-keys-order branch April 20, 2026 10:00

github-actions Bot locked and limited conversation to collaborators Apr 20, 2026

		from sqlalchemy.testing.assertions import eq_


		def test_composite_fk_reflects_key_order(engine_testaccount):

Conversation

sfc-gh-jcieslak commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sfc-gh-jcieslak Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sfc-gh-asawicki Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sfc-gh-jcieslak commented Mar 12, 2026 •

edited

Loading

sfc-gh-jcieslak Mar 26, 2026 •

edited

Loading

sfc-gh-asawicki Apr 1, 2026 •

edited

Loading