Generate deterministic ids when formatting notebooks by zanieb · Pull Request #9359 · astral-sh/ruff

zanieb · 2024-01-02T16:11:17Z

When formatting notebooks, we populate the id field for cells that do not have one. Previously, we generated a random UUID v4 for each such cell, which made formatting non-deterministic. Here, we generate the UUID from a seeded random number generator instead of using true randomness. For example, here are the first five ids it would generate:

7fb27b94-1602-401d-9154-2211134fc71a
acae54e3-7e7d-407b-bb7b-55eff062a284
9a63283c-baf0-4dbc-ab1f-6479b197f3a8
8dd0d809-2fe7-4a7c-9628-1538738b07e2
72eea511-9410-473a-a328-ad9291626812

We also add a check that an id is not present in another cell to prevent accidental introduction of duplicate ids.
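A minimal, dependency-free sketch of the two ideas above (seeded id generation plus a duplicate check). It is not the code in this PR: `SplitMix64`, `uuid_v4_from`, and `assign_ids` are hypothetical names, and SplitMix64 stands in for the seeded generator from `rand`, so the ids it prints will not match the ones listed above.

```rust
use std::collections::HashSet;

/// Tiny deterministic PRNG (SplitMix64). A stand-in for a seeded generator;
/// this is NOT the generator used in the PR.
struct SplitMix64(u64);

impl SplitMix64 {
    fn next_u64(&mut self) -> u64 {
        self.0 = self.0.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = self.0;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^ (z >> 31)
    }
}

/// Format 16 pseudo-random bytes as a hyphenated UUID with the version-4
/// and RFC 4122 variant bits set.
fn uuid_v4_from(rng: &mut SplitMix64) -> String {
    let mut bytes = [0u8; 16];
    bytes[..8].copy_from_slice(&rng.next_u64().to_be_bytes());
    bytes[8..].copy_from_slice(&rng.next_u64().to_be_bytes());
    bytes[6] = (bytes[6] & 0x0f) | 0x40; // version 4
    bytes[8] = (bytes[8] & 0x3f) | 0x80; // variant 10xx
    let hex: String = bytes.iter().map(|b| format!("{b:02x}")).collect();
    format!(
        "{}-{}-{}-{}-{}",
        &hex[0..8], &hex[8..12], &hex[12..16], &hex[16..20], &hex[20..32]
    )
}

/// Fill in missing ids (`None`) deterministically, skipping any candidate
/// that is already used by another cell.
fn assign_ids(cells: &mut [Option<String>], seed: u64) {
    let mut rng = SplitMix64(seed);
    // Pre-collect the ids that are already present in the notebook.
    let mut seen: HashSet<String> = cells.iter().flatten().cloned().collect();
    for cell in cells.iter_mut().filter(|c| c.is_none()) {
        let id = loop {
            let candidate = uuid_v4_from(&mut rng);
            // `insert` returns false if the id was already present.
            if seen.insert(candidate.clone()) {
                break candidate;
            }
        };
        *cell = Some(id);
    }
}

fn main() {
    let mut cells = vec![Some("existing-id".to_string()), None, None];
    assign_ids(&mut cells, 0);
    for id in cells.iter().flatten() {
        println!("{id}");
    }
}
```

Because the generator is re-seeded on every call, running `assign_ids` twice on the same input yields the same ids, which is the property that makes formatting reproducible.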

The specification is lax, and we could just use incrementing integers (e.g., 0, 1, ...), but I have a minor preference for retaining the UUID format. There is some discussion here; I'm happy to go either way, though.

Discovered via #9293

github-actions · 2024-01-02T16:23:38Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

charliermarsh · 2024-01-03T16:53:36Z

+        // https://github.com/jupyter/enhancement-proposals/blob/master/62-cell-id/cell-id.md#questions
        if raw_notebook.nbformat == 4 && raw_notebook.nbformat_minor >= 5 {
+            // We use a mock random number generator to generate deterministic uuids
+            let mut rng = rand::rngs::mock::StepRng::new(0, 1);


Could we use a seeded number generator rather than the StepRng one, so that the IDs are deterministic but look random rather than structured as they do now? Or was this the only option for deterministic UUIDs?

zanieb · 2024-01-03

I'm sure there's another option for a seeded number generator; this one was just the most obvious way I saw.

zanieb · 2024-01-03

Sure, we can use the StdRng seeded with 0:

7fb27b94-1602-401d-9154-2211134fc71a
acae54e3-7e7d-407b-bb7b-55eff062a284
9a63283c-baf0-4dbc-ab1f-6479b197f3a8
8dd0d809-2fe7-4a7c-9628-1538738b07e2
72eea511-9410-473a-a328-ad9291626812
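To illustrate the distinction raised in this thread, here is a rough, dependency-free analogue of a step generator versus a seeded mixing generator. It does not use `rand` or `uuid`: `uuid_from`, `step_ids`, and `mixed_ids` are hypothetical helpers, the counter only imitates the idea behind `StepRng::new(0, 1)`, and SplitMix64 only stands in for a seeded `StdRng`, so the exact ids differ from what those crates would produce.

```rust
/// Build a hyphenated version-4 UUID string from two 64-bit draws.
fn uuid_from(mut next_u64: impl FnMut() -> u64) -> String {
    let mut bytes = [0u8; 16];
    bytes[..8].copy_from_slice(&next_u64().to_be_bytes());
    bytes[8..].copy_from_slice(&next_u64().to_be_bytes());
    bytes[6] = (bytes[6] & 0x0f) | 0x40; // version 4
    bytes[8] = (bytes[8] & 0x3f) | 0x80; // RFC 4122 variant
    let h: String = bytes.iter().map(|b| format!("{b:02x}")).collect();
    format!("{}-{}-{}-{}-{}", &h[..8], &h[8..12], &h[12..16], &h[16..20], &h[20..])
}

/// Ids from a step generator: starts at 0 and adds 1 on every draw.
fn step_ids(n: usize) -> Vec<String> {
    let mut value = 0u64;
    let mut draw = || {
        let v = value;
        value = value.wrapping_add(1);
        v
    };
    (0..n).map(|_| uuid_from(&mut draw)).collect()
}

/// Ids from a seeded mixing generator (SplitMix64 as a stand-in).
fn mixed_ids(seed: u64, n: usize) -> Vec<String> {
    let mut state = seed;
    let mut draw = || {
        state = state.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = state;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^ (z >> 31)
    };
    (0..n).map(|_| uuid_from(&mut draw)).collect()
}

fn main() {
    for id in step_ids(2) {
        println!("step:  {id}");
    }
    for id in mixed_ids(0, 2) {
        println!("mixed: {id}");
    }
}
```

Both generators are deterministic, but the step variant yields visibly structured ids (the first is `00000000-0000-4000-8000-000000000001`), while the mixed variant yields ids that look random.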

konstin

I'd go with natural numbers for simplicity but it doesn't matter much since the users shouldn't see that id anyway.
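For comparison, a sketch of the natural-number alternative, assuming ids only need to be unique within the notebook. `assign_integer_ids` is a hypothetical helper, not something in this PR; it hands out the smallest counter value that no existing cell already uses.

```rust
use std::collections::HashSet;

/// Give each cell without an id the next unused natural number, as a string.
fn assign_integer_ids(cells: &mut [Option<String>]) {
    // Ids already present must not be handed out again.
    let mut seen: HashSet<String> = cells.iter().flatten().cloned().collect();
    let mut next = 0u64;
    for cell in cells.iter_mut().filter(|c| c.is_none()) {
        let id = loop {
            let candidate = next.to_string();
            next += 1;
            // Skip numbers that collide with an existing id.
            if seen.insert(candidate.clone()) {
                break candidate;
            }
        };
        *cell = Some(id);
    }
}

fn main() {
    // The first cell already uses "1", so the new cells get "0" and "2".
    let mut cells = vec![Some("1".to_string()), None, None];
    assign_integer_ids(&mut cells);
    println!("{cells:?}");
}
```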

zanieb commented Jan 2, 2024

Comment thread crates/ruff_notebook/src/notebook.rs Outdated

zanieb force-pushed the zb/notebook-id-fmt branch from 27a8726 to 411f988 Compare January 2, 2024 21:16

zanieb mentioned this pull request Jan 2, 2024

Add jupyter notebooks to ecosystem checks #9293

Merged

zanieb added 6 commits January 2, 2024 20:28

Generate deterministic ids when formatting notebooks

9a0fff1

Pre-collect ids

2a6377f

Remove unnecessary clone

182bbbb

Use a random number generator

487959e

Simplify contains / insert

376f39c

Use a simple formatter instead of hyphenated

f4edaa6

zanieb force-pushed the zb/notebook-id-fmt branch from 411f988 to f4edaa6 Compare January 3, 2024 02:28

zanieb marked this pull request as ready for review January 3, 2024 03:38

zanieb requested review from dhruvmanila and konstin January 3, 2024 16:40

charliermarsh reviewed Jan 3, 2024

charliermarsh approved these changes Jan 3, 2024

Use StdRng instead of StepRng

0154bf2

zanieb added the formatter label (Related to the formatter) Jan 3, 2024

konstin approved these changes Jan 4, 2024

zanieb merged commit aaa0097 into main Jan 4, 2024

zanieb deleted the zb/notebook-id-fmt branch January 4, 2024 15:19

Porkepix mentioned this pull request Jan 12, 2024

ruff 0.1.12 Homebrew/homebrew-core#159717

Merged

zanieb merged 7 commits into main from zb/notebook-id-fmt
