Fix exponential parser time on sequence of [[[[ by anka-213 · Pull Request #10439 · nushell/nushell

anka-213 · 2023-09-20T10:48:49Z

Description

Before this change, parsing [[[[[[[[[[[[[[[[[[[[[[ would cause nushell to consume several gigabytes of memory, now it should be linear in time.

The old code first tried parsing the head of the table as a list and then after that it checked if it got more arguments. If it didn't, it throws away the previous result and tries to parse the whole thing as a list, which means we call parse_list_expression twice for each call to parse_table_expression, resulting in the exponential growth

The fix is to simply check that we have all the arguments we need before parsing the head of the table, so we know that we will either call parse_list_expression only on sub-expressions or on the whole thing, never both.

Fixes #10438

User-Facing Changes

Should give a noticable speedup when typing a sequence of [[[[[[ open brackets

Tests + Formatting

I would like to add tests, but I'm not sure how to do that without crashing CI with OOM on regression

Don't forget to add tests that cover your changes.
cargo fmt --all -- --check to check standard code formatting (cargo fmt --all applies these changes)
cargo clippy --workspace -- -D warnings -D clippy::unwrap_used to check that you're using the standard code style
cargo test --workspace to check that all tests pass (on Windows make sure to enable developer mode)
cargo run -- -c "use std testing; testing run-tests --path crates/nu-std" to run the tests for the standard library

After Submitting

If your PR had any user-facing changes, update the documentation after the PR is merged, if necessary. This will help us keep the docs up to date.

The old code first tried parsing the head of the table as a list and then after that it checked if it got more arguments. If it didn't, it throws away the previous result and tries to parse the whole thing as a list, which means we call parse_list_expression twice for each call to parse_table_expression, resulting in the exponential growth The fix is to simply check that we have all the arguments we need before parsing the head of the table, so we know that we will either call parse_list_expression only on sub-expressions or on the whole thing, never both. Fixes nushell#10438

anka-213 · 2023-09-20T11:10:34Z

I've added a test now, but it will just freeze on regression instead of giving an error. I would like to add a timeout, but I don't know how to do that. Maybe it's only available as a global setting for the full testsuite?

sophiajt · 2023-09-20T15:46:16Z

Wow, nice catch

# Description  Before this change, parsing `[[[[[[[[[[[[[[[[[[[[[[` would cause nushell to consume several gigabytes of memory, now it should be linear in time. The old code first tried parsing the head of the table as a list and then after that it checked if it got more arguments. If it didn't, it throws away the previous result and tries to parse the whole thing as a list, which means we call `parse_list_expression` twice for each call to `parse_table_expression`, resulting in the exponential growth The fix is to simply check that we have all the arguments we need before parsing the head of the table, so we know that we will either call parse_list_expression only on sub-expressions or on the whole thing, never both. Fixes nushell#10438 # User-Facing Changes Should give a noticable speedup when typing a sequence of `[[[[[[` open brackets  # Tests + Formatting I would like to add tests, but I'm not sure how to do that without crashing CI with OOM on regression - [x] Don't forget to add tests that cover your changes. - [x] `cargo fmt --all -- --check` to check standard code formatting (`cargo fmt --all` applies these changes) - [x] `cargo clippy --workspace -- -D warnings -D clippy::unwrap_used` to check that you're using the standard code style - [x] `cargo test --workspace` to check that all tests pass (on Windows make sure to [enable developer mode](https://learn.microsoft.com/en-us/windows/apps/get-started/developer-mode-features-and-debugging)) - [x] `cargo run -- -c "use std testing; testing run-tests --path crates/nu-std"` to run the tests for the standard library  # After Submitting If your PR had any user-facing changes, update [the documentation](https://github.com/nushell/nushell.github.io) after the PR is merged, if necessary. This will help us keep the docs up to date.

anka-213 force-pushed the anka-213/issue10438 branch from 0fea546 to 92f04e0 Compare September 20, 2023 10:51

Add test for parse of deeply nested lists

0c747aa

sophiajt merged commit 8d8b443 into nushell:main Sep 20, 2023

anka-213 mentioned this pull request Sep 27, 2024

Exponential algorithmic complexity on parsing nested brackets #13944

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix exponential parser time on sequence of [[[[#10439

Fix exponential parser time on sequence of [[[[#10439
sophiajt merged 2 commits intonushell:mainfrom
anka-213:anka-213/issue10438

anka-213 commented Sep 20, 2023 •

edited

Loading

Uh oh!

anka-213 commented Sep 20, 2023

Uh oh!

sophiajt commented Sep 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

anka-213 commented Sep 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

User-Facing Changes

Tests + Formatting

After Submitting

Uh oh!

anka-213 commented Sep 20, 2023

Uh oh!

sophiajt commented Sep 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

anka-213 commented Sep 20, 2023 •

edited

Loading