Fix for #4264: --line-ranges formats entire file when ranges are at EOF by sumezulike · Pull Request #4273 · psf/black

sumezulike · 2024-03-12T16:43:54Z

Description

This fixes #4264. The issue was that the empty last line does not count as a line to adjusted_lines because it is not in the list returned by str.splitlines. Since adjusted_lines is only called on the second pass of _format_str_once in format_str, the first pass would format the code correctly, then the "invalid" line would get removed from lines, and the second pass would format the whole code.

I added a small change to adjusted_lines to cap the end value of any line tuple. For example, --line-ranges 1-100 gets reduced to (1, 4) if the code is only four lines long.
One could change if end > original_line_count to if end == original_line_count + 1 to only allow this one additional line, but I think allowing an oversized range to just cover the rest of the code is not surprising behavior, slices act similarly.

I also added a call to adjusted_lines with the unmodified source code before the first pass of _format_str_once. This is an additional computational expense but ensures consistency.

Checklist - did you ...

Add an entry in CHANGES.md if necessary?
Add / update tests if necessary?
[-] Add new / update outdated documentation?

github-actions · 2024-03-12T16:59:28Z

diff-shades reports zero changes comparing this PR (a1ba877) to main (1abcffc).

What is this? | Workflow run | diff-shades documentation

JelleZijlstra · 2024-03-13T04:13:21Z

cc @yilei

yilei · 2024-03-13T19:08:51Z

Thanks for tagging, @JelleZijlstra!

I also added a call to adjusted_lines with the unmodified source code before the first pass of _format_str_once. This is an additional computational expense but ensures consistency.

Why is this still necessary, after the change to adjusted_lines to cap the end value?

Could you also add a test file next to https://github.com/psf/black/blob/main/tests/data/cases/line_ranges_basic.py ?

This opens up a question on what happens when you specify a --line-ranges= that's outside of the unformatted file. This change makes it valid when you specify a larger <END>, but if the entire range is outside then the entire file is still formatted. How about make it format nothing if everything is outside of the range too? The implementation could be, instead of changing adjusted_lines, to call a new sanitize_lines(lines, src_contents) once in format_str:

def format_str(...):
    if lines:
        lines = sanitize_lines(lines, src_contents)
        if not lines:
            return src_contents  # Nothing to format
    dst_contents = _format_str_once(src_contents, mode=mode, lines=lines)
    ...

sumezulike · 2024-03-14T08:35:27Z

Thank you so much for the feedback!

Calling adjusted_lines beforehand is not really necessary. I just used it like one would sanitize_lines to make sure that any lines that would be removed on the second pass would already be removed on the first. Writing a new function for that definitely makes more sense!

Thank you also for noticing that moving the entire range out of the file still formats everything, I'll fix that as suggested.

yilei

Thanks! Looks good to me overall, left a few minor comments.

yilei · 2024-03-14T19:22:33Z

tests/test_ranges.py

+2. def func(arg1,
+3.   arg2, arg3):
+4.   pass
+"""


Can you also add a case for source not ending with a newline?

yilei · 2024-03-14T19:23:31Z

tests/data/cases/line_ranges_outside_source.py

+def foo3(parameter_1, parameter_2, parameter_3, parameter_4, parameter_5, parameter_6, parameter_7): pass
+def foo4(parameter_1, parameter_2, parameter_3, parameter_4, parameter_5, parameter_6, parameter_7): pass
+
+# Adding some unformated code covering a wide range of syntaxes.


I would simply remove the lines below, as this test case just need to verify a completely out-of-range input doesn't format

yilei · 2024-03-14T19:26:27Z

src/black/ranges.py

+    if not src_contents:
+        return []
+    good_lines = []
+    src_line_count = len(src_contents.splitlines())


Not too strong an opinion, it can be more efficient to do a count("\n") but then you also need to add 1 when it doesn't end with a new line.

Oh, that's really quite a difference

$ python -m timeit -n 10000 -s "f = open('src/black/__init__.py'); src=f.read(); f.close()" "len(src.splitlines())" 10000 loops, best of 5: 171 usec per loop $ python -m timeit -n 10000 -s "f = open('src/black/__init__.py'); src=f.read(); f.close()" "src.count('\n')" 10000 loops, best of 5: 36.6 usec per loop

I resisted the temptation to write src_contents.count("\n") + src_contents[-1] != "\n" 😄

sumezulike · 2024-03-15T18:58:45Z

Thanks for reviewing and approving my PR! Glad to be able to contribute to one of my all-time favorite Python projects :)

Fix for psf#4264

90136fb

sumezulike changed the title ~~Fix for #4264~~ Fix for #4264: --line-ranges formats entire file when ranges are at EOF Mar 12, 2024

Fix for psf#4264

5f29c7c

yilei reviewed Mar 14, 2024

View reviewed changes

Added feedback

a1ba877

yilei approved these changes Mar 15, 2024

View reviewed changes

JelleZijlstra approved these changes Mar 15, 2024

View reviewed changes

JelleZijlstra merged commit 7b5a657 into psf:main Mar 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for #4264: --line-ranges formats entire file when ranges are at EOF#4273

Fix for #4264: --line-ranges formats entire file when ranges are at EOF#4273
JelleZijlstra merged 3 commits intopsf:mainfrom
sumezulike:fix-line-ranges-last-line

sumezulike commented Mar 12, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Mar 12, 2024 •

edited

Loading

Uh oh!

JelleZijlstra commented Mar 13, 2024

Uh oh!

yilei commented Mar 13, 2024

Uh oh!

sumezulike commented Mar 14, 2024

Uh oh!

yilei left a comment

Uh oh!

yilei Mar 14, 2024

Uh oh!

yilei Mar 14, 2024

Uh oh!

yilei Mar 14, 2024

Uh oh!

sumezulike Mar 14, 2024 •

edited

Loading

Uh oh!

sumezulike commented Mar 15, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sumezulike commented Mar 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist - did you ...

Uh oh!

github-actions bot commented Mar 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JelleZijlstra commented Mar 13, 2024

Uh oh!

yilei commented Mar 13, 2024

Uh oh!

sumezulike commented Mar 14, 2024

Uh oh!

yilei left a comment

Choose a reason for hiding this comment

Uh oh!

yilei Mar 14, 2024

Choose a reason for hiding this comment

Uh oh!

yilei Mar 14, 2024

Choose a reason for hiding this comment

Uh oh!

yilei Mar 14, 2024

Choose a reason for hiding this comment

Uh oh!

sumezulike Mar 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sumezulike commented Mar 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sumezulike commented Mar 12, 2024 •

edited

Loading

github-actions bot commented Mar 12, 2024 •

edited

Loading

sumezulike Mar 14, 2024 •

edited

Loading

sumezulike commented Mar 15, 2024 •

edited

Loading