language-servers/ruff

mirror of https://github.com/astral-sh/ruff.git synced 2025-09-28 12:55:05 +00:00

Author	SHA1	Message	Date
Andrew Gallant	f585e3e2dc	remove several uses of `unsafe` (#8600 ) This PR removes several uses of `unsafe`. I generally limited myself to low hanging fruit that I could see. There are still a few remaining uses of `unsafe` that looked a bit more difficult to remove (if possible at all). But this gets rid of a good chunk of them. I put each `unsafe` removal into its own commit with a justification for why I did it. So I would encourage reviewing this PR commit-by-commit. That way, we can legislate them independently. It's no problem to drop a commit if we feel the `unsafe` should stay in that case.	2023-11-28 09:50:03 -05:00
Dhruv Manilawala	017e829115	Update string nodes for implicit concatenation (#7927 ) ## Summary This PR updates the string nodes (`ExprStringLiteral`, `ExprBytesLiteral`, and `ExprFString`) to account for implicit string concatenation. ### Motivation In Python, implicitly concatenated strings are joined while parsing because the interpreter doesn't require the information for each part. While that's feasible for an interpreter, it falls short for a static analysis tool where having such information is more useful. Currently, various parts of the code use the lexer to get the individual string parts. One of the main challenges this solves is that of string formatting. Currently, the formatter relies on the lexer to get the individual string parts, and formats them including the comments accordingly. But, with PEP 701, f-strings can also contain comments. Without this change, it becomes very difficult to add support for f-string formatting. ### Implementation The initial proposal was made in this discussion: https://github.com/astral-sh/ruff/discussions/6183#discussioncomment-6591993. There were various AST designs which were explored for this task which are available in the linked internal document[^1]. The selected variant was the one where the nodes were kept as they are except that the `implicit_concatenated` field was removed and instead a new struct was added to the `Expr*` struct. This private struct contains the actual implementation of how the AST is designed for both single and implicitly concatenated strings. This implementation is achieved through an enum with two variants: `Single` and `Concatenated` to avoid allocating a vector even for single strings. There are various public methods available on the value struct to query certain information regarding the node. 
The nodes are structured in the following way: ``` ExprStringLiteral - "foo" "bar" \|- StringLiteral - "foo" \|- StringLiteral - "bar" ExprBytesLiteral - b"foo" b"bar" \|- BytesLiteral - b"foo" \|- BytesLiteral - b"bar" ExprFString - "foo" f"bar {x}" \|- FStringPart::Literal - "foo" \|- FStringPart::FString - f"bar {x}" \|- StringLiteral - "bar " \|- FormattedValue - "x" ``` [^1]: Internal document: https://www.notion.so/astral-sh/Implicit-String-Concatenation-e036345dc48943f89e416c087bf6f6d9?pvs=4 #### Visitor The way the nodes are structured is that the entire string, including all the parts that are implicitly concatenated, is a single node containing individual nodes for the parts. The previous section has a representation of that tree for all the string nodes. This means that new visitor methods are added to visit the individual parts of string, bytes, and f-strings for `Visitor`, `PreorderVisitor`, and `Transformer`. ## Test Plan - `cargo insta test --workspace --all-features --unreferenced reject` - Verify that the ecosystem results are unchanged	2023-11-24 17:55:41 -06:00
konsti	14e65afdc6	Update to Rust 1.74 and use new clippy lints table (#8722 ) Update to [Rust 1.74](https://blog.rust-lang.org/2023/11/16/Rust-1.74.0.html) and use the new clippy lints table. The update itself introduced a new clippy lint about superfluous hashes in raw strings, so those hashes were removed. I moved our lint config from `rustflags` to the newly stabilized [workspace.lints](https://doc.rust-lang.org/stable/cargo/reference/workspaces.html#the-lints-table). One consequence is that we have to set `unsafe_code = "warn"` instead of "forbid" because the latter now actually bans unsafe code: ``` error[E0453]: allow(unsafe_code) incompatible with previous forbid --> crates/ruff_source_file/src/newlines.rs:62:17 \| 62 \| #[allow(unsafe_code)] \| ^^^^^^^^^^^ overruled by previous forbid \| = note: `forbid` lint level was set on command line ``` --------- Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-11-16 18:12:46 -05:00
Dhruv Manilawala	3e00ddce38	Preserve trailing semicolon for Notebooks (#8590 ) ## Summary This PR updates the formatter to preserve trailing semicolon for Jupyter Notebooks. The motivation behind the change is that semicolons in notebooks are typically used to hide the output, for example when plotting. This is highlighted in the linked issue. The conditions required as to when the trailing semicolon should be preserved are: 1. It should be a top-level statement which is last in the module. 2. For statement, it can be either assignment, annotated assignment, or augmented assignment. Here, the target should only be a single identifier i.e., multiple assignments or tuple unpacking isn't considered. 3. For expression, it can be any. ## Test Plan Add a new integration test in `ruff_cli`. The test notebook basically acts as a document as to which trailing semicolons are to be preserved. fixes: #8254	2023-11-10 21:53:35 +05:30
konsti	b6c4074836	Insert newline between docstring and following own line comment (#8216 ) Summary Previously, own line comments following a docstring and followed by newline(s) before the first content statement were treated as trailing on the docstring, and we didn't insert a newline after the docstring as black would. Before: ```python class ModuleBrowser: """Browse module classes and functions in IDLE.""" # This class is also the base class for pathbrowser.PathBrowser. def __init__(self, master, path, *, _htest=False, _utest=False): pass ``` After: ```python class ModuleBrowser: """Browse module classes and functions in IDLE.""" # This class is also the base class for pathbrowser.PathBrowser. def __init__(self, master, path, *, _htest=False, _utest=False): pass ``` I'm not entirely happy about hijacking `handle_own_line_comment_between_statements`, but I don't know a better spot to put it. Fixes #7948 Test Plan Fixtures	2023-10-30 13:18:54 +00:00
Dhruv Manilawala	230c9ce236	Split `Constant` to individual literal nodes (#8064 ) ## Summary This PR splits the `Constant` enum into individual literal nodes. It introduces the following new nodes for each variant: * `ExprStringLiteral` * `ExprBytesLiteral` * `ExprNumberLiteral` * `ExprBooleanLiteral` * `ExprNoneLiteral` * `ExprEllipsisLiteral` The main motivation behind this refactor is to introduce the new AST node for implicit string concatenation in the coming PR. The elements of that node will be either a string literal, bytes literal or an f-string which can be implemented using an enum. This means that a string or bytes literal cannot be represented by `Constant::Str` / `Constant::Bytes` which creates an inconsistency. This PR avoids that inconsistency by splitting the constant nodes into their own literal nodes, literal being the more appropriate naming convention from a static analysis tool perspective. This also makes working with literals in the linter and formatter much more ergonomic: for example, to check whether an expression is a string literal, one can use `Expr::is_string_literal_expr` or match against `Expr::StringLiteral`, as opposed to matching against the `ExprConstant` and enum `Constant`. A few AST helper methods can be simplified as well which will be done in a follow-up PR. This introduces a new `Expr::is_literal_expr` method which is the same as `Expr::is_constant_expr`. There are also intermediary changes related to implicit string concatenation, which are quite few. This is done so as to avoid having a huge PR which this already is. ## Test Plan 1. Verify and update all of the existing snapshots (parser, visitor) 2. 
Verify that the ecosystem check output remains unchanged for both the linter and formatter ### Formatter ecosystem check #### `main` \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1647 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \| #### `dhruv/constant-to-literal` \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1647 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \|	2023-10-30 12:13:23 +05:30
Charlie Marsh	3c3d9ab173	Insert necessary blank line between class and leading comments (#8224 ) ## Summary Given: ```python # comment class A: def foo(self): pass ``` We need to insert an additional newline between `# comment` and `class A`. We were missing this handling for the case in which `# comment` is a leading comment on `class A`, as opposed to a trailing comment of some preceding statement. In practice, I think this only applies to the specific case in which a class or function is the first statement in a module, and there's a single empty line between a leading comment and that class or function. If there are no empty lines, then the comment "sticks" to the definition; if there are two or more, then `leading_comments` will truncate appropriately. If the class or function is nested, then we only need one empty line anyway. Closes https://github.com/astral-sh/ruff/issues/8215. ## Test Plan No change in similarity. Before: \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1647 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \| After: \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1648 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \|	2023-10-25 20:31:59 -04:00
Dhruv Manilawala	c2ec5f0bc9	Use source type to determine parser mode for formatting (#8205 ) ## Summary This PR fixes the bug where if a Notebook contained IPython syntax, then the format command would fail. This was because the correct mode was not being used while parsing through the formatter code path. ## Test Plan This PR isn't the only requirement for Notebook formatting to start working with IPython escape commands. The following PR in the stack is required as well.	2023-10-25 19:20:02 +05:30
konsti	8f9753f58e	Comments outside expression parentheses (#7873 ) ## Summary Fixes https://github.com/astral-sh/ruff/issues/7448 Fixes https://github.com/astral-sh/ruff/issues/7892 I've removed automatic dangling comment formatting, we're doing manual dangling comment formatting everywhere anyway (the assert-all-comments-formatted ensures this) and dangling comments would break the formatting there. ## Test Plan New test file. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-10-19 09:24:11 +00:00
konsti	0c3123e07e	Insert newline after nested function or class statements (#7946 ) Summary Insert a newline after nested function and class definitions, unless there is a trailing own line comment. We need to e.g. format ```python if platform.system() == "Linux": if sys.version > (3, 10): def f(): print("old") else: def f(): print("new") f() ``` as ```python if platform.system() == "Linux": if sys.version > (3, 10): def f(): print("old") else: def f(): print("new") f() ``` even though `f()` is directly preceded by an if statement, not a function or class definition. See the comments and fixtures for trailing own line comment handling. Test Plan I checked that the new content of `newlines.py` matches black's formatting. --------- Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-10-18 09:45:58 +00:00
Charlie Marsh	d685107638	Move {AnyNodeRef, AstNode} to ruff_python_ast crate root (#8030 ) This is a do-over of https://github.com/astral-sh/ruff/pull/8011, which I accidentally merged into a non-`main` branch. Sorry!	2023-10-18 00:01:18 +00:00
Micha Reiser	e2ec42539b	Attach dangling comments to the comprehension instead of the `if` or `iter` nodes (#7693 )	2023-09-29 10:45:01 +01:00
Dhruv Manilawala	e62e245c61	Add support for PEP 701 (#7376 ) ## Summary This PR adds support for PEP 701 in Ruff. This is a rollup PR of all the other individual PRs. The separate PRs were created for logic separation and code reviews. Refer to each pull request for a detailed description of the change. Refer to the PR description for the list of pull requests within this PR. ## Test Plan ### Formatter ecosystem checks Explanation for the change in ecosystem check: https://github.com/astral-sh/ruff/pull/7597#issue-1908878183 #### `main` ``` \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 319 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| ``` #### `dhruv/pep-701` ``` \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76051 \| 1789 \| 1632 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 319 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| ```	2023-09-29 02:55:39 +00:00
Charlie Marsh	f62b4c801f	Extend pragma comment cases (#7687 ) ## Summary Extends the pragma comment detection in the formatter to support case-insensitive `noqa` (as supported by Ruff), plus a variety of other pragmas (`isort:`, `nosec`, etc.). Also extracts the detection out into the trivia crate so that we can reuse it in the linter (see: https://github.com/astral-sh/ruff/issues/7471). ## Test Plan `cargo test`	2023-09-28 18:55:19 +00:00
Charlie Marsh	8028de8956	Improve some comments in `normalize_comment` (#7688 )	2023-09-28 03:08:25 +00:00
konsti	4d16e2308d	Formatter and parser refactoring (#7569 ) I got confused and refactored a bit; now the naming should be more consistent. This is the basis for the range formatting work. Changes: * `format_module` -> `format_module_source` (format a string) * `format_node` -> `format_module_ast` (format a program parsed into an AST) * Added `parse_ok_tokens` that takes `Token` instead of `Result<Token>` * Call the source code `source` consistently * Added a `tokens_and_ranges` helper * `python_ast` -> `module` (because that's the type)	2023-09-26 15:29:43 +02:00
Charlie Marsh	17ceb5dcb3	Preserve newlines after nested compound statements (#7608 ) ## Summary Given: ```python if True: if True: pass else: pass # a # b # c else: pass ``` We want to preserve the newline after the `# c` (before the `else`). However, the `last_node` ends at the `pass`, and the comments are trailing comments on the `pass`, not trailing comments on the `last_node` (the `if`). As such, when counting the trailing newlines on the outer `if`, we abort as soon as we see the comment (`# a`). This PR changes the logic to skip _all_ comments (even those with newlines between them). This is safe as we know that there are no "leading" comments on the `else`, so there's no risk of skipping those accidentally. Closes https://github.com/astral-sh/ruff/issues/7602. ## Test Plan No change in compatibility. Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 319 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 319 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \|	2023-09-25 14:21:44 +00:00
Charlie Marsh	865c89800e	Avoid searching for bracketed comments in unparenthesized generators (#7627 ) Similar to tuples, a generator _can_ be parenthesized or unparenthesized. Only search for bracketed comments if it contains its own parentheses. Closes https://github.com/astral-sh/ruff/issues/7623.	2023-09-24 02:08:44 +00:00
Charlie Marsh	1a4f2a9baf	Avoid reordering mixed-indent-level comments after branches (#7609 ) ## Summary Given: ```python if True: if True: if True: pass #a #b #c else: pass ``` When determining the placement of the various comments, we compute the indentation depth of each comment, and then compare it to the depth of the previous statement. It turns out this can lead to reordering comments, e.g., above, `#b` is assigned as a trailing comment of `pass`, and so gets reordered above `#a`. This PR modifies the logic such that when we compute the indentation depth of `#b`, we limit it to at most the indentation depth of `#a`. In other words, when analyzing comments at the end of branches, we don't let successive comments go any _deeper_ than their preceding comments. Closes https://github.com/astral-sh/ruff/issues/7602. ## Test Plan `cargo test` No change in similarity. Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 319 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 319 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \|	2023-09-22 18:12:31 -04:00
Charlie Marsh	d7508af48d	Truncate to one empty line in stub files (#7558 ) ## Summary This PR modifies a variety of sites in which we insert up to two empty lines to instead truncate to at most one empty line in stub files. We already enforce this in _some_ places, but not all. ## Test Plan `cargo test` No changes in similarity (as expected, since this only impacts unformatted `.pyi` files). Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 323 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 323 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \|	2023-09-21 16:24:42 -04:00
Charlie Marsh	7f1456a2c9	Allow up to two newlines before trailing clause body comments (#7575 ) ## Summary This is the peer to https://github.com/astral-sh/ruff/pull/7557, but for "leading" clause comments, like: ```python if True: pass # comment else: pass ``` In this case, we again want to allow up to two newlines at the top level. ## Test Plan `cargo test` No changes. Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 323 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 323 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \|	2023-09-21 14:52:38 +00:00
Charlie Marsh	2759db6604	Allow up to two newlines after trailing clause body comments (#7557 ) ## Summary The number of newlines after a trailing comment in a clause body needs to follow the usual rules -- so, up to two for top-level, up to one for nested, etc. For example, Black preserves both newlines after `# comment` here: ```python if True: pass # comment else: pass ``` But it truncates to one newline here: ```python if True: if True: pass # comment else: pass else: pass ``` ## Test Plan Significant improvement on `transformers`. Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99957 \| 2587 \| 402 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99963 \| 2587 \| 323 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99979 \| 3496 \| 22 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \|	2023-09-21 14:04:49 +00:00
Charlie Marsh	124d95d246	Fix instability in trailing clause body comments (#7556 ) ## Summary When we format the trailing comments on a clause body, we check if there are any newlines after the last statement; if not, we insert one. This logic didn't take into account that the last statement could itself have trailing comments, as in: ```python if True: pass # comment else: pass ``` We were thus inserting a newline after the comment, like: ```python if True: pass # comment else: pass ``` In the context of function definitions, this led to an instability, since we insert a newline _after_ a function, which would in turn lead to the bug above appearing in the second formatting pass. Closes https://github.com/astral-sh/ruff/issues/7465. ## Test Plan `cargo test` Small improvement in `transformers`, but no regressions. Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99956 \| 2587 \| 404 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99957 \| 2587 \| 402 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99967 \| 648 \| 15 \| \| zulip \| 0.99972 \| 1437 \| 21 \|	2023-09-21 13:32:16 +00:00
Charlie Marsh	4c4eceee36	Add dangling comment handling for `lambda` expressions (#7493 ) ## Summary This PR adds dangling comment handling for `lambda` expressions. In short, comments around the `lambda` and the `:` are all considered dangling. Comments that come between the `lambda` and the `:` may be moved after the colon for simplicity (this is an odd position for a comment anyway), unless they also precede the lambda parameters, in which case they're formatted before the parameters. Closes https://github.com/astral-sh/ruff/issues/7470. ## Test Plan `cargo test` No change in similarity. Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1632 \| \| django \| 0.99982 \| 2760 \| 37 \| \| transformers \| 0.99957 \| 2587 \| 398 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99929 \| 648 \| 16 \| \| zulip \| 0.99962 \| 1437 \| 22 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1632 \| \| django \| 0.99982 \| 2760 \| 37 \| \| transformers \| 0.99957 \| 2587 \| 398 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99929 \| 648 \| 16 \| \| zulip \| 0.99962 \| 1437 \| 22 \|	2023-09-19 15:23:51 -04:00
Charlie Marsh	e07670ad97	Add dangling comment handling to dictionary key-value pairs (#7495 ) ## Summary This PR fixes a formatting instability by changing the comment handling around the `:` in a dictionary to mirror that of the `:` in a lambda: we treat comments around the `:` as dangling, then format them after the `:`. Closes https://github.com/astral-sh/ruff/issues/7458. ## Test Plan `cargo test` No change in similarity. Before: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99956 \| 2587 \| 404 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99929 \| 648 \| 16 \| \| zulip \| 0.99969 \| 1437 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1631 \| \| django \| 0.99983 \| 2760 \| 36 \| \| transformers \| 0.99956 \| 2587 \| 404 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99929 \| 648 \| 16 \| \| zulip \| 0.99969 \| 1437 \| 21 \|	2023-09-19 19:17:21 +00:00
Micha Reiser	26ae0a6e8d	Fix dangling module comments (#7456 )	2023-09-17 14:56:41 +00:00
Micha Reiser	c907317199	Fix build (#7437 )	2023-09-16 14:50:36 +00:00
konsti	2cbe1733c8	Use CommentRanges in backwards lexing (#7360 ) ## Summary The tokenizer was split into a forward and a backwards tokenizer. The backwards tokenizer uses the same names as the forwards ones (e.g. `next_token`). The backwards tokenizer gets the comment ranges that we already built to skip comments. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-09-16 03:21:45 +00:00
Charlie Marsh	cc9e84c144	Format trailing operator comments as dangling (#7427 ) ## Summary Given a trailing operator comment in a unary expression, like: ```python if ( not # comment a): ... ``` We were attaching these to the operand (`a`), but formatting them in the unary operator via special handling. Parents shouldn't format the comments of their children, so this instead attaches them as dangling comments on the unary expression. (No intended change in formatting.)	2023-09-15 20:34:09 -04:00
Micha Reiser	2d9b39871f	Introduce `IndentWidth` (#7301 )	2023-09-13 14:52:24 +02:00
konsti	f4c7bff36b	Don't reorder parameters in function calls (#7268 ) ## Summary In `f(*args, a=b, *args2, **kwargs)` the args (`*args`, `*args2`) and keywords (`a=b`, `**kwargs`) are interleaved, which we previously didn't handle. Fixes #6498 main \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1632 \| \| django \| 0.99966 \| 2760 \| 58 \| \| transformers \| 0.99930 \| 2587 \| 447 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99825 \| 648 \| 22 \| \| zulip \| 0.99950 \| 1437 \| 27 \| PR \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1632 \| \| django \| 0.99967 \| 2760 \| 53 \| \| transformers \| 0.99930 \| 2587 \| 447 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99825 \| 648 \| 22 \| \| zulip \| 0.99950 \| 1437 \| 27 \| ## Test Plan New fixtures	2023-09-13 09:01:49 +00:00
Micha Reiser	1e6df19a35	Bool expression comment placement (#7269 )	2023-09-12 06:39:57 +00:00
konsti	3a2c3a7398	Format empty lines in stub files like black's preview style (#7206 ) ## Summary Fix all but one of the empty line differences with the black preview style in typeshed. The remaining differences are breaking with type comments and trailing commas in function definitions. I compared the empty line differences with the preview mode of black since stable has some oddities that would have been hard to replicate (https://github.com/psf/black/issues/3861). Additionally, it assumes the style proposed in https://github.com/psf/black/issues/3862. An edge case that also surfaced with typeshed is newlines before trailing module comments. main \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1632 \| \| django \| 0.99966 \| 2760 \| 58 \| \| transformers \| 0.99930 \| 2587 \| 447 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3496 \| 2173 \| \| warehouse \| 0.99825 \| 648 \| 22 \| \| zulip \| 0.99950 \| 1437 \| 27 \| PR \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1632 \| \| django \| 0.99966 \| 2760 \| 58 \| \| transformers \| 0.99930 \| 2587 \| 447 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99983 \| 3496 \| 18 \| \| warehouse \| 0.99825 \| 648 \| 22 \| \| zulip \| 0.99950 \| 1437 \| 27 \| Closes #6723 ## Test Plan The main driver was the typeshed diff. I added new test cases for all kinds of possible empty line combinations in stub files, and test cases for newlines before trailing module comments. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-09-11 08:03:59 +00:00
Micha Reiser	e376c3ff7e	Split implicit concatenated strings before binary expressions (#7145 )	2023-09-08 06:51:26 +00:00
konsti	45f9fca228	Reuse locator in formatter comments (#7227 ) Summary The comment visitor used to rebuild the locator for every comment. Instead, we now keep the locator on the builder. Follow-up to #6813. Test Plan No formatting changes.	2023-09-07 20:08:28 +02:00
konsti	447b7cb0e2	Formatter: Show preceding, following and enclosing nodes of comments, Attempt 2 (#6813 )	2023-09-06 12:26:13 +02:00
Micha Reiser	5f59101811	Memoize text width (#6552 )	2023-09-06 07:10:13 +00:00
Micha Reiser	175b3702c3	Reduce `comments.clone` calls (#7144 )	2023-09-05 11:32:56 +02:00
Charlie Marsh	7be28a38c5	Cache comment lookups in `suite.rs` (#7092 )	2023-09-04 08:45:14 +00:00
Micha Reiser	c05e4628b1	Introduce Token element (#7048 )	2023-09-02 10:05:47 +02:00
Charlie Marsh	dea65536e9	Fix placement for comments within f-strings concatenations (#7047 ) ## Summary Restores the dangling comment handling for f-strings, which broke with the parenthesized expression code. Closes https://github.com/astral-sh/ruff/issues/6898. ## Test Plan `cargo test` No change in any of the similarity indexes or changed file counts: \| project \| similarity index \| total files \| changed files \| \|--------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.76083 \| 1789 \| 1632 \| \| django \| 0.99957 \| 2760 \| 67 \| \| transformers \| 0.99927 \| 2587 \| 468 \| \| twine \| 0.99982 \| 33 \| 1 \| \| typeshed \| 0.99978 \| 3496 \| 2173 \| \| warehouse \| 0.99818 \| 648 \| 24 \| \| zulip \| 0.99942 \| 1437 \| 32 \|	2023-09-01 16:27:32 +00:00
Chris Pryer	0489bbc54c	Match Black's formatting of trailing comments containing NBSP (#7030 )	2023-09-01 14:52:59 +02:00
Chris Pryer	17a44c0078	Exclude pragma comments from measured line width (#7008 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-09-01 06:34:51 +00:00
Charlie Marsh	376d3caf47	Treat empty-line separated comments as trailing statement comments (#6999 ) ## Summary This PR modifies our between-statement comment handling such that comments that are not separated from a statement by any newlines continue to be treated as leading comments on the statement, but comments that _are_ separated are instead formatted as trailing comments on the preceding statement. See, e.g., the originating snippet: ```python DEFAULT_TEMPLATE = "flatpages/default.html" # This view is called from FlatpageFallbackMiddleware.process_response # when a 404 is raised, which often means CsrfViewMiddleware.process_view # has not been called even if CsrfViewMiddleware is installed. So we need # to use @csrf_protect, in case the template needs {% csrf_token %}. # However, we can't just wrap this view; if no matching flatpage exists, # or a redirect is required for authentication, the 404 needs to be returned # without any CSRF checks. Therefore, we only # CSRF protect the internal implementation. def flatpage(request, url): pass ``` Here, we need to ensure that the `def flatpage` is preceded by two empty lines. However, we want those two empty lines to be enforced from the _end_ of the comment block, _unless_ the comments are directly atop the `def flatpage`. I played with this a bit, and I think the simplest conceptual model and implementation is to instead treat those as trailing comments on the preceding node. The main difficulty with this approach is that, in order to be fully compatible with Black, we'd sometimes need to insert newlines _between_ the preceding node and its trailing comments. See, e.g.: ```python def func(): ... # comment x = 1 ``` In this case, we'd need to insert two blank lines between `def func(): ...` and `# comment`, but `# comment` is a trailing comment on `def func(): ...`. So, we'd need to take this case into account in the various nodes that _require_ newlines after them: functions, classes, and imports. 
After some discussion, we've opted _not_ to support this, and just treat these as trailing comments -- so we won't insert newlines there. This means our handling is still identical to Black's on Black-formatted code, but avoids moving such trailing comments on unformatted code. I dislike that the empty handling is so complex, and that it's split between so many different nodes, but this is really tricky. Continuing to treat these as leading comments is very difficult too, since we'd need to do similar tricks for the leading comment handling in those nodes, and influencing leading comments is even harder, since they're all formatted _before_ the node itself. Closes https://github.com/astral-sh/ruff/issues/6761. ## Test Plan `cargo test` Surprisingly, it doesn't change the similarity at all (apart from a 0.00001 change in CPython), but I manually confirmed that it did fix the originating issue in Django. Before: \| project \| similarity index \| \|--------------\|------------------\| \| cpython \| 0.76082 \| \| django \| 0.99921 \| \| transformers \| 0.99854 \| \| twine \| 0.99982 \| \| typeshed \| 0.99953 \| \| warehouse \| 0.99648 \| \| zulip \| 0.99928 \| After: \| project \| similarity index \| \|--------------\|------------------\| \| cpython \| 0.76081 \| \| django \| 0.99921 \| \| transformers \| 0.99854 \| \| twine \| 0.99982 \| \| typeshed \| 0.99953 \| \| warehouse \| 0.99648 \| \| zulip \| 0.99928 \|	2023-08-31 20:55:05 +00:00
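The leading-versus-trailing rule from #6999 can be sketched as a toy model in Python. This is not Ruff's implementation, which works on AST nodes and source ranges in Rust; the function name `split_comments` and its list-of-lines input are assumptions made for illustration, and indentation is ignored. Given the lines found between two statements, comments sitting directly atop the following statement stay leading, and everything above the nearest blank line becomes trailing on the preceding statement.

```python
def split_comments(between: list[str]) -> tuple[list[str], list[str]]:
    """Split the lines between two statements into (trailing, leading) comments.

    `between` holds only comment lines and blank lines, top to bottom.
    """
    index = len(between)
    # Walk upward from the following statement while the lines are comments;
    # the first blank line ends the run of leading comments.
    while index > 0 and between[index - 1].strip().startswith("#"):
        index -= 1
    leading = [line.strip() for line in between[index:]]
    # Any comment above that point is separated from the following statement
    # by at least one blank line, so it attaches to the preceding statement.
    trailing = [
        line.strip() for line in between[:index] if line.strip().startswith("#")
    ]
    return trailing, leading


if __name__ == "__main__":
    # A comment followed by a blank line before the next statement.
    print(split_comments(["# comment", ""]))
    # A comment directly atop the next statement.
    print(split_comments(["", "", "# comment"]))
```

Under this model the formatter can enforce the required empty lines after the trailing block rather than before it, which matches the behavior the commit message describes for comments that are not directly atop the next statement.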
Micha Reiser	eb552da8a9	Avoid parenthesizing multiline strings in binary expressions (#6973 )	2023-08-30 16:03:17 +02:00
Charlie Marsh	e2b2b1759f	Handle keyword comments between = and value (#6883 ) ## Summary This PR adds comment handling for comments between the `=` and the `value` for keywords, as in the following cases: ```python func( x # dangling = # dangling # dangling 1, ** # dangling y ) ``` (Comments after the `**` were already handled in some cases, but I've unified the handling with the `=` handling.) Note that, previously, comments between the `**` and its value were rendered as trailing comments on the value (so they'd appear after `y`). This struck me as odd since it effectively re-ordered the comment with respect to its closest AST node (the value). I've made them leading comments, though I don't know that that's a significant improvement. I could also imagine us leaving them where they are.	2023-08-30 09:52:51 -04:00
Chris Pryer	a3f4d7745a	Use reserved width to include line suffix measurement (#6901 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-08-30 08:07:11 +00:00
Charlie Marsh	b404e54f33	Remove unnecessary `Comment#slice` calls (#6997 )	2023-08-30 00:44:11 +00:00
Chris Pryer	039694aaed	Add `LineSuffix` reserved width (#6830 ) Thanks for working on this.	2023-08-28 07:46:54 +02:00
Charlie Marsh	059757a8c8	Implement `Ranged` on more structs (#6921 ) Now that it's in `ruff_text_size`, we can use it in a few places that we couldn't before.	2023-08-27 19:03:08 +00:00


178 commits