mirrors/ruff

mirror of https://github.com/astral-sh/ruff.git synced 2025-08-15 16:10:17 +00:00

Author	SHA1	Message	Date
Micha Reiser	ac02d3aedd	Hug multiline-strings preview style (#9243 )	2024-01-10 12:47:34 +01:00
Micha Reiser	a06723da2b	Parenthesize multi-context managers (#9222 )	2023-12-22 03:41:03 +00:00
Micha Reiser	fa2c37b411	Parenthesize long type annotations in annotated assignments (#9210 )	2023-12-22 03:33:47 +00:00
Micha Reiser	25b2361411	Extend `can_omit_optional_parentheses` documentation (#9127 ) ## Summary Add some more documentation to `can_omit_optional_parentheses` because it is really hard to understand. Restrict the `Attribute` and `None` `OperatorPrecedence` branches to ensure they only get applied to the intended nodes. ## Test Plan Ecosystem check reports no differences. The compatibility index remains unchanged.	2023-12-15 11:18:40 +09:00
Dhruv Manilawala	189e947808	Split string formatting to individual nodes (#9058 ) This PR splits the string formatting code in the formatter to be handled by the respective nodes. Previously, the string formatting was done through a single `FormatString` interface. Now, the nodes themselves are responsible for formatting. The following changes were made: 1. Remove `StringLayout::ImplicitStringConcatenationInBinaryLike` and inline the call to `FormatStringContinuation`. After the refactor, the binary like formatting would delegate to `FormatString` which would then delegate to `FormatStringContinuation`. This removes the intermediary steps. 2. Add formatter implementation for `FStringPart` which delegates it to the respective string literal or f-string node. 3. Add `ExprStringLiteralKind` which is either `String` or `Docstring`. If it's a docstring variant, then the string expression would not be implicitly concatenated. This is guaranteed by the `DocstringStmt::try_from_expression` constructor. 4. Add `StringLiteralKind` which is either a `String`, `Docstring` or `InImplicitlyConcatenatedFString`. The last variant is for when the string literal is implicitly concatenated with an f-string (`"foo" f"bar {x}"`). 5. Remove `FormatString`. 6. Extract the f-string quote detection as a standalone function which is public to the crate. This is used to detect the quote to be used for an f-string at the expression level (`ExprFString` or `FormatStringContinuation`). 
### Formatter ecosystem result This PR \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75804 \| 1799 \| 1648 \| \| django \| 0.99984 \| 2772 \| 34 \| \| home-assistant \| 0.99955 \| 10596 \| 214 \| \| poetry \| 0.99905 \| 321 \| 15 \| \| transformers \| 0.99967 \| 2657 \| 324 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99980 \| 3669 \| 18 \| \| warehouse \| 0.99976 \| 654 \| 14 \| \| zulip \| 0.99958 \| 1459 \| 36 \| main \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75804 \| 1799 \| 1648 \| \| django \| 0.99984 \| 2772 \| 34 \| \| home-assistant \| 0.99955 \| 10596 \| 214 \| \| poetry \| 0.99905 \| 321 \| 15 \| \| transformers \| 0.99967 \| 2657 \| 324 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99980 \| 3669 \| 18 \| \| warehouse \| 0.99976 \| 654 \| 14 \| \| zulip \| 0.99958 \| 1459 \| 36 \|	2023-12-14 12:55:10 -06:00
Micha Reiser	c99eae2c08	`can_omit_optional_parentheses`: Exit early for unparenthesized expressions (#9125 )	2023-12-14 06:02:53 +00:00
Micha Reiser	7256b882b9	Fix `can_omit_optional_parentheses` for expressions with a right most fstring (#9124 )	2023-12-14 04:58:17 +00:00
Micha Reiser	45f603000d	`prefer_splitting_right_hand_side_of_assignments` preview style (#8943 )	2023-12-13 03:43:23 +00:00
Dhruv Manilawala	cdac90ef68	New AST nodes for f-string elements (#8835 ) Rebase of #6365 authored by @davidszotten. ## Summary This PR updates the AST structure for f-string elements. The main motivation behind this change is to have a dedicated node for the string part of an f-string. Previously, the existing `ExprStringLiteral` node was used for this purpose which isn't exactly correct. The `ExprStringLiteral` node should include the quotes as well in the range but the f-string literal element doesn't include the quote as it's a specific part within an f-string. For example, ```python f"foo {x}" # ^^^^ # This is the literal part of an f-string ``` The new `FStringElement` enum represents either the literal part or the expression part of an f-string. ### Rule Updates This means that there'll be two nodes representing a string depending on the context. One for a normal string literal while the other is a string literal within an f-string. The AST checker is updated to accommodate this change. The rules which work on string literals are updated to check the literal part of an f-string as well. #### Notes 1. The `Expr::is_literal_expr` method would check for `ExprStringLiteral` and return true if so. But now that we don't represent the literal part of an f-string using that node, this improves the method's behavior and confines it to the actual expression. We do have the `FStringElement::is_literal` method. 2. We avoid checking if we're in an f-string context before adding to `string_type_definitions` because the f-string literal is now a dedicated node and not part of `Expr`. 3. Annotations cannot use f-strings so we avoid changing any rules which work on annotations and check for `ExprStringLiteral`. ## Test Plan - All references of `Expr::StringLiteral` were checked to see if any of the rules require updating to account for the f-string literal element node. 
- New test cases are added for rules which check against the literal part of an f-string. - Check the ecosystem results and ensure it remains unchanged. ## Performance There's a performance penalty in the parser. The reason for this remains unknown as it seems that the generated assembly code is now different for the `__reduce154` function. The reduce function body is just popping the `ParenthesizedExpr` on top of the stack and pushing it with the new location. - The size of `FStringElement` enum is the same as `Expr` which is what it replaces in `FString::format_spec` - The size of `FStringExpressionElement` is the same as `ExprFormattedValue` which is what it replaces I tried reducing the `Expr` enum from 80 bytes to 72 bytes but it hardly resulted in any performance gain. The difference can be seen here: - Original profile: https://share.firefox.dev/3Taa7ES - Profile after boxing some node fields: https://share.firefox.dev/3GsNXpD ### Backtracking I tried backtracking the changes to see if any of the isolated change produced this regression. The problem here is that the overall change is so small that there's only a single checkpoint where I can backtrack and that checkpoint results in the same regression. This checkpoint is to revert using `Expr` to the `FString::format_spec` field. After this point, the change would revert back to the original implementation. ## Review process The review process is similar to #7927. The first set of commits update the node structure, parser, and related AST files. Then, further commits update the linter and formatter part to account for the AST change. --------- Co-authored-by: David Szotten <davidszotten@gmail.com>	2023-12-07 10:28:05 -06:00
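As a rough analogy only: CPython's standard `ast` module also models an f-string as a container of literal and expression parts. The sketch below is not Ruff's Rust AST; it uses the stdlib `JoinedStr` and `FormattedValue` nodes, and the helper name `fstring_parts` is hypothetical. It only illustrates the literal/expression split that `FStringElement` is described as capturing in the commit above.

```python
import ast


def fstring_parts(source: str) -> list[tuple[str, str]]:
    """Return (kind, text) pairs for the parts of a single f-string expression.

    Hypothetical helper for illustration; relies on CPython's `ast`, not Ruff.
    """
    node = ast.parse(source, mode="eval").body
    if not isinstance(node, ast.JoinedStr):
        raise ValueError("expected an f-string expression")
    parts = []
    for value in node.values:
        if isinstance(value, ast.FormattedValue):
            # The expression part, e.g. `x` in `{x}`.
            parts.append(("expression", ast.unparse(value.value)))
        else:
            # The literal part: no quotes, just the text inside the f-string.
            parts.append(("literal", value.value))
    return parts


parts = fstring_parts('f"foo {x}"')
```

Here the literal part carries the text `foo ` (with the trailing space) and no quotes, which mirrors the range argument made in the commit message.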
Micha Reiser	0bda1913d1	Create dedicated `is_*_enabled` functions for each preview style (#8988 )	2023-12-04 05:38:54 +00:00
Micha Reiser	8088c5367a	Refactor the comment handling of a statement's last expression (#8920 )	2023-12-04 05:12:12 +00:00
Charlie Marsh	019d9aebe9	Implement multiline dictionary and list hugging for preview style (#8293 ) ## Summary This PR implements Black's new single-argument hugging for lists, sets, and dictionaries under preview style. For example, this: ```python foo( [ 1, 2, 3, ] ) ``` Would instead now be formatted as: ```python foo([ 1, 2, 3, ]) ``` A couple notes: - This doesn't apply when the argument has a magic trailing comma. - This _does_ apply when the argument is starred or double-starred. - We don't apply this when there are comments before or after the argument, though Black does in some cases (and moves the comments outside the call parentheses). It doesn't say it in the originating PR (https://github.com/psf/black/pull/3964), but I think this also applies to parenthesized expressions? At least, it does in my testing of preview vs. stable, though it's possible that behavior predated the linked PR. See: #8279. ## Test Plan Before: \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75804 \| 1799 \| 1648 \| \| django \| 0.99984 \| 2772 \| 34 \| \| home-assistant \| 0.99963 \| 10596 \| 146 \| \| poetry \| 0.99925 \| 317 \| 12 \| \| transformers \| 0.99967 \| 2657 \| 322 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99980 \| 3669 \| 18 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 21 \| After: \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75804 \| 1799 \| 1648 \| \| django \| 0.99984 \| 2772 \| 34 \| \| home-assistant \| 0.99963 \| 10596 \| 146 \| \| poetry \| 0.96215 \| 317 \| 34 \| \| transformers \| 0.99967 \| 2657 \| 322 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99980 \| 3669 \| 18 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 21 \|	2023-11-30 21:11:14 -05:00
Dhruv Manilawala	017e829115	Update string nodes for implicit concatenation (#7927 ) ## Summary This PR updates the string nodes (`ExprStringLiteral`, `ExprBytesLiteral`, and `ExprFString`) to account for implicit string concatenation. ### Motivation In Python, implicitly concatenated strings are joined while parsing because the interpreter doesn't require the information for each part. While that's feasible for an interpreter, it falls short for a static analysis tool where having such information is more useful. Currently, various parts of the code use the lexer to get the individual string parts. One of the main challenges this solves is that of string formatting. Currently, the formatter relies on the lexer to get the individual string parts, and formats them including the comments accordingly. But, with PEP 701, f-strings can also contain comments. Without this change, it becomes very difficult to add support for f-string formatting. ### Implementation The initial proposal was made in this discussion: https://github.com/astral-sh/ruff/discussions/6183#discussioncomment-6591993. There were various AST designs which were explored for this task which are available in the linked internal document[^1]. The selected variant was the one where the nodes were kept as is except that the `implicit_concatenated` field was removed and instead a new struct was added to the `Expr*` struct. This would be a private struct that contains the actual implementation of how the AST is designed for both single and implicitly concatenated strings. This implementation is achieved through an enum with two variants: `Single` and `Concatenated` to avoid allocating a vector even for single strings. There are various public methods available on the value struct to query certain information regarding the node. 
The nodes are structured in the following way: ``` ExprStringLiteral - "foo" "bar" \|- StringLiteral - "foo" \|- StringLiteral - "bar" ExprBytesLiteral - b"foo" b"bar" \|- BytesLiteral - b"foo" \|- BytesLiteral - b"bar" ExprFString - "foo" f"bar {x}" \|- FStringPart::Literal - "foo" \|- FStringPart::FString - f"bar {x}" \|- StringLiteral - "bar " \|- FormattedValue - "x" ``` [^1]: Internal document: https://www.notion.so/astral-sh/Implicit-String-Concatenation-e036345dc48943f89e416c087bf6f6d9?pvs=4 #### Visitor The way the nodes are structured is that the entire string, including all the parts that are implicitly concatenated, is a single node containing individual nodes for the parts. The previous section has a representation of that tree for all the string nodes. This means that new visitor methods are added to visit the individual parts of string, bytes, and f-strings for `Visitor`, `PreorderVisitor`, and `Transformer`. ## Test Plan - `cargo insta test --workspace --all-features --unreferenced reject` - Verify that the ecosystem results are unchanged	2023-11-24 17:55:41 -06:00
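The motivation in the commit above (the interpreter joins implicitly concatenated strings at parse time, so per-part information has to come from the lexer) can be observed with CPython's own tooling. This is a minimal sketch using only the stdlib `ast` and `tokenize` modules; it says nothing about Ruff's internals, and the variable names are illustrative.

```python
import ast
import io
import tokenize

source = '"foo" "bar"\n'

# The parser joins the parts into one constant, so the AST has lost the split.
joined = ast.parse(source).body[0].value.value

# The token stream still has one STRING token per part.
string_tokens = [
    tok.string
    for tok in tokenize.generate_tokens(io.StringIO(source).readline)
    if tok.type == tokenize.STRING
]
```

The AST alone cannot say where one part ends and the next begins, which is the gap the `Single`/`Concatenated` representation described above is meant to close.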
Dhruv Manilawala	3e00ddce38	Preserve trailing semicolon for Notebooks (#8590 ) ## Summary This PR updates the formatter to preserve trailing semicolon for Jupyter Notebooks. The motivation behind the change is that semicolons in notebooks are typically used to hide the output, for example when plotting. This is highlighted in the linked issue. The conditions required as to when the trailing semicolon should be preserved are: 1. It should be a top-level statement which is last in the module. 2. For statement, it can be either assignment, annotated assignment, or augmented assignment. Here, the target should only be a single identifier i.e., multiple assignments or tuple unpacking isn't considered. 3. For expression, it can be any. ## Test Plan Add a new integration test in `ruff_cli`. The test notebook basically acts as a document as to which trailing semicolons are to be preserved. fixes: #8254	2023-11-10 21:53:35 +05:30
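The three conditions listed in the commit above can be sketched as a predicate over the last top-level statement. This is an illustration built on CPython's `ast` module, not the formatter's Rust code; the function name `last_statement_may_keep_semicolon` is hypothetical, and the sketch only checks the statement's shape, not whether a semicolon is actually present.

```python
import ast


def last_statement_may_keep_semicolon(source: str) -> bool:
    """Check the shape conditions for preserving a trailing semicolon.

    Hypothetical helper: mirrors the rules from the commit message only.
    """
    module = ast.parse(source)
    if not module.body:
        return False
    # Condition 1: a top-level statement that is last in the module.
    last = module.body[-1]
    # Condition 3: any expression statement qualifies.
    if isinstance(last, ast.Expr):
        return True
    # Condition 2: assignments whose target is a single identifier.
    if isinstance(last, ast.Assign):
        return len(last.targets) == 1 and isinstance(last.targets[0], ast.Name)
    if isinstance(last, (ast.AnnAssign, ast.AugAssign)):
        return isinstance(last.target, ast.Name)
    return False
```

For instance, `plt.plot(x);` and `x = 1;` pass the check, while `a, b = 1, 2;` and `a = b = 1;` do not.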
Micha Reiser	dd2d8cb579	Avoid parenthesizing unsplittable because of comments (#8431 )	2023-11-03 05:12:59 +00:00
Dhruv Manilawala	b5a4a9a356	Inline `ExprNumberLiteral` formatting logic (#8340 ) ## Summary This PR inlines the formatting logic for `ExprNumberLiteral` and removes the need of having dedicated `Format*` struct for each number type. ## Test Plan `cargo test`	2023-10-30 14:09:38 +05:30
Dhruv Manilawala	230c9ce236	Split `Constant` to individual literal nodes (#8064 ) ## Summary This PR splits the `Constant` enum as individual literal nodes. It introduces the following new nodes for each variant: * `ExprStringLiteral` * `ExprBytesLiteral` * `ExprNumberLiteral` * `ExprBooleanLiteral` * `ExprNoneLiteral` * `ExprEllipsisLiteral` The main motivation behind this refactor is to introduce the new AST node for implicit string concatenation in the coming PR. The elements of that node will be either a string literal, bytes literal or an f-string which can be implemented using an enum. This means that a string or bytes literal cannot be represented by `Constant::Str` / `Constant::Bytes` which creates an inconsistency. This PR avoids that inconsistency by splitting the constant nodes into their own literal nodes, literal being the more appropriate naming convention from a static analysis tool perspective. This also makes working with literals in the linter and formatter much more ergonomic. For example, checking whether a node is a string literal can be done easily using `Expr::is_string_literal_expr` or by matching against `Expr::StringLiteral`, as opposed to matching against the `ExprConstant` and enum `Constant`. A few AST helper methods can be simplified as well which will be done in a follow-up PR. This introduces a new `Expr::is_literal_expr` method which is the same as `Expr::is_constant_expr`. There are also a few intermediary changes related to implicit string concatenation. They are kept small so as to avoid making this already huge PR even larger. ## Test Plan 1. Verify and update all of the existing snapshots (parser, visitor) 2. 
Verify that the ecosystem check output remains unchanged for both the linter and formatter ### Formatter ecosystem check #### `main` \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1647 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \| #### `dhruv/constant-to-literal` \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1647 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \|	2023-10-30 12:13:23 +05:30
Micha Reiser	f5e850745c	Only omit optional parentheses for starting or ending with parentheses (#8238 )	2023-10-26 07:28:58 +01:00
Dhruv Manilawala	dbd84c947b	Formatter parentheses support for `IpyEscapeCommand` (#8207 ) ## Summary This PR removes the `todo!()` around `IpyEscapeCommand` in the formatter. The `NeedsParentheses` trait needs to be implemented, and its implementation always returns `Never`. The reason is that if an escape command is parenthesized, it is no longer parsed as an escape command. In other words, parentheses shouldn't be present around an escape command. Similarly, the `CanSkipOptionalParenthesesVisitor` will skip this node. ## Test Plan Updated the `unformatted.ipynb` fixture with new cells containing IPython escape commands and the corresponding snapshot was verified. Also, tested it out in a few open source repositories containing notebooks (`openai/openai-cookbook`, `huggingface/notebooks`). #### New cells in `unformatted.ipynb` Cell 2 ```markdown A markdown cell ``` Cell 3 ```python def some_function(foo, bar): pass %matplotlib inline ``` Cell 4 ```python foo = %pwd def some_function(foo,bar,): foo = %pwd print(foo ) ``` fixes: #8204	2023-10-25 14:01:50 +00:00
Charlie Marsh	95702e408f	Respect parenthesized generators in `has_own_parentheses` (#8100 ) ## Summary When analyzing: ```python if "root" not in ( long_tree_name_tree.split("/")[0] for long_tree_name_tree in really_really_long_variable_name ): msg = "Could not find root. Please try a different forest." raise ValueError(msg) ``` We missed that the generator expression is parenthesized, because the parentheses are _part_ of the generator -- so `is_expression_parenthesized` returns `False`. We needed to take into account that generators and tuples may or may not be parenthesized when determining whether we can omit parentheses while splitting an expression. Closes https://github.com/astral-sh/ruff/issues/8090. ## Test Plan No changes in similarity. Before: \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1647 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \| After: \| project \| similarity index \| total files \| changed files \| \|----------------\|------------------:\|------------------:\|------------------:\| \| cpython \| 0.75803 \| 1799 \| 1647 \| \| django \| 0.99983 \| 2772 \| 34 \| \| home-assistant \| 0.99953 \| 10596 \| 186 \| \| poetry \| 0.99891 \| 317 \| 17 \| \| transformers \| 0.99966 \| 2657 \| 330 \| \| twine \| 1.00000 \| 33 \| 0 \| \| typeshed \| 0.99978 \| 3669 \| 20 \| \| warehouse \| 0.99977 \| 654 \| 13 \| \| zulip \| 0.99970 \| 1459 \| 22 \|	2023-10-22 19:58:25 -04:00
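The point that a generator's parentheses are _part_ of the node, unlike ordinary grouping parentheses, can also be seen in CPython's `ast` module. The sketch below is an analogy under that assumption, not Ruff's `is_expression_parenthesized` or `has_own_parentheses`; the helper name `own_text` is hypothetical.

```python
import ast


def own_text(source: str) -> str:
    """Return the source text covered by the top-level expression node's range."""
    node = ast.parse(source, mode="eval").body
    return ast.get_source_segment(source, node)


# Grouping parentheses around a binary operation sit outside the node's range.
binop_text = own_text("(1 + 2)")

# A generator expression's range includes its own parentheses.
generator_text = own_text("(a for a in b)")
```

A check that only asks "is there a parenthesis outside this node's range?" therefore answers no for the generator, which is the situation the commit describes.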
konsti	8f9753f58e	Comments outside expression parentheses (#7873 ) ## Summary Fixes https://github.com/astral-sh/ruff/issues/7448 Fixes https://github.com/astral-sh/ruff/issues/7892 I've removed automatic dangling comment formatting, we're doing manual dangling comment formatting everywhere anyway (the assert-all-comments-formatted ensures this) and dangling comments would break the formatting there. ## Test Plan New test file. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-10-19 09:24:11 +00:00
Charlie Marsh	d685107638	Move {AnyNodeRef, AstNode} to ruff_python_ast crate root (#8030 ) This is a do-over of https://github.com/astral-sh/ruff/pull/8011, which I accidentally merged into a non-`main` branch. Sorry!	2023-10-18 00:01:18 +00:00
Charlie Marsh	b42a8972bf	Use `Expr::is_*` methods in more matches (#7714 )	2023-09-29 17:28:50 +00:00
Charlie Marsh	1c02fcd7ce	Avoid unnecessary comments check in `maybe_parenthesize_expression` (#7686 ) ## Summary No-op refactor, but we can evaluate early if the first part of `preserve_parentheses \|\| has_comments` is `true`, and thus avoid looking up the node comments. ## Test Plan `cargo test`	2023-09-28 13:42:12 -04:00
Micha Reiser	6a4dbd622b	Add optimized `best_fit_parenthesize` IR (#7475 )	2023-09-19 06:29:05 +00:00
konsti	2cbe1733c8	Use CommentRanges in backwards lexing (#7360 ) ## Summary The tokenizer was split into a forward and a backwards tokenizer. The backwards tokenizer uses the same names as the forwards ones (e.g. `next_token`). The backwards tokenizer gets the comment ranges that we already built to skip comments. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-09-16 03:21:45 +00:00
Micha Reiser	a65efcf459	fix: Don't omit optional parentheses for subscripts (#7380 )	2023-09-14 08:43:53 +00:00
Micha Reiser	47a253fb62	Add PreviewMode option to formatter ## Summary This PR adds the `--preview` and `--no-preview` options to the `format` command (hidden) and passes it through to the formatter. ## Test Plan I added the `dbg(f.options().preview())` statement in `FormatNodeRule::fmt` and verified that the option gets correctly passed to the formatter.	2023-09-08 12:04:28 +02:00
Micha Reiser	e376c3ff7e	Split implicit concatenated strings before binary expressions (#7145 )	2023-09-08 06:51:26 +00:00
Charlie Marsh	ece30e7c69	Preserve parentheses around partial call chains (#7109 )	2023-09-04 10:57:04 +01:00
Micha Reiser	c05e4628b1	Introduce Token element (#7048 )	2023-09-02 10:05:47 +02:00
Micha Reiser	92143afeee	Group binary operators with same precedence only (#7010 )	2023-08-31 09:19:45 +02:00
Chris Pryer	a3f4d7745a	Use reserved width to include line suffix measurement (#6901 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-08-30 08:07:11 +00:00
Micha Reiser	715d86dae9	Remove Comprehension priority (#6947 )	2023-08-29 08:30:15 +02:00
Micha Reiser	60097bebcd	Handle implicit strings in `can_omit_parentheses` (#6940 )	2023-08-28 12:20:29 +00:00
Charlie Marsh	edb9b0c62a	Use the formatter prelude in more files (#6882 ) Removes a bunch of imports that are made redundant by the prelude.	2023-08-25 16:51:07 -04:00
Charlie Marsh	813d7da7ec	Respect own-line leading comments before parenthesized nodes (#6820 ) ## Summary This PR ensures that if an expression has an own-line leading comment _before_ its open parentheses, we render it as such. For example, given: ```python [ # foo # bar ( # baz 1 ) ] ``` On `main`, we format as: ```python [ # foo ( # bar # baz 1 ) ] ``` As of this PR, we format as: ```python [ # foo # bar ( # baz 1 ) ] ``` ## Test Plan `cargo test`	2023-08-25 00:18:05 -04:00
Micha Reiser	04a9a8dd03	Maybe parenthesize long constants and names (#6816 )	2023-08-24 09:47:57 +00:00
Micha Reiser	ccac9681e1	Preserve yield parentheses (#6766 )	2023-08-22 10:27:20 +00:00
Micha Reiser	8b347cdaa9	Simplify IfRequired needs parentheses condition (#6678 )	2023-08-21 07:11:31 +00:00
Micha Reiser	0cea4975fc	Rename Comments methods (#6649 )	2023-08-18 06:37:01 +00:00
Charlie Marsh	1334232168	Introduce `ExpressionRef` (#6637 ) ## Summary This PR revives the `ExpressionRef` concept introduced in https://github.com/astral-sh/ruff/pull/5644, motivated by the change we want to make in https://github.com/astral-sh/ruff/pull/6575 to narrow the type of the expression that can be passed to `parenthesized_range`. ## Test Plan `cargo test`	2023-08-17 10:07:16 -04:00
Micha Reiser	4dc32a00d0	Support `fmt: skip` for simple-statements and decorators (#6561 )	2023-08-17 05:58:19 +00:00
Charlie Marsh	a3d4f08f29	Add general support for parenthesized comments on expressions (#6485 ) ## Summary This PR adds support for parenthesized comments. A parenthesized comment is a comment that appears within a parenthesis, but not within the range of the expression enclosed by the parenthesis. For example, the comment here is a parenthesized comment: ```python if ( # comment True ): ... ``` The parentheses enclose the `True`, but the range of `True` doesn’t include the `# comment`. There are at least two problems associated with parenthesized comments: (1) associating the comment with the correct (i.e., enclosed) node; and (2) formatting the comment correctly, once it has been associated with the enclosed node. The solution proposed here for (1) is to search for parentheses between preceding and following node, and use open and close parentheses to break ties, rather than always assigning to the preceding node. For (2), we handle these special parenthesized comments in `FormatExpr`. The biggest risk with this approach is that we forget some codepath that force-disables parenthesization (by passing in `Parentheses::Never`). I've audited all usages of that enum and added additional handling + test coverage for such cases. Closes https://github.com/astral-sh/ruff/issues/6390. ## Test Plan `cargo test` with new cases. Before: \| project \| similarity index \| \|--------------\|------------------\| \| build \| 0.75623 \| \| cpython \| 0.75472 \| \| django \| 0.99804 \| \| transformers \| 0.99618 \| \| typeshed \| 0.74233 \| \| warehouse \| 0.99601 \| \| zulip \| 0.99727 \| After: \| project \| similarity index \| \|--------------\|------------------\| \| build \| 0.75623 \| \| cpython \| 0.75472 \| \| django \| 0.99804 \| \| transformers \| 0.99618 \| \| typeshed \| 0.74237 \| \| warehouse \| 0.99601 \| \| zulip \| 0.99727 \|	2023-08-15 18:59:18 +00:00
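The definition of a parenthesized comment given above (inside the parentheses, outside the range of the enclosed expression) can be checked mechanically. The sketch below uses CPython's `ast` and `tokenize` modules as a stand-in for Ruff's own ranges, so it is an illustration of the definition rather than the placement logic from the PR; all variable names are illustrative.

```python
import ast
import io
import tokenize

source = "if (  # comment\n    True\n):\n    ...\n"

# Range of the enclosed expression (`True`) as (line, column) pairs.
test = ast.parse(source).body[0].test
expr_start = (test.lineno, test.col_offset)
expr_end = (test.end_lineno, test.end_col_offset)

tokens = list(tokenize.generate_tokens(io.StringIO(source).readline))
comment = next(t for t in tokens if t.type == tokenize.COMMENT)
open_paren = next(t for t in tokens if t.type == tokenize.OP and t.string == "(")

# The comment is not within the expression's own range...
comment_inside_expr = expr_start <= comment.start < expr_end
# ...but it does sit between the opening parenthesis and the expression.
comment_after_open_paren = open_paren.start < comment.start < expr_start
```

Using the opening parenthesis as the tie-breaker is what lets the comment be associated with the enclosed node rather than with whatever node precedes it.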
Micha Reiser	29c0b9f91c	Use single lookup for leading, dangling, and trailing comments (#6589 )	2023-08-15 17:39:45 +02:00
Charlie Marsh	53246b725e	Allow return type annotations to use their own parentheses (#6436 ) ## Summary This PR modifies our logic for wrapping return type annotations. Previously, we _always_ wrapped the annotation in parentheses if it expanded; however, Black only exhibits this behavior when the function parameters is empty (i.e., it doesn't and can't break). In other cases, it uses the normal parenthesization rules, allowing nodes to bring their own parentheses. For example, given: ```python def xxxxxxxxxxxxxxxxxxxxxxxxxxxx() -> Set[ "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx" ]: ... def xxxxxxxxxxxxxxxxxxxxxxxxxxxx(x) -> Set[ "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx" ]: ... ``` Black will format as: ```python def xxxxxxxxxxxxxxxxxxxxxxxxxxxx() -> ( Set[ "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx" ] ): ... def xxxxxxxxxxxxxxxxxxxxxxxxxxxx( x, ) -> Set[ "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx" ]: ... ``` Whereas, prior to this PR, Ruff would format as: ```python def xxxxxxxxxxxxxxxxxxxxxxxxxxxx() -> ( Set[ "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx" ] ): ... def xxxxxxxxxxxxxxxxxxxxxxxxxxxx( x, ) -> ( Set[ "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx" ] ): ... ``` Closes https://github.com/astral-sh/ruff/issues/6431. ## Test Plan Before: - `zulip`: 0.99702 - `django`: 0.99784 - `warehouse`: 0.99585 - `build`: 0.75623 - `transformers`: 0.99470 - `cpython`: 0.75988 - `typeshed`: 0.74853 After: - `zulip`: 0.99724 - `django`: 0.99791 - `warehouse`: 0.99586 - `build`: 0.75623 - `transformers`: 0.99474 - `cpython`: 0.75956 - `typeshed`: 0.74857	2023-08-11 18:19:21 +00:00
Charlie Marsh	d616c9b870	Avoid omitting optional parentheses for argument-less parentheses (#6484 ) ## Summary This PR fixes some misformattings around optional parentheses for expressions. I first noticed that we were misformatting this: ```python return ( unicodedata.normalize("NFKC", s1).casefold() == unicodedata.normalize("NFKC", s2).casefold() ) ``` The above is stable Black formatting, but we were doing: ```python return unicodedata.normalize("NFKC", s1).casefold() == unicodedata.normalize( "NFKC", s2 ).casefold() ``` Above, the "last" expression is a function call, so our `can_omit_optional_parentheses` was returning `true`... However, it turns out that Black treats function calls differently depending on whether or not they have arguments, presumably because it will never split empty parentheses, so they offer no useful place to break. On further investigation, I believe this applies to all parenthesized expressions. If Black can't split on the parentheses, it doesn't leverage them when removing optional parentheses. ## Test Plan Nice increase in similarity scores. Before: - `zulip`: 0.99702 - `django`: 0.99784 - `warehouse`: 0.99585 - `build`: 0.75623 - `transformers`: 0.99470 - `cpython`: 0.75989 - `typeshed`: 0.74853 After: - `zulip`: 0.99705 - `django`: 0.99795 - `warehouse`: 0.99600 - `build`: 0.75623 - `transformers`: 0.99471 - `cpython`: 0.75989 - `typeshed`: 0.74853	2023-08-11 17:58:42 +00:00
Dhruv Manilawala	6a64f2289b	Rename `Magic` to `IpyEscape` (#6395 ) ## Summary This PR renames the `MagicCommand` token to `IpyEscapeCommand` token and `MagicKind` to `IpyEscapeKind` type to better reflect the purpose of the token and type. Similarly, it renames the AST nodes from `LineMagic` to `IpyEscapeCommand` prefixed with `Stmt`/`Expr` wherever necessary. It also makes renames from using `jupyter_magic` to `ipython_escape_commands` in various function names. The mode value is still `Mode::Jupyter` because the escape commands are part of the IPython syntax but the lexing/parsing is done for a Jupyter notebook. ### Motivation behind the rename: * IPython codebase defines it as "EscapeCommand" / "Escape Sequences": * Escape Sequences: `292e3a2345/IPython/core/inputtransformer2.py (L329-L333)` * Escape command: `292e3a2345/IPython/core/inputtransformer2.py (L410-L411)` * The word "magic" is used mainly for the actual magic commands i.e., the ones starting with `%`/`%%` (https://ipython.readthedocs.io/en/stable/interactive/reference.html#magic-command-system). So, this avoids any confusion between the Magic token (`%`, `%%`) and the escape command itself. ## Test Plan * `cargo test` to make sure all renames are done correctly. * `grep` for `jupyter_escape`/`magic` to make sure all renames are done correctly.	2023-08-09 13:28:18 +00:00
Charlie Marsh	3f0eea6d87	Rename `JoinedStr` to `FString` in the AST (#6379 ) ## Summary Per the proposal in https://github.com/astral-sh/ruff/discussions/6183, this PR renames the `JoinedStr` node to `FString`.	2023-08-07 17:33:17 +00:00
Zanie Blue	999d88e773	Fix formatting of chained boolean operations (#6394 ) Closes https://github.com/astral-sh/ruff/issues/6068 These commits are kind of a mess as I did some stumbling around here. Unrolls formatting of chained boolean operations to prevent nested grouping which gives us Black-compatible formatting where each boolean operation is on a new line.	2023-08-07 12:22:33 -05:00


80 commits