language-servers/ruff - Forgejo: Beyond coding. We Forge.

mirror of https://github.com/astral-sh/ruff.git synced 2025-09-29 05:15:12 +00:00

Author	SHA1	Message	Date
Charlie Marsh	256b98ab9a	Avoid if-else simplification for `TYPE_CHECKING` blocks (#8072 ) Closes https://github.com/astral-sh/ruff/issues/8071.	2023-10-19 19:15:54 -04:00
Tom Kuson	37d21c0d54	Check sequence type before triggering `unnecessary-enumerate` (`FURB148`) `len` suggestion (#7781 ) ## Summary Check that the sequence type is a list, set, dict, or tuple before recommending replacing the `enumerate(...)` call with `range(len(...))`. Document behaviour so users are aware of the type inference limitation leading to false negatives. Closes #7656.	2023-10-03 14:39:14 +00:00
Charlie Marsh	93b5d8a0fb	Implement our own small-integer optimization (#7584 ) ## Summary This is a follow-up to #7469 that attempts to achieve similar gains, but without introducing malachite. Instead, this PR removes the `BigInt` type altogether, instead opting for a simple enum that allows us to store small integers directly and only allocate for values greater than `i64`: ```rust /// A Python integer literal. Represents both small (fits in an `i64`) and large integers. #[derive(Clone, PartialEq, Eq, Hash)] pub struct Int(Number); #[derive(Debug, Clone, PartialEq, Eq, Hash)] pub enum Number { /// A "small" number that can be represented as an `i64`. Small(i64), /// A "large" number that cannot be represented as an `i64`. Big(Box<str>), } impl std::fmt::Display for Number { fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result { match self { Number::Small(value) => write!(f, "{value}"), Number::Big(value) => write!(f, "{value}"), } } } ``` We typically don't care about numbers greater than `isize` -- our only uses are comparisons against small constants (like `1`, `2`, `3`, etc.), so there's no real loss of information, except in one or two rules where we're now a little more conservative (with the worst-case being that we don't flag, e.g., an `itertools.pairwise` that uses an extremely large value for the slice start constant). For simplicity, a few diagnostics now show a dedicated message when they see integers that are out of the supported range (e.g., `outdated-version-block`). An additional benefit here is that we get to remove a few dependencies, especially `num-bigint`. ## Test Plan `cargo test`	2023-09-25 15:13:21 +00:00
Charlie Marsh	87a0cd219f	Detect `asyncio.get_running_loop` calls in RUF006 (#7562 ) ## Summary We can do a good enough job detecting this with our existing semantic model. Closes https://github.com/astral-sh/ruff/issues/3237.	2023-09-21 04:37:38 +00:00
Charlie Marsh	28b48ab902	Avoid flagging starred expressions in UP007 (#7505 ) ## Summary These can't be fixed, because fixing them would lead to invalid syntax. So flagging them also feels misleading. Closes https://github.com/astral-sh/ruff/issues/7452.	2023-09-19 03:37:38 +00:00
Jelle van der Waa	04183b0299	[pylint] Implement `too-many-public-methods` rule (PLR0904) (#6179 ) Implement https://pylint.pycqa.org/en/latest/user_guide/messages/refactor/too-many-public-methods.html Confusingly the rule page mentions a max of 7 while in practice it is 20. https://github.com/search?q=repo%3Apylint-dev%2Fpylint+max-public-methods&type=code ## Summary Implement pylint's R0904 ## Test Plan Unit tests.	2023-09-14 00:52:26 +00:00
qdegraaf	f3aaf84a28	Move `refurb/helpers` utils to `ruff_python_semantic` for broader use (#6990 ) ## Summary The utils added for `refurb` in its `helpers.rs` file could be useful for many other plugins. (Such as the PERF4XX codes, see e.g. https://github.com/astral-sh/ruff/pull/6132 ). This PR moves them to `ruff_python_semantic::analyzers::typing` as suggested in https://github.com/astral-sh/ruff/pull/6132#issuecomment-1697910093 ## Test Plan Confirmed `refurb` and all other tests still work	2023-08-29 14:45:09 -04:00
Jelle van der Waa	af61abc747	Re-use is_magic where possible (#6945 ) ## Summary Use `is_magic` where possible ## Test Plan Unit tests	2023-08-28 15:35:34 +00:00
Zanie Blue	417a1d0717	Update `mutable-argument-default` (`B006`) to use `extend-immutable-calls` when determining if annotations are immutable (#6781 ) Part of https://github.com/astral-sh/ruff/issues/3762	2023-08-23 15:44:35 +00:00
Charlie Marsh	17af12e57c	Add branch detection to the semantic model (#6694 ) ## Summary We have a few rules that rely on detecting whether two statements are in different branches -- for example, different arms of an `if`-`else`. Historically, the way this was implemented is that, given two statement IDs, we'd find the common parent (by traversing upwards via our `Statements` abstraction); then identify branches "manually" by matching the parents against `try`, `if`, and `match`, and returning iterators over the arms; then check if there's an arm for which one of the statements is a child, and the other is not. This has a few drawbacks: 1. First, the code is generally a bit hard to follow (Konsti mentioned this too when working on the `ElifElseClause` refactor). 2. Second, this is the only place in the codebase where we need to go from `&Stmt` to `StatementID` -- _everywhere_ else, we only need to go in the _other_ direction. Supporting these lookups means we need to maintain a mapping from `&Stmt` to `StatementID` that includes every `&Stmt` in the program. (We _also_ end up maintaining a `depth` level for every statement.) I'd like to get rid of these requirements to improve efficiency, reduce complexity, and enable us to treat AST modes more generically in the future. (When I looked at adding the `&Expr` to our existing statement-tracking infrastructure, maintaining a hash map with all the statements noticeably hurt performance.) The solution implemented here instead makes branches a first-class concept in the semantic model. Like with `Statements`, we now have a `Branches` abstraction, where each branch points to its optional parent. When we store statements, we store the `BranchID` alongside each statement. When we need to detect whether two statements are in the same branch, we just realize each statement's branch path and compare the two. (Assuming that the two statements are in the same scope, then they're on the same branch IFF one branch path is a subset of the other, starting from the top.) We then add some calls to the visitor to push and pop branches in the appropriate places, for `if`, `try`, and `match` statements. Note that a branch is not 1:1 with a statement; instead, each branch is closer to a suite, but not _every_ suite is a branch. For example, each arm in an `if`-`elif`-`else` is a branch, but the `else` in a `for` loop is not considered a branch. In addition to being much simpler, this should also be more efficient, since we've shed the entire `&Stmt` hash map, plus the `depth` that we track on `StatementWithParent` in favor of a single `Option<BranchID>` on `StatementWithParent` plus a single vector for all branches. The lookups should be faster too, since instead of doing a bunch of jumps around with the hash map + repeated recursive calls to find the common parents, we instead just do a few simple lookups in the `Branches` vector to realize and compare the branch paths. ## Test Plan `cargo test` -- we have a lot of coverage for this, which we inherited from PyFlakes	2023-08-19 21:28:17 +00:00
Charlie Marsh	96d310fbab	Remove `Stmt::TryStar` (#6566 ) ## Summary Instead, we set an `is_star` flag on `Stmt::Try`. This is similar to the pattern we've migrated towards for `Stmt::For` (removing `Stmt::AsyncFor`) and friends. While these are significant differences for an interpreter, we tend to handle these cases identically or nearly identically. ## Test Plan `cargo test`	2023-08-14 13:39:44 -04:00
Charlie Marsh	768686148f	Add support for unions to our Python builtins type system (#6541 ) ## Summary Fixes some TODOs introduced in https://github.com/astral-sh/ruff/pull/6538. In short, given an expression like `1 if x > 0 else "Hello, world!"`, we now return a union type that says the expression can resolve to either an `int` or a `str`. The system remains very limited, it only works for obvious primitive types, and there's no attempt to do inference on any more complex variables. (If any expression yields `Unknown` or `TypeError`, we propagate that result throughout and abort on the client's end.)	2023-08-13 18:00:50 -04:00
Charlie Marsh	446ceed1ad	Support `IfExp` with dual string arms in `invalid-envvar-value` (#6538 ) ## Summary Closes https://github.com/astral-sh/ruff/issues/6537. We need to improve the `PythonType` algorithm, so this also documents some of its limitations as TODOs.	2023-08-13 15:52:10 -04:00
Charlie Marsh	3f0eea6d87	Rename `JoinedStr` to `FString` in the AST (#6379 ) ## Summary Per the proposal in https://github.com/astral-sh/ruff/discussions/6183, this PR renames the `JoinedStr` node to `FString`.	2023-08-07 17:33:17 +00:00
Charlie Marsh	c439435615	Use dedicated AST nodes on `MemberKind` (#6374 ) ## Summary This PR leverages the unified function definition node to add precise AST node types to `MemberKind`, which is used to power our docstring definition tracking (e.g., classes and functions, whether they're methods or functions or nested functions and so on, whether they have a docstring, etc.). It was painful to do this in the past because the function variants needed to support a union anyway, but storing precise nodes removes like a dozen panics. No behavior changes -- purely a refactor. ## Test Plan `cargo test`	2023-08-07 17:17:58 +00:00
Charlie Marsh	daefa74e9a	Remove async AST node variants for `with`, `for`, and `def` (#6369 ) ## Summary Per the suggestion in https://github.com/astral-sh/ruff/discussions/6183, this PR removes `AsyncWith`, `AsyncFor`, and `AsyncFunctionDef`, replacing them with an `is_async` field on the non-async variants of those structs. Unlike an interpreter, we _generally_ have identical handling for these nodes, so separating them into distinct variants adds complexity from which we don't really benefit. This can be seen below, where we get to remove a _ton_ of code related to adding generic `Any*` wrappers, and a ton of duplicate branches for these cases. ## Test Plan `cargo test` is unchanged, apart from parser snapshots.	2023-08-07 16:36:02 +00:00
Charlie Marsh	b21abe0a57	Use separate structs for expression and statement tracking (#6351 ) ## Summary This PR fixes the performance degradation introduced in https://github.com/astral-sh/ruff/pull/6345. Instead of using the generic `Nodes` structs, we now use separate `Statement` and `Expression` structs. Importantly, we can avoid tracking a bunch of state for expressions that we need for parents: we don't need to track reference-to-ID pointers (we just have no use-case for this -- I'd actually like to remove this from statements too, but we need it for branch detection right now), we don't need to track depth, etc. In my testing, this entirely removes the regression on all-rules, and gets us down to 2ms slower on the default rules (as a crude hyperfine benchmark, so this is within margin of error IMO). No behavioral changes.	2023-08-07 15:27:42 +00:00
Zixuan Li	be657f5e7e	Respect typing_extensions imports of Annotated for B006. (#6361 ) `typing_extensions.Annotated` should be treated the same way as `typing.Annotated`.	2023-08-05 17:39:52 +00:00
Charlie Marsh	d788957ec4	Allow capitalized names for logger candidate heuristic match (#6356 ) Closes https://github.com/astral-sh/ruff/issues/6353.	2023-08-04 23:25:34 +00:00
Charlie Marsh	8a5bc93fdd	Make the `Nodes` vector generic on node type (#6328 )	2023-08-04 03:57:15 +00:00
Charlie Marsh	2fa508793f	Return a slice in `StmtClassDef#bases` (#6311 ) Slices are strictly more flexible, since you can always convert to an iterator, etc., but not the other way around. Suggested in https://github.com/astral-sh/ruff/pull/6259#discussion_r1282730994.	2023-08-03 16:21:55 +00:00
Charlie Marsh	9f3567dea6	Use `range: _` in lieu of `range: _range` (#6296 ) ## Summary `range: _range` is slightly inconvenient because you can't use it multiple times within a single match, unlike `_`.	2023-08-02 22:11:13 -04:00
Charlie Marsh	041946fb64	Remove `CallArguments` abstraction (#6279 ) ## Summary This PR removes a now-unnecessary abstraction from `helper.rs` (`CallArguments`), in favor of adding methods to `Arguments` directly, which helps with discoverability.	2023-08-02 13:25:43 -04:00
Charlie Marsh	981e64f82b	Introduce an `Arguments` AST node for function calls and class definitions (#6259 ) ## Summary This PR adds a new `Arguments` AST node, which we can use for function calls and class definitions. The `Arguments` node spans from the left (open) to right (close) parentheses inclusive. In the case of classes, the `Arguments` is an option, to differentiate between: ```python # None class C: ... # Some, with empty vectors class C(): ... ``` In this PR, we don't really leverage this change (except that a few rules get much simpler, since we don't need to lex to find the start and end ranges of the parentheses, e.g., `crates/ruff/src/rules/pyupgrade/rules/lru_cache_without_parameters.rs`, `crates/ruff/src/rules/pyupgrade/rules/unnecessary_class_parentheses.rs`). In future PRs, this will be especially helpful for the formatter, since we can track comments enclosed on the node itself. ## Test Plan `cargo test`	2023-08-02 10:01:13 -04:00
konsti	1df7e9831b	Replace `.map_or(false, $closure)` with `.is_some_and(closure)` (#6244 ) Summary [Option::is_some_and](https://doc.rust-lang.org/stable/std/option/enum.Option.html#method.is_some_and) and [Result::is_ok_and](https://doc.rust-lang.org/std/result/enum.Result.html#method.is_ok_and) are new methods is rust 1.70. I find them way more readable than `.map_or(false, ...)`. The changes are `s/.map_or(false,/.is_some_and(/g`, then manually switching to `is_ok_and` where the value is a Result rather than an Option. Test Plan n/a^	2023-08-01 19:29:42 +02:00
Micha Reiser	40f54375cb	Pull in RustPython parser (#6099 )	2023-07-27 09:29:11 +00:00
Micha Reiser	2cf00fee96	Remove parser dependency from ruff-python-ast (#6096 )	2023-07-26 17:47:22 +02:00
Charlie Marsh	f9726af4ef	Allow specification of `logging.Logger` re-exports via `logger-objects` (#5750 ) ## Summary This PR adds a `logger-objects` setting that allows users to mark specific symbols a `logging.Logger` objects. Currently, if a `logger` is imported, we only flagged it as a `logging.Logger` if it comes exactly from the `logging` module or is `flask.current_app.logger`. This PR allows users to mark specific loggers, like `logging_setup.logger`, to ensure that they're covered by the `flake8-logging-format` rules and others. For example, if you have a module `logging_setup.py` with the following contents: ```python import logging logger = logging.getLogger(__name__) ``` Adding `"logging_setup.logger"` to `logger-objects` will ensure that `logging_setup.logger` is treated as a `logging.Logger` object when imported from other modules (e.g., `from logging_setup import logger`). Closes https://github.com/astral-sh/ruff/issues/5694.	2023-07-24 00:38:20 -04:00
Charlie Marsh	626d8dc2cc	Use `.as_ref()` in lieu of `&**` (#5874 ) I find this less opaque (and often more succinct).	2023-07-19 00:49:13 +00:00
konsti	730e6b2b4c	Refactor `StmtIf`: Formatter and Linter (#5459 ) ## Summary Previously, `StmtIf` was defined recursively as ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub orelse: Vec<Stmt>, } ``` Every `elif` was represented as an `orelse` with a single `StmtIf`. This means that this representation couldn't differentiate between ```python if cond1: x = 1 else: if cond2: x = 2 ``` and ```python if cond1: x = 1 elif cond2: x = 2 ``` It also makes many checks harder than they need to be because we have to recurse just to iterate over an entire if-elif-else and because we're lacking nodes and ranges on the `elif` and `else` branches. We change the representation to a flat ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub elif_else_clauses: Vec<ElifElseClause>, } pub struct ElifElseClause { pub range: TextRange, pub test: Option<Expr>, pub body: Vec<Stmt>, } ``` where `test: Some(_)` represents an `elif` and `test: None` an else. This representation is different tradeoff, e.g. we need to allocate the `Vec<ElifElseClause>`, the `elif`s are now different than the `if`s (which matters in rules where want to check both `if`s and `elif`s) and the type system doesn't guarantee that the `test: None` else is actually last. We're also now a bit more inconsistent since all other `else`, those from `for`, `while` and `try`, still don't have nodes. With the new representation some things became easier, e.g. finding the `elif` token (we can use the start of the `ElifElseClause`) and formatting comments for if-elif-else (no more dangling comments splitting, we only have to insert the dangling comment after the colon manually and set `leading_alternate_branch_comments`, everything else is taken of by having nodes for each branch and the usual placement.rs fixups). ## Merge Plan This PR requires coordination between the parser repo and the main ruff repo. I've split the ruff part, into two stacked PRs which have to be merged together (only the second one fixes all tests), the first for the formatter to be reviewed by @michareiser and the second for the linter to be reviewed by @charliermarsh. * MH: Review and merge https://github.com/astral-sh/RustPython-Parser/pull/20 * MH: Review and merge or move later in stack https://github.com/astral-sh/RustPython-Parser/pull/21 * MH: Review and approve https://github.com/astral-sh/RustPython-Parser/pull/22 * MH: Review and approve formatter PR https://github.com/astral-sh/ruff/pull/5459 * CM: Review and approve linter PR https://github.com/astral-sh/ruff/pull/5460 * Merge linter PR in formatter PR, fix ecosystem checks (ecosystem checks can't run on the formatter PR and won't run on the linter PR, so we need to merge them first) * Merge https://github.com/astral-sh/RustPython-Parser/pull/22 * Create tag in the parser, update linter+formatter PR * Merge linter+formatter PR https://github.com/astral-sh/ruff/pull/5459 --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-18 13:40:15 +02:00
David Szotten	52aa2fc875	upgrade rustpython to remove tuple-constants (#5840 ) c.f. https://github.com/astral-sh/RustPython-Parser/pull/28 Tests: No snapshots changed --------- Co-authored-by: Zanie <contact@zanie.dev>	2023-07-17 22:50:31 +00:00
Charlie Marsh	3dc73395ea	Move `Literal` flag detection into recurse phase (#5768 ) ## Summary The AST pass is broken up into three phases: pre-visit (which includes analysis), recurse (visit all members), and post-visit (clean-up). We're not supposed to edit semantic model flags in the pre-visit phase, but it looks like we were for literal detection. This didn't matter in practice, but I'm looking into some AST refactors for which this _does_ cause issues. No behavior changes expected. ## Test Plan Good test coverage on these.	2023-07-15 02:04:15 +00:00
Charlie Marsh	932c9a4789	Extend PEP 604 rewrites to support some quoted annotations (#5725 ) ## Summary Python doesn't allow `"Foo" \| None` if the annotation will be evaluated at runtime (see the comments in the PR, or the semantic model documentation for more on what this means and when it is true), but it _does_ allow it if the annotation is typing-only. This, for example, is invalid, as Python will evaluate `"Foo" \| None` at runtime in order to populate the function's `__annotations__`: ```python def f(x: "Foo" \| None): ... ``` This, however, is valid: ```python def f(): x: "Foo" \| None ``` As is this: ```python from __future__ import annotations def f(x: "Foo" \| None): ... ``` Closes #5706.	2023-07-13 07:34:04 -04:00
Charlie Marsh	00fbbe4223	Remove some additional manual iterator matches (#5482 ) ## Summary I've done a few of these PRs, I thought I'd caught them all, but missed this pattern.	2023-07-03 16:29:59 +00:00
Anders Kaseorg	df13e69c3c	Format let-else with rustfmt nightly (#5461 ) Support for `let…else` formatting was just merged to nightly (rust-lang/rust#113225). Rerun `cargo fmt` with Rust nightly 2023-07-02 to pick this up. Followup to #939. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-07-03 02:13:35 +00:00
Charlie Marsh	cdbd0bd5cd	Respect `abc` decorators when classifying function types (#5315 ) Closes #5307.	2023-06-22 19:52:36 +00:00
Charlie Marsh	36e01ad6eb	Upgrade RustPython (#5192 ) ## Summary This PR upgrade RustPython to pull in the changes to `Arguments` (zip defaults with their identifiers) and all the renames to `CmpOp` and friends.	2023-06-19 21:09:53 +00:00
Charlie Marsh	d0ad1ed0af	Replace static `CallPath` vectors with `matches!` macros (#5148 ) ## Summary After #5140, I audited the codebase for similar patterns (defining a list of `CallPath` entities in a static vector, then looping over them to pattern-match). This PR migrates all other such cases to use `match` and `matches!` where possible. There are a few benefits to this: 1. It more clearly denotes the intended semantics (branches are exclusive). 2. The compiler can help deduplicate the patterns and detect unreachable branches. 3. Performance: in the benchmark below, the all-rules performance is increased by nearly 10%... ## Benchmarks I decided to benchmark against a large file in the Airflow repository with a lot of type annotations ([`views.py`](https://raw.githubusercontent.com/apache/airflow/f03f73100e8a7d6019249889de567cb00e71e457/airflow/www/views.py)): ``` linter/default-rules/airflow/views.py time: [10.871 ms 10.882 ms 10.894 ms] thrpt: [19.739 MiB/s 19.761 MiB/s 19.781 MiB/s] change: time: [-2.7182% -2.5687% -2.4204%] (p = 0.00 < 0.05) thrpt: [+2.4805% +2.6364% +2.7942%] Performance has improved. linter/all-rules/airflow/views.py time: [24.021 ms 24.038 ms 24.062 ms] thrpt: [8.9373 MiB/s 8.9461 MiB/s 8.9527 MiB/s] change: time: [-8.9537% -8.8516% -8.7527%] (p = 0.00 < 0.05) thrpt: [+9.5923% +9.7112% +9.8342%] Performance has improved. Found 12 outliers among 100 measurements (12.00%) 5 (5.00%) high mild 7 (7.00%) high severe ``` The impact is dramatic -- nearly a 10% improvement for `all-rules`.	2023-06-16 17:34:42 +00:00
Charlie Marsh	5526699535	Use const-singleton helpers in more rules (#5142 )	2023-06-16 04:28:35 +00:00
Charlie Marsh	56476dfd61	Use `matches!` for `CallPath` comparisons (#5099 ) ## Summary This PR consistently uses `matches! for static `CallPath` comparisons. In some cases, we can significantly reduce the number of cases or checks. ## Test Plan `cargo test `	2023-06-14 17:06:34 -04:00
Charlie Marsh	bae183b823	Rename `semantic_model` and `model` usages to `semantic` (#5097 ) ## Summary As discussed in Discord, and similar to oxc, we're going to refer to this as `.semantic()` everywhere. While I was auditing usages of `model: &SemanticModel`, I also changed as many function signatures as I could find to consistently take the model as the _last_ argument, rather than the first.	2023-06-14 15:01:51 -04:00
Micha Reiser	39a1f3980f	Upgrade RustPython (#4900 )	2023-06-08 05:53:14 +00:00
Charlie Marsh	d31eb87877	Extract shared simple AST node inference utility (#4871 )	2023-06-05 18:23:37 +00:00
Charlie Marsh	211d8e170d	Ignore error calls with `exc_info` in TRY400 (#4797 )	2023-06-02 04:59:45 +00:00
qdegraaf	fcbf5c3fae	Add PYI034 for `flake8-pyi` plugin (#4764 )	2023-06-02 02:15:57 +00:00
Jonathan Plasse	edadd7814f	Add `pyflakes.extend-generics` setting (#4677 )	2023-06-01 22:19:37 +00:00
Charlie Marsh	ea31229be0	Track `TYPE_CHECKING` blocks in `Importer` (#4593 )	2023-05-30 16:18:10 +00:00
Aarni Koskela	0106bce02f	[`flake8-future-annotations`] Implement `FA102` (#4702 )	2023-05-29 22:41:45 +00:00
qdegraaf	ccca11839a	Allow more immutable funcs for RUF009 (#4660 )	2023-05-26 15:18:52 -04:00
Charlie Marsh	d70f899f71	Use `SemanticModel` in lieu of `Checker` in more methods (#4568 )	2023-05-22 02:58:47 +00:00

1 2

65 commits