language-servers/ruff - Forgejo: Beyond coding. We Forge.

mirror of https://github.com/astral-sh/ruff.git synced 2025-10-01 22:31:47 +00:00

Author	SHA1	Message	Date
Alex Waygood	3aed14935d	[red-knot] Add support for `@final` classes (#15070 ) Co-authored-by: Carl Meyer <carl@astral.sh>	2024-12-19 21:02:14 +00:00
Alex Waygood	bcec5e615b	[red-knot] Rename and rework the `CoreStdlibModule` enum (#15071 )	2024-12-19 20:59:00 +00:00
Alex Waygood	a06099dffe	[red-knot] Move attribute access on `ModuleLiteral` types into a dedicated method (#15067 )	2024-12-19 16:02:16 +00:00
Alex Waygood	bb43085939	[red-knot] Reduce TODOs in `Type::member()` (#15066 )	2024-12-19 15:54:01 +00:00
Alex Waygood	40cba5dc8a	[red-knot] Cleanup various `todo_type!()` messages (#15063 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2024-12-19 13:03:41 +00:00
Douglas Creager	2802cbde29	Don't special-case class instances in unary expression inference (#15045 ) We have a handy `to_meta_type` that does the right thing for class instances, and also works for all of the other types that are “instances of” something. Unless I'm missing something, this should let us get rid of the catch-all clause in one fell swoop. cf #14548	2024-12-18 14:37:17 -05:00
InSync	ed2bce6ebb	[red-knot] Report invalid exceptions (#15042 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-12-18 18:31:24 +00:00
Micha Reiser	0fc4e8f795	Introduce `InferContext` (#14956 ) ## Summary I'm currently on the fence about landing the #14760 PR because it's unclear how we'd support tracking used and unused suppression comments in a performant way: * Salsa adds an "untracked" dependency to every query reading accumulated values. This has the effect that the query re-runs on every revision. For example, a possible future query `unused_suppression_comments(db, file)` would re-run on every incremental change and for every file. I don't expect the operation itself to be expensive, but it all adds up in a project with 100k+ files * Salsa collects the accumulated values by traversing the entire query dependency graph. It can skip over sub-graphs if it is known that they contain no accumulated values. This makes accumulators a great tool for when they are rare; diagnostics are a good example. Unfortunately, suppressions are more common, and they often appear in many different files, making the "skip over subgraphs" optimization less effective. Because of that, I want to wait to adopt salsa accumulators for type check diagnostics (we could start using them for other diagnostics) until we have very specific reasons that justify regressing incremental check performance. This PR does a "small" refactor that brings us closer to what I have in #14760 but without using accumulators. To emit a diagnostic, a method needs: * Access to the db * Access to the currently checked file This PR introduces a new `InferContext` that holds on to the db, the current file, and the reported diagnostics. It replaces the `TypeCheckDiagnosticsBuilder`. We pass the `InferContext` instead of the `db` to methods that might emit diagnostics. This simplifies some of the `Outcome` methods, which can now be called with a context instead of a `db` and the diagnostics builder. Having the `db` and the file on a single type like this would also be useful when using accumulators. 
This PR doesn't solve the issue that the `Outcome` types feel somewhat complicated, nor that it can be annoying when you need to report a `Diagnostic`, but you don't have access to an `InferContext` (or the file). However, I also believe that accumulators won't solve these problems because: * Even with accumulators, it's necessary to have a reference to the file that's being checked. The struggle would be to get a reference to that file rather than getting a reference to `InferContext`. * Users of the `HasTy` trait (e.g., a linter) don't want to bother getting the `File` when calling `Type::return_ty` because they aren't interested in the created diagnostics. They just want to know what calling the current expression would return (and if it even is a callable). This is what the different methods of `Outcome` enable today. I can ask for the return type without needing extra data that's only relevant for emitting a diagnostic. A shortcoming of this approach is that it is now a bit confusing when to pass `db` and when an `InferContext`. An option is that we'd make the `file` on `InferContext` optional (it won't collect any diagnostics if `None`) and change all methods on `Type` to take `InferContext` as the first argument instead of a `db`. I'm interested in your opinion on this. Accumulators are definitely harder to use incorrectly because they remove the need to merge the diagnostics explicitly and there's no risk that we accidentally merge the diagnostics twice, resulting in duplicated diagnostics. I still value performance over making our life slightly easier.	2024-12-18 12:22:33 +00:00
Douglas Creager	e8e461da6a	Prioritize attribute in from/import statement (#15041 ) This tweaks the new semantics from #15026 a bit when a symbol could be interpreted both as an attribute and a submodule of a package. For `from...import`, we should actually prioritize the attribute, because of how the statement itself is implemented [1]. > 1. check if the imported module has an attribute by that name > 2. if not, attempt to import a submodule with that name and then check the imported module again for that attribute [1] https://docs.python.org/3/reference/simple_stmts.html#the-import-statement	2024-12-17 16:58:23 -05:00
Douglas Creager	91c9168dd7	Handle nested imports correctly in `from ... import` (#15026 ) #14946 fixed our handling of nested imports with the `import` statement, but didn't touch `from...import` statements. cf https://github.com/astral-sh/ruff/issues/14826#issuecomment-2525344515	2024-12-17 14:23:34 -05:00
cake-monotone	f463fa7b7c	[red-knot] Narrowing For Truthiness Checks (`if x` or `if not x`) (#14687 ) ## Summary Fixes #14550. Add `AlwaysTruthy` and `AlwaysFalsy` types, representing the set of objects whose `__bool__` method can only ever return `True` or `False`, respectively, and narrow `if x` and `if not x` accordingly. ## Test Plan - New Markdown test for truthiness narrowing `narrow/truthiness.md` - unit tests in `types.rs` and `builders.rs` (`cargo test --package red_knot_python_semantic --lib -- types`)	2024-12-17 08:37:07 -08:00
Micha Reiser	c3b6139f39	Upgrade salsa (#15039 ) The only code change is that Salsa now requires the `Db` to implement `Clone` to create "lightweight" snapshots.	2024-12-17 15:50:33 +00:00
Alex Waygood	463046ae07	[red-knot] Explicitly test diagnostics are emitted for unresolvable submodule imports (#15035 )	2024-12-17 12:55:50 +00:00
Micha Reiser	dcb99cc817	Fix stale File status in tests (#15030 ) ## Summary Fixes https://github.com/astral-sh/ruff/issues/15027 The `MemoryFileSystem::write_file` API automatically creates non-existing ancestor directories but we failed to update the status of the now created ancestor directories in the `Files` data structure. ## Test Plan Tested that the case in https://github.com/astral-sh/ruff/issues/15027 now passes regardless of whether the Simple case is commented out or not	2024-12-17 12:45:36 +01:00
InSync	7c2e7cf25e	[red-knot] Basic support for other legacy `typing` aliases (#14998 ) ## Summary Resolves #14997. ## Test Plan Markdown tests.	2024-12-17 09:33:15 +00:00
Dhruv Manilawala	dcdc6e7c64	[red-knot] Avoid undeclared path when raising conflicting declarations (#14958 ) ## Summary This PR updates the logic when raising the conflicting declarations diagnostic to avoid the undeclared path if present. The conflicting declarations diagnostic is added when there are two or more declarations in the control flow path of a definition whose types aren't equivalent to each other. This can be seen in the following example: ```py if flag: x: int x = 1 # conflicting-declarations: Unknown, int ``` After this PR, we'd avoid considering "Unknown" as part of the conflicting declarations. This means we'd still flag it for the following case: ```py if flag: x: int else: x: str x = 1 # conflicting-declarations: int, str ``` A solution that's local to the exception control flow was also explored which required updating the logic for merging the flow snapshot to avoid considering declarations using a flag. This is preserved here: https://github.com/astral-sh/ruff/compare/dhruv/control-flow-no-declarations?expand=1. The main motivation to avoid that is we don't really understand what the user experience is w.r.t. the Unknown type and the conflicting-declaration diagnostics. This makes us unsure about what the right semantics are, i.e. whether the diagnostic should be raised at all and, if so, when. For now, we've decided to move forward with this PR and could decide to adopt another solution or remove the conflicting-declaration diagnostics in the future. Closes: #13966 ## Test Plan Update the existing mdtest case. Add an additional case specific to exception control flow to verify that the diagnostic is not being raised now.	2024-12-17 09:49:39 +05:30
Douglas Creager	4ddf9228f6	Bind top-most parent when importing nested module (#14946 ) When importing a nested module, we were correctly creating a binding for the top-most parent, but we were binding that to the nested module, not to that parent module. Moreover, we weren't treating those submodules as members of their containing parents. This PR addresses both issues, so that nested imports work as expected. As discussed in ~Slack~ whatever chat app I find myself in these days 😄, this requires keeping track of which modules have been imported within the current file, so that when we resolve member access on a module reference, we can see if that member has been imported as a submodule. If so, we return the submodule reference immediately, instead of checking whether the parent module's definition defines the symbol. This is currently done in a flow insensitive manner. The `SemanticIndex` now tracks all of the modules that are imported (via `import`, not via `from...import`). The member access logic mentioned above currently only considers module imports in the file containing the attribute expression. --------- Co-authored-by: Carl Meyer <carl@astral.sh>	2024-12-16 16:15:40 -05:00
Alex Waygood	1389cb8e59	[red-knot] Emit an error if a bare `Annotated` or `Literal` is used in a type expression (#14973 )	2024-12-15 02:00:52 +00:00
Alex Waygood	fa46ba2306	[red-knot] Fix bugs relating to assignability of dynamic `type[]` types (#14972 )	2024-12-15 01:15:10 +00:00
github-actions[bot]	53c7ef8bfe	Sync vendored typeshed stubs (#14977 ) Co-authored-by: typeshedbot <> Co-authored-by: Alex Waygood <alex.waygood@gmail.com>	2024-12-15 01:02:41 +00:00
Alex Waygood	4d64cdb83c	[red-knot] `ClassLiteral(<T>)` is not a disjoint type from `Instance(<metaclass of T>)` (#14970 ) ## Summary A class is an instance of its metaclass, so `ClassLiteral("ABC")` is not disjoint from `Instance("ABCMeta")`. However, we erroneously consider the two types disjoint on the `main` branch. This PR fixes that. This bug was uncovered by adding some more core types to the property tests that provide coverage for classes that have custom metaclasses. The additions to the property tests are included in this PR. ## Test Plan New unit tests and property tests added. Tested with: - `cargo test -p red_knot_python_semantic` - `QUICKCHECK_TESTS=100000 cargo test -p red_knot_python_semantic -- --ignored types::property_tests::stable` The assignability property test fails on this branch, but that's a known issue that exists on `main`, due to https://github.com/astral-sh/ruff/issues/14899.	2024-12-14 11:28:09 -08:00
Carl Meyer	ac31b26a0e	[red-knot] type[] is disjoint from None, LiteralString (#14967 ) ## Summary Teach red-knot that `type[...]` is always disjoint from `None` and from `LiteralString`. Fixes #14925. This should properly be generalized to "all instances of final types which are not subclasses of `type`", but until we support finality, hardcoding `None` (which is known to be final) allows us to fix the subtype transitivity property test. ## Test Plan Existing tests pass, added new unit tests for `is_disjoint_from` and `is_subtype_of`. `QUICKCHECK_TESTS=100000 cargo test -p red_knot_python_semantic -- --ignored types::property_tests::stable` fails only the "assignability is reflexive" test, which is known to fail on `main` (#14899). The same command, with `property_tests.rs` edited to prevent generating intersection tests (the cause of #14899), passes all quickcheck tests.	2024-12-14 11:02:49 +01:00
Alex Waygood	224c8438bd	[red-knot] Minor simplifications to `types.rs` (#14962 )	2024-12-13 20:31:51 +00:00
Alex Waygood	90a5439791	[red-knot] Use `type[Unknown]` rather than `Unknown` as the fallback metaclass for invalid classes (#14961 )	2024-12-13 19:48:51 +00:00
Alex Waygood	4b2b126b9f	[red-knot] Make `is_subtype_of` exhaustive (#14924 )	2024-12-13 19:31:22 +00:00
InSync	9798556eb5	[red-knot] Alphabetize rules (#14960 ) ## Summary Follow-up from #14950. ## Test Plan Purely stylistic change. Shouldn't affect any functionality.	2024-12-13 10:39:18 -08:00
InSync	aa1938f6ba	[red-knot] Understand `Annotated` (#14950 ) ## Summary Resolves #14922. ## Test Plan Markdown tests. --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com> Co-authored-by: Carl Meyer <carl@astral.sh>	2024-12-13 09:41:37 -08:00
Dhruv Manilawala	3533d7f5b4	[red-knot] Display definition range in trace logs (#14955 ) I've mainly opened this PR to get some opinions. I've found having some additional information in the tracing logs to be useful to determine what we are currently inferring. For the `Definition` ingredient, the range seems to be much useful. I thought of using the identifier name but we would have to deconstruct the `Expr` to find out the identifier which seems a lot for just trace logs. Additionally, multiple identifiers _could_ have the same name where range would be useful. The ranges are isolated to the names that have been defined by the definition except for the `except` block where the entire range is being used because the name is optional. *Before:* ``` 3 ├─ 0.074671s 54ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(3), kind: WillExecute { database_key: infer_definition_types(Id(1402)) } } 3 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(1402), file=/Users/dhruv/playground/ruff/type_inference/isolated3/play.py} 3 ┌─┘ 3 ├─ 0.074768s 54ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(3), kind: WillExecute { database_key: inner_fn_name_(Id(2800)) } } 3 ├─ 0.074807s 54ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(3), kind: WillExecute { database_key: infer_deferred_types(Id(1735)) } } 3 └─┐red_knot_python_semantic::types::infer::infer_deferred_types{definition=Id(1735), file=vendored://stdlib/typing.pyi} 3 ├─ 0.074842s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(3), kind: WillExecute { database_key: infer_definition_types(Id(14f3)) } } 3 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(14f3), file=vendored://stdlib/typing.pyi} 3 ├─ 0.074871s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(3), kind: WillExecute { database_key: infer_expression_types(Id(1820)) } } 3 
└─┐red_knot_python_semantic::types::infer::infer_expression_types{expression=Id(1820), file=vendored://stdlib/typing.pyi} 3 ├─ 0.074924s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(3), kind: WillExecute { database_key: infer_definition_types(Id(1429)) } } 3 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(1429), file=vendored://stdlib/typing.pyi} 3 ├─ 0.074958s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(3), kind: WillExecute { database_key: infer_definition_types(Id(1428)) } } 3 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(1428), file=vendored://stdlib/typing.pyi} 3 ┌─┘ ``` *After:* ``` 12 ├─ 0.074609s 55ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(12), kind: WillExecute { database_key: infer_definition_types(Id(1402)) } } 12 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(1402), range=36..37, file=/Users/dhruv/playground/ruff/type_inference/isolated3/play.py} 12 ┌─┘ 12 ├─ 0.074705s 55ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(12), kind: WillExecute { database_key: inner_fn_name_(Id(2800)) } } 12 ├─ 0.074742s 55ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(12), kind: WillExecute { database_key: infer_deferred_types(Id(1735)) } } 12 └─┐red_knot_python_semantic::types::infer::infer_deferred_types{definition=Id(1735), range=30225..30236, file=vendored://stdlib/typing.pyi} 12 ├─ 0.074775s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(12), kind: WillExecute { database_key: infer_definition_types(Id(14f3)) } } 12 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(14f3), range=9472..9474, file=vendored://stdlib/typing.pyi} 12 ├─ 0.074803s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(12), kind: WillExecute { database_key: 
infer_expression_types(Id(1820)) } } 12 └─┐red_knot_python_semantic::types::infer::infer_expression_types{expression=Id(1820), range=9477..9490, file=vendored://stdlib/typing.pyi} 12 ├─ 0.074855s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(12), kind: WillExecute { database_key: infer_definition_types(Id(1429)) } } 12 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(1429), range=3139..3146, file=vendored://stdlib/typing.pyi} 12 ├─ 0.074892s 0ms TRACE red_knot_workspace::db Salsa event: Event { thread_id: ThreadId(12), kind: WillExecute { database_key: infer_definition_types(Id(1428)) } } 12 └─┐red_knot_python_semantic::types::infer::infer_definition_types{definition=Id(1428), range=3102..3107, file=vendored://stdlib/typing.pyi} 12 ┌─┘ ```	2024-12-13 14:29:53 +00:00
Alex Waygood	0bbe166720	[red-knot] Move the `ClassBase` enum to its own submodule (#14957 )	2024-12-13 13:12:39 +00:00
David Peter	c3a64b44b7	[red-knot] mdtest: python version requirements (#14954 ) ## Summary This is not strictly required yet, but makes these tests future-proof. They need a `python-version` requirement as they rely on language features that are not available in 3.9.	2024-12-13 10:40:38 +01:00
David Peter	e96b13c027	[red-knot] Support `typing.TYPE_CHECKING` (#14952 ) ## Summary Add support for `typing.TYPE_CHECKING` and `typing_extensions.TYPE_CHECKING`. relates to: https://github.com/astral-sh/ruff/issues/14170 ## Test Plan New Markdown-based tests	2024-12-13 09:24:48 +00:00
Micha Reiser	f52b1f4a4d	Add tracing support to mdtest (#14935 ) ## Summary This PR extends the mdtest configuration with a `log` setting that can be any of: * `true`: Enables tracing * `false`: Disables tracing (default) * String: An ENV_FILTER similar to `RED_KNOT_LOG` ```toml log = true ``` Closes https://github.com/astral-sh/ruff/issues/13865 ## Test Plan I changed a test and tried `log=true`, `log=false`, and `log=INFO`	2024-12-13 09:10:01 +00:00
David Peter	2ccc9b19a7	[red-knot] Improve `match` mdtests (#14951 ) ## Summary Minor improvement for the `match` tests to make sure we can't infer statically whether or not a certain `case` applies.	2024-12-13 09:50:17 +01:00
Micha Reiser	c1837e4189	Rename `custom-typeshed-dir`, `target-version` and `current-directory` CLI options (#14930 ) ## Summary This PR renames the `--custom-typeshed-dir`, `target-version`, and `--current-directory` cli options to `--typeshed`, `--python-version`, and `--project` as discussed in the CLI proposal document. I added aliases for `--target-version` (for Ruff compat) and `--custom-typeshed-dir` (for Alex) ## Test Plan Long help ``` An extremely fast Python type checker. Usage: red_knot [OPTIONS] [COMMAND] Commands: server Start the language server help Print this message or the help of the given subcommand(s) Options: --project <PROJECT> Run the command within the given project directory. All `pyproject.toml` files will be discovered by walking up the directory tree from the project root, as will the project's virtual environment (`.venv`). Other command-line arguments (such as relative paths) will be resolved relative to the current working directory."#, --venv-path <PATH> Path to the virtual environment the project uses. If provided, red-knot will use the `site-packages` directory of this virtual environment to resolve type information for the project's third-party dependencies. --typeshed-path <PATH> Custom directory to use for stdlib typeshed stubs --extra-search-path <PATH> Additional path to use as a module-resolution source (can be passed multiple times) --python-version <VERSION> Python version to assume when resolving types [possible values: 3.7, 3.8, 3.9, 3.10, 3.11, 3.12, 3.13] -v, --verbose... Use verbose output (or `-vv` and `-vvv` for more verbose output) -W, --watch Run in watch mode by re-running whenever files change -h, --help Print help (see a summary with '-h') -V, --version Print version ``` Short help ``` An extremely fast Python type checker. 
Usage: red_knot [OPTIONS] [COMMAND] Commands: server Start the language server help Print this message or the help of the given subcommand(s) Options: --project <PROJECT> Run the command within the given project directory --venv-path <PATH> Path to the virtual environment the project uses --typeshed-path <PATH> Custom directory to use for stdlib typeshed stubs --extra-search-path <PATH> Additional path to use as a module-resolution source (can be passed multiple times) --python-version <VERSION> Python version to assume when resolving types [possible values: 3.7, 3.8, 3.9, 3.10, 3.11, 3.12, 3.13] -v, --verbose... Use verbose output (or `-vv` and `-vvv` for more verbose output) -W, --watch Run in watch mode by re-running whenever files change -h, --help Print help (see more with '--help') -V, --version Print version ``` --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-12-13 08:21:52 +00:00
David Peter	d7ce548893	[red-knot] Add narrowing for 'while' loops (#14947 ) ## Summary Add type narrowing for `while` loops and corresponding `else` branches. closes #14861 ## Test Plan New Markdown tests.	2024-12-13 07:40:14 +01:00
David Peter	657d26ff20	[red-knot] Tests for 'while' loop boundness (#14944 ) ## Summary Regression test(s) for something that broke while implementing #14759. We have similar tests for other control flow elements, but feel free to let me know if this seems superfluous. ## Test Plan New mdtests	2024-12-12 21:06:56 +01:00
Alex Waygood	dbc191d2d6	[red-knot] Fixes to `Type::to_meta_type` (#14942 )	2024-12-12 19:55:11 +00:00
Alex Waygood	71239f248e	[red-knot] Add explicit TODO branches for many typing special forms and qualifiers (#14936 )	2024-12-12 17:57:26 +00:00
Alex Waygood	58930905eb	[red-knot] Fixup a few edge cases regarding `type[]` (#14918 )	2024-12-12 16:53:03 +00:00
Alex Waygood	45b565cbb5	[red-knot] `Any` cannot be parameterized (#14933 )	2024-12-12 11:50:34 +00:00
InSync	e4885a2fb2	[red-knot] Understand `typing.Tuple` (#14927 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-12-12 00:58:06 +00:00
David Peter	a7e5e42b88	[red-knot] Make `attributes.md` test future-proof (#14923 ) ## Summary Using `typing.LiteralString` breaks as soon as we understand `sys.version_info` branches, as it's only available in 3.11 and later. ## Test Plan Made sure it didn't fail on my #14759 branch anymore.	2024-12-11 20:46:24 +01:00
Alex Waygood	c361cf66ad	[red-knot] Precise inference for `__class__` attributes on objects of all types (#14921 )	2024-12-11 17:30:34 +00:00
Alex Waygood	a54353392f	[red-knot] Add failing test for use of `type[]` as a base class (#14913 ) We support using `typing.Type[]` as a base class (and we have tests for it), but not yet `builtins.type[]`. At some point we should fix that, but I don't think it's worth spending much time on now (and it might be easier once we've implemented generics?). This PR just adds a failing test with a TODO.	2024-12-11 17:08:00 +00:00
Alex Waygood	ef153a0cce	[red-knot] Remove an unnecessary branch and a confusing TODO comment (#14915 )	2024-12-11 16:57:40 +00:00
Alex Waygood	7135a49aea	[red-knot] Record the TODO message in `ClassBase::Todo`, same as in `Type::Todo` (#14919 )	2024-12-11 15:17:56 +00:00
Alex Waygood	1d91dae11f	[red-knot] Minor simplifications to `mro.rs` (#14912 )	2024-12-11 13:14:12 +00:00
Micha Reiser	881375a8d9	[red-knot] Lint registry and rule selection (#14874 ) ## Summary This is the third and last PR in this stack that adds support for toggling lints at a per-rule level. This PR introduces a new `LintRegistry`, a central index of known lints. The registry is required because we want to support lint rules from many different crates but need a way to look them up by name, e.g., when resolving a lint from a name in the configuration or analyzing a suppression comment. Adding a lint now requires two steps: 1. Declare the lint with `declare_lint` 2. Register the lint in the registry inside the `register_lints` function. I considered some more involved macros to avoid changes in two places. Still, I ultimately decided against it because a) it's just two places and b) I'd expect that registering a type checker lint will differ from registering a lint that runs as a rule in the linter. I worry that any more opinionated design could limit our options when working on the linter, so I kept it simple. The second part of this PR is the `RuleSelection`. It stores which lints are enabled and what severity they should use for created diagnostics. For now, the `RuleSelection` always gets initialized with all known lints and it uses their default level. ## Linter crates Each crate that defines lints should export a `register_lints` function that accepts a `&mut LintRegistryBuilder` to register all its known lints in the registry. This should make registering all known lints in a top-level crate easy: Just call `register_lints` of every crate that defines lint rules. I considered defining a `LintCollection` trait and even some fancy macros to accomplish the same but decided to go for this very simplistic approach for now. We can add more abstraction once needed. ## Lint rules This is a bit hand-wavy. 
I don't have a good sense of what our linter infrastructure will look like, but I expect we'll need a way to register the rules that should run as part of the red knot linter. One way is to keep doing what Ruff does by having one massive `checker` and each lint rule adds a call to itself in the relevant AST visitor methods. An alternative is that we have a `LintRule` trait that provides common hooks and implementations will be called at the "right time". Such a design would need a way to register all known lint implementations, possibly with the lint. This is where we'd probably want a dedicated `register_rule` method. A third option is that lint rules are handled separately from the `LintRegistry` and are specific to the linter crate. The current design should be flexible enough to support the three options. ## Documentation generation The documentation for all known lints can be generated by creating a factory, registering all lints by calling the `register_lints` methods, and then querying the registry for the metadata. ## Deserialization and Schema generation I haven't fully decided what the best approach is when it comes to deserializing lint rule names: * Reject invalid names in the deserializer. This gives us error messages with line and column numbers (by serde) * Don't validate lint rule names during deserialization; defer the validation until the configuration is resolved. This gives us more control over handling the error, e.g. emit a warning diagnostic instead of aborting when a rule isn't known. One technical challenge for both deserialization and schema generation is that the `Deserialize` and `JSONSchema` traits do not allow passing the `LintRegistry`, which is required to look up the lints by name. I suggest that we either rely on the salsa db being set for the current thread (`salsa::Attach`) or build our own thread-local storage for the `LintRegistry`. 
It's the caller's responsibility to make the lint registry available before calling `Deserialize` or `JSONSchema`. ## CLI support I prefer deferring adding support for enabling and disabling lints from the CLI for now because I think it will be easier to add once I've figured out how to handle configurations. ## Bitset optimization Ruff tracks the enabled rules using a cheap copyable `Bitset` instead of a hash map. This helped improve performance by a few percent (see https://github.com/astral-sh/ruff/pull/3606). However, this approach is no longer possible because lints have no "cheap" way to compute their index inside the registry (other than using a hash map). We could consider doing something similar to Salsa where each `LintMetadata` stores a `LazyLintIndex`. ``` pub struct LazyLintIndex { cached: OnceLock<(Nonce, LintIndex)> } impl LazyLintIndex { pub fn get(registry: &LintRegistry, lint: &'static LintMetadata) { let (nonce, index) = self.cached.get_or_init(|| registry.lint_index(lint)); if registry.nonce() == nonce { index } else { registry.lint_index(lint) } } ``` Each registry keeps a map from `LintId` to `LintIndex` where `LintIndex` is in the range of `0...registry.len()`. The `LazyLintIndex` is based on the assumption that every program has exactly one registry. This assumption allows to cache the `LintIndex` directly on the `LintMetadata`. The implementation falls back to the "slow" path if there is more than one registry at runtime. I was very close to implementing this optimization because it's kind of fun to implement. I ultimately decided against it because it adds complexity and I don't think it's worth doing in Red Knot today: * Red Knot only queries the rule selection when deciding whether or not to emit a diagnostic. It is rarely used to detect if a certain code block should run. This is different from Ruff where the rule selection is queried many times for every single AST node to determine which rules should run. 
* I'm not sure if a 2-3% performance improvement is worth the complexity. I suggest revisiting this decision when working on the linter where a fast path for deciding if a rule is enabled might be more important (but that depends on how lint rules are implemented) ## Test Plan I removed a lint from the default rule registry, and the MD tests started failing because the diagnostics were no longer emitted.	2024-12-11 13:25:19 +01:00
InSync	f30227c436	[red-knot] Understand `typing.Type` (#14904 ) Co-authored-by: Alex Waygood <alex.waygood@gmail.com>	2024-12-11 11:01:38 +00:00
Dimitri Papadopoulos Orfanos	a55722e740	Revert disjointness->disjointedness (#14906 ) ## Summary Partially revert #14880. While `disjointness` is missing from the [OED](https://www.oed.com/search/dictionary/?q=disjointness) and [SCOWL (And Friends)](http://app.aspell.net/lookup?dict=en_US-large;words=disjointness), it is commonly used in mathematics to describe disjoint sets. ## Test Plan CI tests.	2024-12-11 08:26:45 +00:00
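
The `from...import` lookup order quoted in commit e8e461da6a (#15041) can be observed directly in CPython. The following is only a minimal sketch: the package name `demo_pkg`, the submodule name `sub`, and the helper `import_name_from_package` are hypothetical names chosen for illustration and do not come from the ruff repository.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def import_name_from_package(init_source: str) -> object:
    """Create a throwaway package `demo_pkg` (hypothetical name) that always
    contains a submodule `sub.py`, then return whatever object the statement
    `from demo_pkg import sub` binds to the name `sub`."""
    with tempfile.TemporaryDirectory() as tmp:
        pkg = Path(tmp) / "demo_pkg"
        pkg.mkdir()
        (pkg / "__init__.py").write_text(init_source)
        (pkg / "sub.py").write_text("VALUE = 'submodule'\n")
        sys.path.insert(0, tmp)
        try:
            importlib.invalidate_caches()
            namespace = {}
            exec("from demo_pkg import sub", namespace)
            return namespace["sub"]
        finally:
            # Undo all global state so the helper can be called repeatedly.
            sys.path.remove(tmp)
            stale = [
                name
                for name in sys.modules
                if name == "demo_pkg" or name.startswith("demo_pkg.")
            ]
            for name in stale:
                del sys.modules[name]


# Case 1: the package's __init__.py defines an attribute named `sub`,
# so the attribute takes priority over the submodule of the same name.
as_attribute = import_name_from_package("sub = 'attribute'\n")

# Case 2: no such attribute exists, so Python falls back to importing
# the submodule `demo_pkg.sub` and binds the module object instead.
as_submodule = import_name_from_package("")
```

In the first case `as_attribute` is the string defined in `__init__.py`; in the second, `as_submodule` is the module object whose `VALUE` is set in `sub.py`. This is the runtime behaviour the commit message describes the type checker as mirroring.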


468 commits