 b01003f81d
			
		
	
	
		b01003f81d
		
			
		
	
	
	
	
		
			
			## Summary
This PR includes a behavioral change to how we infer types for public
uses of symbols within a module. Where we would previously use the type
that a use at the end of the scope would see, we now consider all
reachable bindings and union the results:
```py
x = None
def f():
    reveal_type(x)  # previously `Unknown | Literal[1]`, now `Unknown | None | Literal[1]`
f()
x = 1
f()
```
This helps especially in cases where the the end of the scope is not
reachable:
```py
def outer(x: int):
    def inner():
        reveal_type(x)  # previously `Unknown`, now `int`
    raise ValueError
```
This PR also proposes to skip the boundness analysis of public uses.
This is consistent with the "all reachable bindings" strategy, because
the implicit `x = <unbound>` binding is also always reachable, and we
would have to emit "possibly-unresolved" diagnostics for every public
use otherwise. Changing this behavior allows common use-cases like the
following to type check without any errors:
```py
def outer(flag: bool):
    if flag:
        x = 1
        def inner():
            print(x)  # previously: possibly-unresolved-reference, now: no error
```
closes https://github.com/astral-sh/ty/issues/210
closes https://github.com/astral-sh/ty/issues/607
closes https://github.com/astral-sh/ty/issues/699
## Follow up
It is now possible to resolve the following TODO, but I would like to do
that as a follow-up, because it requires some changes to how we treat
implicit attribute assignments, which could result in ecosystem changes
that I'd like to see separately.
315fb0f3da/crates/ty_python_semantic/src/semantic_index/builder.rs (L1095-L1117)
## Ecosystem analysis
[**Full report**](https://shark.fish/diff-public-types.html)
* This change obviously removes a lot of `possibly-unresolved-reference`
diagnostics (7818) because we do not analyze boundness for public uses
of symbols inside modules anymore.
* As the primary goal here, this change also removes a lot of
false-positive `unresolved-reference` diagnostics (231) in scenarios
like this:
    ```py
    def _(flag: bool):
        if flag:
            x = 1
    
            def inner():
                x
    
            raise
    ```
* This change also introduces some new false positives for cases like:
    ```py
    def _():
        x = None
    
        x = "test"
    
        def inner():
x.upper() # Attribute `upper` on type `Unknown | None | Literal["test"]`
is possibly unbound
    ```
We have test cases for these situations and it's plausible that we can
improve this in a follow-up.
## Test Plan
New Markdown tests
		
	
			
		
			
				
	
	
	
	
		
			7.5 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	Eager scopes
Some scopes are executed eagerly: references to variables defined in enclosing scopes are resolved immediately. This is in contrast to (for instance) function scopes, where those references are resolved when the function is called.
Function definitions
Function definitions are evaluated lazily.
x = 1
def f():
    reveal_type(x)  # revealed: Unknown | Literal[1, 2]
x = 2
Class definitions
Class definitions are evaluated eagerly.
def _():
    x = 1
    class A:
        reveal_type(x)  # revealed: Literal[1]
        y = x
    x = 2
    reveal_type(A.y)  # revealed: Unknown | Literal[1]
List comprehensions
List comprehensions are evaluated eagerly.
def _():
    x = 1
    # revealed: Literal[1]
    [reveal_type(x) for a in range(1)]
    x = 2
Set comprehensions
Set comprehensions are evaluated eagerly.
def _():
    x = 1
    # revealed: Literal[1]
    {reveal_type(x) for a in range(1)}
    x = 2
Dict comprehensions
Dict comprehensions are evaluated eagerly.
def _():
    x = 1
    # revealed: Literal[1]
    {a: reveal_type(x) for a in range(1)}
    x = 2
Generator expressions
Generator expressions don't necessarily run eagerly, but in practice usually they do, so assuming they do is the better default.
def _():
    x = 1
    # revealed: Literal[1]
    list(reveal_type(x) for a in range(1))
    x = 2
But that does lead to incorrect results when the generator expression isn't run immediately:
def evaluated_later():
    x = 1
    # revealed: Literal[1]
    y = (reveal_type(x) for a in range(1))
    x = 2
    # The generator isn't evaluated until here, so at runtime, `x` will evaluate to 2, contradicting
    # our inferred type.
    print(next(y))
Though note that “the iterable expression in the leftmost for clause is immediately evaluated”
[spec]:
def iterable_evaluated_eagerly():
    x = 1
    # revealed: Literal[1]
    y = (a for a in [reveal_type(x)])
    x = 2
    # Even though the generator isn't evaluated until here, the first iterable was evaluated
    # immediately, so our inferred type is correct.
    print(next(y))
Top-level eager scopes
All of the above examples behave identically when the eager scopes are directly nested in the global scope.
Class definitions
x = 1
class A:
    reveal_type(x)  # revealed: Literal[1]
    y = x
x = 2
reveal_type(A.y)  # revealed: Unknown | Literal[1]
List comprehensions
x = 1
# revealed: Literal[1]
[reveal_type(x) for a in range(1)]
x = 2
# error: [unresolved-reference]
[y for a in range(1)]
y = 1
Set comprehensions
x = 1
# revealed: Literal[1]
{reveal_type(x) for a in range(1)}
x = 2
# error: [unresolved-reference]
{y for a in range(1)}
y = 1
Dict comprehensions
x = 1
# revealed: Literal[1]
{a: reveal_type(x) for a in range(1)}
x = 2
# error: [unresolved-reference]
{a: y for a in range(1)}
y = 1
Generator expressions
x = 1
# revealed: Literal[1]
list(reveal_type(x) for a in range(1))
x = 2
# error: [unresolved-reference]
list(y for a in range(1))
y = 1
evaluated_later.py:
x = 1
# revealed: Literal[1]
y = (reveal_type(x) for a in range(1))
x = 2
# The generator isn't evaluated until here, so at runtime, `x` will evaluate to 2, contradicting
# our inferred type.
print(next(y))
iterable_evaluated_eagerly.py:
x = 1
# revealed: Literal[1]
y = (a for a in [reveal_type(x)])
x = 2
# Even though the generator isn't evaluated until here, the first iterable was evaluated
# immediately, so our inferred type is correct.
print(next(y))
Lazy scopes are "sticky"
As we look through each enclosing scope when resolving a reference, lookups become lazy as soon as we encounter any lazy scope, even if there are other eager scopes that enclose it.
Eager scope within eager scope
If we don't encounter a lazy scope, lookup remains eager. The resolved binding is not necessarily in
the immediately enclosing scope. Here, the list comprehension and class definition are both eager
scopes, and we immediately resolve the use of x to (only) the x = 1 binding.
def _():
    x = 1
    class A:
        # revealed: Literal[1]
        [reveal_type(x) for a in range(1)]
    x = 2
Class definition bindings are not visible in nested scopes
Class definitions are eager scopes, but any bindings in them are explicitly not visible to any nested scopes. (Those nested scopes are typically (lazy) function definitions, but the rule also applies to nested eager scopes like comprehensions and other class definitions.)
def _():
    x = 1
    class A:
        x = 4
        # revealed: Literal[1]
        [reveal_type(x) for a in range(1)]
        class B:
            # revealed: Literal[1]
            [reveal_type(x) for a in range(1)]
    x = 2
x = 1
def _():
    class C:
        # revealed: Unknown | Literal[1]
        [reveal_type(x) for _ in [1]]
        x = 2
Eager scope within a lazy scope
The list comprehension is an eager scope, and it is enclosed within a function definition, which is a lazy scope. Because we pass through this lazy scope before encountering any bindings or definitions, the lookup is lazy.
def _():
    x = 1
    def f():
        # revealed: Unknown | Literal[1, 2]
        [reveal_type(x) for a in range(1)]
    x = 2
Lazy scope within an eager scope
The function definition is a lazy scope, and it is enclosed within a class definition, which is an eager scope. Even though we pass through an eager scope before encountering any bindings or definitions, the lookup remains lazy.
def _():
    x = 1
    class A:
        def f():
            # revealed: Unknown | Literal[1, 2]
            reveal_type(x)
    x = 2
Lazy scope within a lazy scope
No matter how many lazy scopes we pass through before encountering a binding or definition, the lookup remains lazy.
def _():
    x = 1
    def f():
        def g():
            # revealed: Unknown | Literal[1, 2]
            reveal_type(x)
    x = 2
Eager scope within a lazy scope within another eager scope
We have a list comprehension (eager scope), enclosed within a function definition (lazy scope),
enclosed within a class definition (eager scope), all of which we must pass through before
encountering any binding of x. Even though the last scope we pass through is eager, the lookup is
lazy, since we encountered a lazy scope on the way.
def _():
    x = 1
    class A:
        def f():
            # revealed: Unknown | Literal[1, 2]
            [reveal_type(x) for a in range(1)]
    x = 2
Annotations
Type annotations are sometimes deferred. When they are, the types that are referenced in an annotation are looked up lazily, even if they occur in an eager scope.
Eager annotations in a Python file
from typing import ClassVar
x = int
class C:
    var: ClassVar[x]
reveal_type(C.var)  # revealed: int
x = str
Deferred annotations in a Python file
from __future__ import annotations
from typing import ClassVar
x = int
class C:
    var: ClassVar[x]
reveal_type(C.var)  # revealed: Unknown | int | str
x = str
Deferred annotations in a stub file
from typing import ClassVar
x = int
class C:
    var: ClassVar[x]
# TODO: should ideally be `str`, but we currently consider all reachable bindings
reveal_type(C.var)  # revealed: int | str
x = str