mirrors/uv - Forgejo: Beyond coding. We Forge.

mirrors/uv

mirror of https://github.com/astral-sh/uv.git synced 2025-07-07 13:25:00 +00:00

Author	SHA1	Message	Date
Zanie Blue	9997dc0870	There are no rage requests here (#5037 )	2024-07-13 15:35:51 +00:00
Charlie Marsh	c345484c93	Fall back to streaming wheel when `Content-Length` header is absent (#5000 ) ## Summary Closes https://github.com/astral-sh/uv/issues/4993	2024-07-12 01:04:21 +00:00
Zanie Blue	f5dce1124b	Retry on connection reset network errors (#4960 ) See helpful discussion at https://github.com/seanmonstar/reqwest/issues/1602#issuecomment-1220990725 and https://github.com/astral-sh/uv/issues/3514#issuecomment-2216986250 Should help with #3514 though I'll wait to close until it's confirmed as we cannot reproduce this.	2024-07-10 10:08:47 -05:00
Ibraheem Ahmed	d833910a5d	Avoid reparsing wheel URLs (#4947 ) ## Summary We currently store wheel URLs in an unparsed state because we don't have a stable parsed representation to use with rykv. Unfortunately this means we end up reparsing unnecessarily in a lot of places, especially when constructing a `Lock`. This PR adds a `UrlString` type that lets us avoid reparsing without losing the validity of the `Url`. ## Test Plan Shaves off another ~10 ms from https://github.com/astral-sh/uv/issues/4860. ``` ➜ transformers hyperfine "../../uv/target/profiling/uv lock" "../../uv/target/profiling/baseline lock" --warmup 3 Benchmark 1: ../../uv/target/profiling/uv lock Time (mean ± σ): 120.9 ms ± 2.5 ms [User: 126.0 ms, System: 80.6 ms] Range (min … max): 116.8 ms … 125.7 ms 23 runs Benchmark 2: ../../uv/target/profiling/baseline lock Time (mean ± σ): 129.9 ms ± 4.2 ms [User: 127.1 ms, System: 86.1 ms] Range (min … max): 123.4 ms … 141.2 ms 23 runs Summary ../../uv/target/profiling/uv lock ran 1.07 ± 0.04 times faster than ../../uv/target/profiling/baseline lock ```	2024-07-10 05:16:30 -04:00
Charlie Marsh	32ea636585	Preserve verbatim URLs for `--find-links` (#4838 ) Also gets rid of a lot of duplicated logic for `--find-links`. Closes https://github.com/astral-sh/uv/issues/4797	2024-07-05 16:57:40 -05:00
Danny	35afcfd053	Enable Registry Client Builder to be created from Base Client Builder (#4729 ) <!-- Thank you for contributing to uv! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary <!-- What's the purpose of the change? What does it do, and why? --> Addresses https://github.com/astral-sh/uv/issues/4330, to reduce duplication in the client creation logic. ## Test Plan <!-- How was it tested? --> https://github.com/astral-sh/uv/pull/4729#issuecomment-2204681655	2024-07-04 15:53:05 +00:00
konsti	4b19319485	Show when we retried requests (#4725 ) In #3514 and #2755, users had intermittent network errors, but it was not always clear whether we had already retried these requests or not. Building upon https://github.com/TrueLayer/reqwest-middleware/pull/159, this PR adds the number of retries to the error message, so we can see at first glance where we're missing retries and where we might need to change retry settings. Example error trace: ``` Could not connect, are you offline? Caused by: Request failed after 3 retries Caused by: error sending request for url (https://pypi.org/simple/uv/) Caused by: client error (Connect) Caused by: dns error: failed to lookup address information: Name or service not known Caused by: failed to lookup address information: Name or service not known ``` This code is ugly since i'm missing a better pattern for attaching context to reqwest middleware errors in https://github.com/TrueLayer/reqwest-middleware/pull/159.	2024-07-02 19:04:11 +02:00
Ibraheem Ahmed	be2a67cd9b	Replace `map_or(false, ..)` uses with `is_some_and` and `is_ok_and` (#4703 ) ## Summary Looks like there isn't a clippy lint for this yet.	2024-07-01 19:28:42 +00:00
Charlie Marsh	9701ead5be	Flatten errors in registry fetch (#4546 ) ## Summary Right now, the outer error is "fatal" and the inner error is "recoverable" (in some cases), but ultimately it's all the same error type?	2024-06-26 13:05:46 +00:00
Charlie Marsh	a5b5856521	Gracefully handle non-existent packages in local indexes (#4545 ) ## Summary Ensures that local indexes can be used as `--extra-index-url` by gracefully handling "404" errors. Closes https://github.com/astral-sh/uv/issues/4540.	2024-06-26 12:54:38 +00:00
Charlie Marsh	a07e70d93a	Avoid panic for invalid, non-base index URLs (#4527 ) ## Summary See: https://github.com/astral-sh/uv/issues/4510	2024-06-25 18:32:58 +00:00
Zanie Blue	1ce21475a5	Respect `.python-version` files and fetch manged toolchains in uv project commands (#4361 ) As in #4360, updates the uv project CLI to respect `.python-version` files as default Python version requests. Additionally, updates project interpreter discovery to fetch managed toolchains as in `uv venv --preview`.	2024-06-18 09:43:52 -05:00
Charlie Marsh	c996e8e3f3	Enable workspace lint configuration in remaining crates (#4329 ) ## Summary We didn't have Clippy enabled (to match our workspace settings) in a few crates.	2024-06-18 03:02:28 +00:00
Charlie Marsh	6dae1920af	Make missing `METADATA` file a recoverable error (#4247 ) ## Summary I don't have a great way to test it, but this makes the error described in https://github.com/astral-sh/uv/issues/4246 an incompatibility rather than a fatal error. Closes https://github.com/astral-sh/uv/issues/4246.	2024-06-11 19:49:38 +00:00
Charlie Marsh	656fc427b9	Add support for local directories with `--index-url` (#4226 ) ## Summary Closes #4078.	2024-06-10 22:27:04 -04:00
samypr100	68abf85f0d	feat: mTLS support (#4171 ) ## Summary Closes https://github.com/astral-sh/uv/issues/3626 This adds mTLS support to uv via the standard env var `SSL_CLIENT_CERT`. ## Test Plan Tested locally using a [nginx proxy to pypi](https://github.com/hauntsaninja/nginx_pypi_cache) using my own self-signed ca + certs + client certs generated via [mkcert](https://github.com/FiloSottile/mkcert). Used this proxy with both uv and pip to make sure we have feature partity in mTLS functionality.	2024-06-10 20:11:35 -05:00
Zanie Blue	f9ea304be4	Drop "registry" prefix from request timeout log (#4144 ) We use this base client for more than registry requests	2024-06-07 16:56:32 -05:00
konsti	63c84ed4a6	Log transient network request failures (#3933 ) We retry several kinds of network request failures, but it's often unclear whether a request was retried or not (https://github.com/astral-sh/uv/issues/3514#issuecomment-2105485773). This PR adds a small intermediary layer that logs all transient request failures, adding the `DEBUG Transient request failure` lines: ``` DEBUG Searching for Python interpreter in virtual environments DEBUG Found CPython 3.12.3 at `/home/konsti/projects/uv/.venv/bin/python3` (active virtual environment) DEBUG Using Python 3.12.3 environment at .venv/bin/python3 DEBUG Acquired lock for `.venv` DEBUG At least one requirement is not satisfied: tqdm DEBUG Using registry request timeout of 30s DEBUG Solving with target Python version 3.12.3 DEBUG Adding direct dependency: tqdm* DEBUG No cache entry for: https://pypi.org/simple/tqdm/ DEBUG Transient request failure for https://pypi.org/simple/tqdm/, retrying: Request error: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: client error (Connect) Caused by: dns error: failed to lookup address information: Name or service not known Caused by: failed to lookup address information: Name or service not known DEBUG Transient request failure for https://pypi.org/simple/tqdm/, retrying: Request error: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: client error (Connect) Caused by: dns error: failed to lookup address information: Name or service not known Caused by: failed to lookup address information: Name or service not known DEBUG Transient request failure for https://pypi.org/simple/tqdm/, retrying: Request error: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: client error (Connect) Caused by: dns error: failed to lookup address information: Name or service not known Caused by: failed to lookup address information: Name or service not known DEBUG Transient request failure for https://pypi.org/simple/tqdm/, retrying: Request error: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: client error (Connect) Caused by: dns error: failed to lookup address information: Name or service not known Caused by: failed to lookup address information: Name or service not known error: Could not connect, are you offline? Caused by: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: client error (Connect) Caused by: dns error: failed to lookup address information: Name or service not known Caused by: failed to lookup address information: Name or service not known ``` I decided for multi-line logging to show the complete error trace since only `Transient request failure for https://pypi.org/simple/tqdm/, retrying: Request error: error sending request for url (https://pypi.org/simple/tqdm/)` doesn't tell you the actual problem (a dns error). Note that running with `-v` will not show messages about retry backoff timing, but running with `RUST_LOG=debug` now shows a complete picture: ``` DEBUG starting new connection: https://pypi.org/ DEBUG resolving host="pypi.org" DEBUG Transient request failure for https://pypi.org/simple/tqdm/, retrying: Request error: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: error sending request for url (https://pypi.org/simple/tqdm/) Caused by: client error (Connect) Caused by: dns error: failed to lookup address information: Name or service not known Caused by: failed to lookup address information: Name or service not known WARN Retry attempt #2. Sleeping 528.728192ms before the next attempt ``` Fixes #3572	2024-06-04 15:39:16 +02:00
Charlie Marsh	11324646cb	Remove some `anyhow` usages (#3962 )	2024-06-01 20:11:23 +00:00
Charlie Marsh	cedd18e4c6	Remove some unused `pub` functions (#3872 ) ## Summary I wrote a bad Python script to find these.	2024-05-28 15:58:13 +00:00
Zanie Blue	84afca2696	Add offline support to `uv tool run` and `uv run` (#3676 ) Adds `--offline` support to `uv tool run` and `uv run` because I needed it on the airplane today. I think we should move `--offline` to the global settings like `--native-tls`.	2024-05-21 15:58:15 -05:00
Charlie Marsh	558f628ef1	Propagate URL errors in verbatim parsing (#3720 ) ## Summary Closes https://github.com/astral-sh/uv/issues/3715. ## Test Plan ``` ❯ echo "/../test" \| cargo run pip compile - error: Couldn't parse requirement in `-` at position 0 Caused by: path could not be normalized: /../test /../test ^^^^^^^^ ❯ echo "-e /../test" \| cargo run pip compile - error: Invalid URL in `-`: `/../test` Caused by: path could not be normalized: /../test Caused by: cannot normalize a relative path beyond the base directory ```	2024-05-21 19:58:59 +00:00
Charlie Marsh	cf997080b0	Rename `DistInfoMetadata` to `CoreMetadata` (#3699 ) ## Summary This reflects the change codified in PEP 714.	2024-05-21 18:26:59 +00:00
Charlie Marsh	fee344db6f	Add PEP 714 support for HTML API client (#3697 ) ## Summary If `data-core-metadata` is set, we need to respect that over `data-dist-info-metadata` in the HTML client. See: https://github.com/astral-sh/uv/issues/3689	2024-05-21 18:05:40 +00:00
Charlie Marsh	5205165d42	Add PEP 714 support for JSON API client (#3698 ) ## Summary Closes https://github.com/astral-sh/uv/issues/3689. ## Test Plan Manually verified we pick up `core-metadata` from PyPI if I remove the aliases.	2024-05-21 15:52:37 +00:00
Charlie Marsh	1124df9bc5	Remove subdirectory from direct wheel URL type (#3667 ) ## Summary Closes #3665.	2024-05-20 02:01:57 +00:00
Charlie Marsh	963f2a778b	URL-decode hashes in HTML fragments (#3655 ) ## Summary Closes https://github.com/astral-sh/uv/issues/3654	2024-05-18 22:19:55 -04:00
Andrew Gallant	018a7150d6	uv-distribution: include all wheels in distribution types (#3595 ) Our current flow of data from "simple registry package" to "final resolved distribution" goes through a number of types: * `SimpleMetadata` is the API response from a registry that includes all published versions for a package. Each version has an assortment of metadata associated with it. * `VersionFiles` is the aforementioned metadata. It is split in two: a group of files for source distributions and a group of files for wheels. * `PrioritizedDist` collects a subset of the files from `VersionFiles` to form a selection of the "best" sdist and the "best" wheel for the current environment. * `CompatibleDist` is created from a borrowed `PrioritizedDist` that, perhaps among other things, encapsulates the decision of whether to pick an sdist or a wheel. (This decision depends both on compatibility and the action being performed. e.g., When doing installation, a `CompatibleDist` will sometimes select an sdist over a wheel.) * `ResolvedDistRef` is like a `ResolvedDist`, but borrows a `Dist`. * `ResolvedDist` is the almost-final-form of a distribution in a resolution and is created from a `ResolvedDistRef`. * `AnnotatedResolvedDist` is a new data type that is the actual final form of a distribution that a universal lock file cares about. It bundles a `ResolvedDist` with some metadata needed to generate a lock file. One of the requirements of a universal lock file is that we include all wheels (and maybe all source distributions? but at least one if it's present) associated with a distribution. But the above flow of data (in the step from `VersionFiles` to `PrioritizedDist`) drops all wheels except for the best one. To remedy this, in this PR, we rejigger `PrioritizedDist`, `CompatibleDist` and `ResolvedDistRef` so that all wheel data is preserved. And when a `ResolvedDistRef` is finally turned into a `ResolvedDist`, we copy all of the wheel data. And finally, we adjust the `Lock` constructor to read this new data and include it in the lock file. To make this work, we also modify `RegistryBuiltDist` so that it can contain one or more wheels instead of just one. One shortcoming here (called out in the code as a FIXME) is that if a source distribution is selected as the "best" thing to use (perhaps there are no compatible wheels), then the wheels won't end up in the lock file. I plan to fix this in a follow-up PR. We also aren't totally consistent on source distribution naming. Sometimes we use `sdist`. Sometimes `source`. Sometimes `source_dist`. I think it'd be nice to just use `sdist` everywhere, but I do prefer the type names to be `SourceDist`. And sometimes you want function names to match the type names (i.e., `from_source_dist`), which in turn leads to an appearance of inconsistency. I'm open to ideas. Closes #3351	2024-05-15 15:07:28 -04:00
Charlie Marsh	55aedda379	Separate cache construction from initialization (#3607 ) ## Summary Ensures that we only initialize the cache for commands that require it. Closes https://github.com/astral-sh/uv/issues/3539.	2024-05-15 12:29:39 -04:00
konsti	c22c7cad4c	Add parsed URL fields to `Dist` variants (#3429 ) Avoid reparsing urls by storing the parsed parts across resolution on `Dist`. Part 2 of https://github.com/astral-sh/uv/issues/3408 and part of #3409 Closes #3408	2024-05-14 01:23:27 +00:00
Dimitri Papadopoulos Orfanos	d2ee567fe7	Fix a few typos found by codespell (#3543 ) <!-- Thank you for contributing to uv! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Just fix typos. While `alpha-numeric` is not really a misspelling: - it is missing from mainstream curated dictionaries, all of them suggest `alphanumeric`; - it is less used than `alphanumeric` (more than ⨉10 less) according to the Google [Ngram Viewer](https://books.google.com/ngrams/graph?content=alpha-numeric%2Calphanumeric&year_start=1900&year_end=2019&corpus=en-2019); - it is [missing from SCOWL](http://app.aspell.net/lookup?dict=en_US-large;words=alpha-numeric). ## Test Plan CI jobs.	2024-05-13 11:55:10 +00:00
Andrew Gallant	7d67b7bb49	pep508: un-export fields for MarkerEnvironment We now use the getters and setters everywhere. There were some places where we wanted to build a `MarkerEnvironment` out of whole cloth, usually in tests. To facilitate those use cases, we add a `MarkerEnvironmentBuilder` that provides a convenient constructor. It's basically like a `MarkerEnvironment::new`, but with named parameters. That's useful here because there are so many fields (and they many have the same type).	2024-05-09 10:06:02 -04:00
Charlie Marsh	18d229e2bb	Upgrade `async_http_range_reader` to v0.8.0 (#3460 ) ## Summary Closes #2025. Closes https://github.com/astral-sh/uv/issues/3255. Closes https://github.com/astral-sh/uv/pull/2843.	2024-05-08 10:54:08 -04:00
Ibraheem Ahmed	94cf604574	Remove unnecessary uses of `DashMap` and `Arc` (#3413 ) ## Summary All of the resolver code is run on the main thread, so a lot of the `Send` bounds and uses of `DashMap` and `Arc` are unnecessary. We could also switch to using single-threaded versions of `Mutex` and `Notify` in some places, but there isn't really a crate that provides those I would be comfortable with using. The `Arc` in `OnceMap` can't easily be removed because of the uv-auth code which uses the [reqwest-middleware](https://docs.rs/reqwest-middleware/latest/reqwest_middleware/trait.Middleware.html) crate, that seems to adds unnecessary `Send` bounds because of `async-trait`. We could duplicate the code and create a `OnceMapLocal` variant, but I don't feel that's worth it.	2024-05-06 22:30:43 -04:00
Andrew Gallant	1089abda3f	require serde and rkyv everywhere; remove optional serde and rkyv features (#3345 ) In some places in our crates, `serde` (and `rkyv`) are optional dependencies. I believe this was done out of reasons of "good sense," that is, it follows a Rust ecosystem pattern where serde integration tends to be an opt-in crate feature. (And similarly for `rkyv`.) However, ultimately, `uv` itself requires `serde` and `rkyv` to function. Since our crates are strictly internal, there are limited consumers for our crates without `serde` (and `rkyv`) enabled. I think one possibility is that optional `serde` (and `rkyv`) integration means that someone can do this: cargo test -p pep440_rs And this will run tests _without_ `serde` or `rkyv` enabled. That in turn could lead to faster iteration time by reducing compile times. But, I'm not sure this is worth supporting. The iterative compilation times of individual crates are probably fast enough in debug mode, even with `serde` and `rkyv` enabled. Namely, `serde` and `rkyv` themselves shouldn't need to be re-compiled in most cases. On `main`: ``` from-scratch: `cargo test -p pep440_rs --lib` 0.685 incremental: `cargo test -p pep440_rs --lib` 0.278s from-scratch: `cargo test -p pep440_rs --features serde,rkyv --lib` 3.948s incremental: `cargo test -p pep440_rs --features serde,rkyv --lib` 0.321s ``` So while a from-scratch build does take significantly longer, an incremental build is about the same. The benefit of doing this change is two-fold: 1. It brings out crates into alignment with "reality." In particular, some crates were _implicitly_ relying on `serde` being enabled without explicitly declaring it. This technically means that our `Cargo.toml`s were wrong in some cases, but it is hard to observe it because of feature unification in a Cargo workspace. 2. We no longer need to deal with the cognitive burden of writing `#[cfg_attr(feature = "serde", ...)]` everywhere.	2024-05-03 10:21:03 -04:00
Tim de Jager	9ae116f82b	fix: remove cache generic from builder (#3322 ) Just a small fix, remove generic argument that I think was unused.	2024-04-30 08:27:55 -05:00
Charlie Marsh	eabefbf8a2	Ignore 401 errors with multiple indexes (#3292 ) ## Summary It seems like Azure might return a 401 when you request a package that doesn't exist (even with valid credentials)? But I admittedly haven't tested this. (We already skip 403, and this seems similar?) Closes https://github.com/astral-sh/uv/issues/3291.	2024-04-28 10:06:43 -04:00
Yorick	43181f1933	Implement `--index-strategy unsafe-best-match` (#3138 ) ## Summary This index strategy resolves every package to the latest possible version across indexes. If a version is in multiple indexes, the first available index is selected. Implements #3137 This closely matches pip. ## Test Plan Good question. I'm hesitant to use my certifi example here, since that would inevitably break when torch removes this package. Please comment!	2024-04-27 01:24:54 +00:00
konsti	bed730571d	Fix single crate tokio features (#3234 ) Previously, uv-auth would fail to compile due to a missing process feature. I chose to make all tokio features we use top level features, so we can share the tokio cache between all test invocations.	2024-04-24 08:55:15 +00:00
konsti	d10903f0a4	30s default http read timeout (#3182 ) Since we're now using read timeouts and not total timeouts, we can use a lower threshold, a single read shouldn't take 5 min (and not even 10s). The 10s value is somewhat arbitrary. Like #3144, this is a breaking change in some sense.	2024-04-22 19:05:44 -04:00
Zanie Blue	f98eca8843	Fix authentication for URLs with a shared realm (#3130 ) In #2976 I made some changes that led to regressions: - We stopped tracking URLs that we had not seen credentials for in the cache - This means the cache no longer returns a value to indicate we've seen a realm before - We stopped seeding the cache with URLs - Combined with the above, this means we no longer had a list of locations that we would never attempt to fetch credentials for - We added caching of credentials found on requests - Previously the cache was only populated from the seed or credentials found in the netrc or keyring - This meant that the cache was populated for locations that we previously did not cache, i.e. GitHub artifacts(?) Unfortunately this unveiled problems with the granularity of our cache. We cache credentials per realm (roughly the hostname) but some realms have mixed authentication modes i.e. different credentials per URL or URLs that do not require credentials. Applying credentials to a URL that does not require it can lead to a failed request, as seen in #3123 where GitHub throws a 401 when receiving credentials. To resolve this, the cache is expanded to supporting caching at two levels: - URL, cached URL must be a prefix of the request URL - Realm, exact match required When we don't have URL-level credentials cached, we attempt the request without authentication first. On failure, we'll search for realm-level credentials or fetch credentials from external services. This avoids providing credentials to new URLs unless we know we need them. Closes https://github.com/astral-sh/uv/issues/3123	2024-04-22 13:06:57 -05:00
Charlie Marsh	9f2bc19eaf	Enforce HTTP timeouts on a per-read (rather than per-request) basis (#3144 ) ## Summary This leverages the new `read_timeout` property, which ensures that (like pip) our timeout is not applied to the _entire_ request, but rather, to each individual read operation. Closes: #1921. See: #1912.	2024-04-19 16:49:53 -04:00
elbaro	ab74263cbc	Skip HEAD requests for Pypicloud with Private S3 (#3070 )	2024-04-16 18:25:35 +00:00
Zanie Blue	c0efeeddf6	Rewrite `uv-auth` (#2976 ) Closes - #2822 - https://github.com/astral-sh/uv/issues/2563 (via #2984) Partially address: - https://github.com/astral-sh/uv/issues/2465 - https://github.com/astral-sh/uv/issues/2464 Supersedes: - https://github.com/astral-sh/uv/pull/2947 - https://github.com/astral-sh/uv/pull/2570 (via #2984) Some significant refactors to the whole `uv-auth` crate: - Improving the API - Adding test coverage - Fixing handling of URL-encoded passwords - Fixing keyring authentication - Updated middleware (see #2984 for more)	2024-04-16 11:48:37 -05:00
samypr100	7c7f06f62b	feat: convert linehaul tests to use snapshots (#2923 ) <!-- Thank you for contributing to uv! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Closes #2564 ## Test Plan 1. Changed existing linehaul tests to leverage insta. 2. Ran tests in various linux distros (Debian, Ubuntu, Centos, Fedora, Alpine) to ensure they also pass locally again. --------- Co-authored-by: konstin <konstin@mailbox.org>	2024-04-11 09:41:09 +00:00
Charlie Marsh	3dd673677a	Add `--find-links` source distributions to the registry cache (#2986 ) ## Summary Source distributions in `--find-links` are now properly picked up in the cache. Closes https://github.com/astral-sh/uv/issues/2978.	2024-04-11 01:25:58 +00:00
Charlie Marsh	1f3b5bb093	Add hash-checking support to `install` and `sync` (#2945 ) ## Summary This PR adds support for hash-checking mode in `pip install` and `pip sync`. It's a large change, both in terms of the size of the diff and the modifications in behavior, but it's also one that's hard to merge in pieces (at least, with any test coverage) since it needs to work end-to-end to be useful and testable. Here are some of the most important highlights: - We store hashes in the cache. Where we previously stored pointers to unzipped wheels in the `archives` directory, we now store pointers with a set of known hashes. So every pointer to an unzipped wheel also includes its known hashes. - By default, we don't compute any hashes. If the user runs with `--require-hashes`, and the cache doesn't contain those hashes, we invalidate the cache, redownload the wheel, and compute the hashes as we go. For users that don't run with `--require-hashes`, there will be no change in performance. For users that _do_, the only change will be if they don't run with `--generate-hashes` -- then they may see some repeated work between resolution and installation, if they use `pip compile` then `pip sync`. - Many of the distribution types now include a `hashes` field, like `CachedDist` and `LocalWheel`. - Our behavior is similar to pip, in that we enforce hashes when pulling any remote distributions, and when pulling from our own cache. Like pip, though, we _don't_ enforce hashes if a distribution is _already_ installed. - Hash validity is enforced in a few different places: 1. During resolution, we enforce hash validity based on the hashes reported by the registry. If we need to access a source distribution, though, we then enforce hash validity at that point too, prior to running any untrusted code. (This is enforced in the distribution database.) 2. In the install plan, we _only_ add cached distributions that have matching hashes. If a cached distribution is missing any hashes, or the hashes don't match, we don't return them from the install plan. 3. In the downloader, we _only_ return distributions with matching hashes. 4. The final combination of "things we install" are: (1) the wheels from the cache, and (2) the downloaded wheels. So this ensures that we never install any mismatching distributions. - Like pip, if `--require-hashes` is provided, we require that _all_ distributions are pinned with either `==` or a direct URL. We also require that _all_ distributions have hashes. There are a few notable TODOs: - We don't support hash-checking mode for unnamed requirements. These should be _somewhat_ rare, though? Since `pip compile` never outputs unnamed requirements. I can fix this, it's just some additional work. - We don't automatically enable `--require-hashes` with a hash exists in the requirements file. We require `--require-hashes`. Closes #474. ## Test Plan I'd like to add some tests for registries that report incorrect hashes, but otherwise: `cargo test`	2024-04-10 19:09:03 +00:00
Charlie Marsh	ddf02e7d5f	Remove unused `task-local-extensions` dependency (#2974 ) ## Summary Made obsolete with the `reqwest` upgrade.	2024-04-10 14:56:39 -04:00
Charlie Marsh	48ba7df98a	Move `FlatIndex` into the `uv-resolver` crate (#2972 ) ## Summary This lets us remove circular dependencies (in the future, e.g., #2945) that arise from `FlatIndex` needing a bunch of resolver-specific abstractions (like incompatibilities, required hashes, etc.) that aren't necessary to _fetch_ the flat index entries.	2024-04-10 14:38:42 -04:00
Charlie Marsh	a01143980a	Upgrade `reqwest` to v0.12.3 (#2817 ) ## Summary Closes #2814.	2024-04-10 11:20:44 -04:00

1 2 3

128 commits