LibCST

mirror of https://github.com/Instagram/LibCST.git synced 2025-12-23 10:35:53 +00:00

Author	SHA1	Message	Date
Tom Forbes	75b6331d55	Switch to using thread_local regular expressions to avoid regex mutex contention (#996 )	2023-08-26 15:21:05 +01:00
Tom Forbes	b28777e9e5	Remove criterion-cycles-per-byte dependency and related benchmark measurement (#995 )	2023-08-26 13:34:27 +01:00
Zsolt Dollenstein	380f045fe0	parser: use references instead of smart pointers for Tokens (#691 ) * Add cst_node proc macro * Split CST nodes into Deflated/Inflated versions	2022-06-07 04:08:37 -06:00
Zsolt Dollenstein	f3811a0e3f	native: add overall benchmark (#692 ) * Fix benchmarks on windows * add benchmark to cover parse_module	2022-05-29 10:27:11 +01:00
Zsolt Dollenstein	c91655fbba	fix copyright headers and add a script to check (#635 )	2022-02-01 11:13:17 +00:00
Zsolt Dollenstein	c44ff0500b	Fix license headers (#560 ) * Facebook -> Meta * remove year from doc copyright	2021-12-28 11:55:18 +00:00
Zsolt Dollenstein	c02de9b718	Implement a Python PEG parser in Rust (#566 ) This massive PR implements an alternative Python parser that will allow LibCST to parse Python 3.10's new grammar features. The parser is implemented in Rust, but it's turned off by default through the `LIBCST_PARSER_TYPE` environment variable. Set it to `native` to enable. The PR also enables new CI steps that test just the Rust parser, as well as steps that produce binary wheels for a variety of CPython versions and platforms. Note: this PR aims to be roughly feature-equivalent to the main branch, so it doesn't include new 3.10 syntax features. That will be addressed as a follow-up PR. The new parser is implemented in the `native/` directory, and is organized into two rust crates: `libcst_derive` contains some macros to facilitate various features of CST nodes, and `libcst` contains the `parser` itself (including the Python grammar), a `tokenizer` implementation by @bgw, and a very basic representation of CST `nodes`. Parsing is done by 1. tokenizing the input utf-8 string (bytes are not supported at the Rust layer, they are converted to utf-8 strings by the python wrapper) 2. running the PEG parser on the tokenized input, which also captures certain anchor tokens in the resulting syntax tree 3. using the anchor tokens to inflate the syntax tree into a proper CST Co-authored-by: Benjamin Woodruff <github@benjam.info>	2021-12-21 18:14:39 +00:00

7 commits