datafusion-sqlparse

mirror of https://github.com/apache/datafusion-sqlparser-rs.git synced 2025-08-31 03:07:20 +00:00

Author	SHA1	Message	Date
Daniël Heres	15d5f71646	Add CREATE TABLE AS support (#206 ) We parse it as a regular `CREATE TABLE` statement followed by an `AS <query>`, which is how BigQuery works: https://cloud.google.com/bigquery/docs/reference/standard-sql/data-definition-language#create_table_statement ANSI SQL and PostgreSQL only support a plain list of columns after the table name in a CTAS `CREATE TABLE t (a) AS SELECT a FROM foo` We currently only allow specifying a full schema with data types, or omitting it altogether. https://www.postgresql.org/docs/12/sql-createtableas.html https://jakewheat.github.io/sql-overview/sql-2016-foundation-grammar.html#as-subquery-clause Finally, when no schema is specified, we print empty parens after a plain `CREATE TABLE t ();` as required by PostgreSQL, but skip them in a CTAS: `CREATE TABLE t AS ...`. This affects serialization only, the parser allows omitting the schema in a regular `CREATE TABLE` too since the first release of the parser: `7d27abdfb4/src/sqlparser.rs (L325-L332)` Co-authored-by: Nickolay Ponomarev <asqueella@gmail.com>	2020-06-23 16:30:22 +03:00
Jovansonlee Cesar	26361fd854	Implement ALTER TABLE DROP COLUMN (#148 ) This implements `DROP [ COLUMN ] [ IF EXISTS ] column_name [ CASCADE ]` sub-command of `ALTER TABLE`, which is what PostgreSQL supports https://www.postgresql.org/docs/12/sql-altertable.html (except for the RESTRICT option) Co-authored-by: Nickolay Ponomarev <asqueella@gmail.com>	2020-06-16 23:39:52 +03:00
mz	faeb7d440a	Implement ALTER TABLE ADD COLUMN and RENAME (#203 ) Based on sqlite grammar https://www.sqlite.org/lang_altertable.html	2020-06-16 22:52:37 +03:00
Daniël Heres	b24dbe513c	Replace FromStr with normal parser function for FileFormat (#201 ) The previous version accepted quoting file format keywords (`STORED AS "TEXTFILE"`) and was inconsistent with the way WindowFrameUnits was parsed.	2020-06-13 15:38:01 +03:00
Daniël Heres	68afa2a764	Make FileFormat case insensitive (#200 )	2020-06-12 18:10:44 +03:00
Daniël Heres	f4fbd9b6b3	Take slice as input for parse_keywords (#199 )	2020-06-12 02:10:17 +03:00
Max Countryman	6cdd4a146d	Support general "typed string" literals (#187 ) Fixes #168 by enabling `DATE` and other keywords to be used as identifiers when not followed by a string literal. A "typed string" is our term for generalized version of `DATE '...'`/`TIME '...'`/ `TIMESTAMP '...'` literals, represented as `TypedString { data_type, value }` in the AST. Unlike DATE/TIME/TIMESTAMP literals, this is a non-standard extension supported by PostgreSQL at least. This is a port of MaterializeInc/materialize#3146 Co-authored-by: Nikhil Benesch <nikhil.benesch@gmail.com> Co-authored-by: Nickolay Ponomarev <asqueella@gmail.com>	2020-06-12 00:04:43 +03:00
Daniël Heres	34548e890b	Change Word::keyword to a enum (#193 ) This improves performance and paves the way to future API enhancements as discussed in the PR https://github.com/andygrove/sqlparser-rs/pull/193	2020-06-11 22:00:35 +03:00
Nickolay Ponomarev	0fe3a8ec39	Use Token::EOF instead of Option<Token> (#195 ) This simplifies codes slightly, removing the need deal with the EOF case explicitly. The clone kludge in `_ => self.expected("date/time field", Token::Word(w.clone())))` will become unnecessary once we stop using a separate match for the keywords, as suggested in https://github.com/andygrove/sqlparser-rs/pull/193#issuecomment-641607194	2020-06-10 14:05:17 +03:00
Max Countryman	846c52f450	Allow omitting units after INTERVAL (#184 ) Alter INTERVAL to support postgres syntax This patch updates our INTERVAL implementation such that the Postgres and Redshfit variation of the syntax is supported: namely that 'leading field' is optional. Fixes #177.	2020-06-10 09:32:13 +03:00
Daniël Heres	d842f495db	Add line and column number to TokenizerError (#194 ) Addresses https://github.com/andygrove/sqlparser-rs/issues/179 for tokenize errors	2020-06-10 09:15:44 +03:00
Daniël Heres	d32df527e6	Accept &str in `Parse::parse_sql` (#182 ) It is more generic to accept a `&str` than a `String` in an API, and avoids having to convert a string to a `String` when not needed, avoiding a copy.	2020-06-03 23:31:41 +03:00
Daniël Heres	b4699bd4a7	Support bitwise and, or, xor (#181 ) Operator precedence is coming from: https://cloud.google.com/bigquery/docs/reference/standard-sql/operators	2020-06-03 19:02:05 +03:00
Daniël Heres	00dc490f72	Support the string concat operator (#178 ) The selected precedence is based on BigQuery documentation, where it is equal to `*` and `/`: https://cloud.google.com/bigquery/docs/reference/standard-sql/operators	2020-06-02 21:24:30 +03:00
Max Countryman	5f3c1bda01	Provide LISTAGG implementation (#174 ) This patch provides an initial implemenation of LISTAGG[1]. Notably this implemenation deviates from ANSI SQL by allowing both WITHIN GROUP and the delimiter to be optional. We do so because Redshift SQL works this way and this approach is ultimately more flexible. Fixes #169. [1] https://modern-sql.com/feature/listagg	2020-05-30 18:50:17 +03:00
QP Hou	418b9631ce	add nulls first/last support to order by expression (#176 ) Following `<sort specification list>` from the standard https://jakewheat.github.io/sql-overview/sql-2016-foundation-grammar.html#_10_10_sort_specification_list	2020-05-30 17:05:15 +03:00
Alex Dukhno	91f769e460	added create and drop schema	2020-05-28 19:50:16 +03:00
Christoph Müller	98f97d09db	Add support for "on delete cascade" column option (#170 ) Specifically, `FOREIGN KEY REFERENCES <foreign_table> (<referred_columns>)` can now be followed by `ON DELETE <referential_action>` and/or by `ON UPDATE <referential_action>`.	2020-05-27 18:24:23 +03:00
Nickolay Ponomarev	320d2f2d05	Update CHANGELOG.md and a fix last-minute review nit	2020-05-27 05:04:22 +03:00
mashuai	5aacc5ebcd	add create index and drop index support	2020-05-27 09:27:57 +08:00
Nickolay Ponomarev	327e6cd9f1	Report an error for unterminated string literals ...updated the TODOs regarding single-quoted literals parsing while at it.	2020-05-10 21:21:01 +03:00
Alex Dukhno	5ad578e3e5	Implement CREATE TABLE IF NOT EXISTS (#163 ) A non-standard feature supported at least by Postgres https://www.postgresql.org/docs/12/sql-createtable.html	2020-04-21 16:28:02 +03:00
Matt Jibson	c0b0b5924d	Add support for OFFSET with the ROWS keyword MySQL doesn't support the ROWS part of OFFSET. Teach the parser to remember which variant it saw, including just ROW.	2020-04-19 20:06:08 -06:00
Nickolay Ponomarev	05a29212ff	Update comments (follow-up to PR #155 ) https://github.com/andygrove/sqlparser-rs/pull/155	2020-04-20 04:58:24 +03:00
Eyal Leshem	3255fd3ea8	Add support to to table_name inside parenthesis	2020-04-12 20:31:09 +03:00
Alex Kyllo	172ba42001	Add support for MSSQL's SELECT TOP N syntax (#150 ) Add support for MSSQL SELECT TOP (N) [PERCENT] [WITH TIES] syntax.	2020-01-12 23:20:48 -05:00
Robert Grimm	b1cbc55128	Turn type Ident into struct Ident The Ident type was previously an alias for a String. Turn it into a full fledged struct, so that the parser can preserve the distinction between identifier value and quote style already made by the tokenizer's Word structure.	2019-10-20 00:16:41 -04:00
gaffneyk	2bb38c9b27	Parse START TRANSACTION when followed by a semicolon Co-authored-by: Nikhil Benesch <nikhil.benesch@gmail.com>	2019-09-13 22:54:21 -04:00
Nikhil Benesch	e9c5567b04	Merge pull request #135 from andygrove/show-columns Support MySQL `SHOW COLUMNS` statement	2019-09-02 07:40:57 -04:00
Nikhil Benesch	a0aca824e8	Optionally parse numbers into BigDecimals With `--features bigdecimal`, parse numbers into BigDecimals instead of leaving them as strings.	2019-09-01 13:21:49 -04:00
Nikhil Benesch	b5621c0fe8	Don't lose precision when parsing decimal fractions The SQL standard requires that numeric literals with a decimal point, like 1.23, are represented exactly, up to some precision. That means that parsing these literals into f64s is invalid, as it is impossible to represent many decimal numbers exactly in binary floating point (for example, 0.3). This commit parses all numeric literals into a new `Value` variant `Number(String)`, removing the old `Long(u64)` and `Double(f64)` variants. This is slightly less convenient for downstream consumers, but far more flexible, as numbers that do not fit into a u64 and f64 are now representable.	2019-09-01 13:21:30 -04:00
Nikhil Benesch	e1ded184f8	Support `SHOW <var>` and `SET <var>`	2019-09-01 13:20:37 -04:00
Brandon W Maister	f64928e994	Support MySQL `SHOW COLUMNS` statement Co-authored-by: Nikhil Benesch <nikhil.benesch@gmail.com>	2019-08-14 15:13:05 -04:00
Brennan Vincent	41d4ea480f	Add and use `expect_keywords` function The code for parsing chains of expected keywords is more readable with this helper function. Co-authored-by: Nikhil Benesch <nikhil.benesch@gmail.com>	2019-08-13 15:42:14 -04:00
Nickolay Ponomarev	086ba1281c	Amend WindowFrame docs The note about WindowFrameBound::Following being only valid "in WindowFrame::end_bound" was both - confusing, as it was based on the ANSI SQL syntax the parser doesn't adhere to -- though it sounded like a promise about the AST one could expect to get from the parser - and incomplete, as the reality is that the bounds validation the SQL engine might want to perform is more complex. For example Postgres documentation says <https://www.postgresql.org/docs/11/sql-expressions.html#SYNTAX-WINDOW-FUNCTIONS>: > Restrictions are that frame_start cannot be UNBOUNDED FOLLOWING, > frame_end cannot be UNBOUNDED PRECEDING, and the frame_end choice > cannot appear earlier in the above list of frame_start and frame_end > options than the frame_start choice does — for example RANGE BETWEEN > CURRENT ROW AND offset PRECEDING is not allowed. But, for example, > ROWS BETWEEN 7 PRECEDING AND 8 PRECEDING is allowed, even though it > would never select any rows.	2019-07-09 03:27:20 +03:00
Nickolay Ponomarev	f31636d339	Simplify parse_window_frame It used to consume the `RParen` closing the encompassing `OVER (`, even when no window frame was parsed, which confused me a bit, even though I wrote it initially. After fixing that, I took the opportunity to reduce nesting and duplication a bit.	2019-07-09 03:27:20 +03:00
Nickolay Ponomarev	9314371d3b	Remove parse_expr_list, as it's now trivial	2019-07-09 03:27:20 +03:00
Nickolay Ponomarev	03efcf6fa6	Add parse_comma_separated to simplify the parser To use the new helper effectively, a few related changes were required: - Each of the parse_..._list functions (`parse_cte_list`, `parse_order_by_expr_list`, `parse_select_list`) was replaced with a version that parses a single element of the list (e.g. `parse_cte`), with their callers now using `self.parse_comma_separated(Parser::parse_<one_element>)?` - `parse_with_options` now parses the WITH keyword and a separate `parse_sql_option` function (named after the struct it produces) was added to parse a single k=v option. - `parse_list_of_ids` is gone, with the '.'-separated parsing moved to `parse_object_name`. Custom comma-separated parsing is still used in: - parse_transaction_modes (where the comma separator is optional) - parse_columns (allows optional trailing comma, before the closing ')')	2019-07-09 03:27:20 +03:00
Nikhil Benesch	ed76cd68f8	Merge pull request #124 from vemoo/impl-display implement fmt::Display instead of ToString	2019-07-01 16:24:53 -04:00
Nickolay Ponomarev	7d4b488336	Update comments after the renaming done in PR #105	2019-07-01 04:45:08 +03:00
Bernardo	b2b159fed1	implement fmt::Display instead of ToString	2019-06-30 17:32:51 +02:00
Nikhil Benesch	106c9f8efb	Remove "SQL" prefix from "SQLDateTimeField" struct I realized a moment too late that I'd missed a type name in when removing the "SQL" prefix from types in `ac555d7e8`. As far as I can tell, this was the only oversight.	2019-06-25 13:24:31 -04:00
Nikhil Benesch	ac555d7e86	Remove "SQL" prefix from types The rationale here is the same as the last commit: since this crate exclusively parses SQL, there's no need to restate that in every type name. (The prefix seems to be an artifact of this crate's history as a submodule of Datafusion, where it was useful to explicitly call out which types were related to SQL parsing.) This commit has the additional benefit of making all type names consistent; over type we'd added some types which were not prefixed with "SQL".	2019-06-25 13:11:11 -04:00
Nikhil Benesch	cf655ad1a6	Remove "sql" prefix from module names Since this crate only deals with SQL parsing, the modules are understood to refer to SQL and don't need to restate that explicitly.	2019-06-24 12:56:26 -04:00
Andy Grove	0c23392adb	replace with code from datafusion	2018-09-03 09:56:39 -06:00
Andy Grove	a86bd30515	Refactoring	2018-09-03 09:13:43 -06:00
Andy Grove	375671e208	Refactoring	2018-09-03 08:04:20 -06:00
Andy Grove	a1696ccdb8	Refactoring	2018-09-03 07:59:05 -06:00
Andy Grove	fa2ef528b7	Refactoring	2018-09-03 07:45:48 -06:00
Andy Grove	037ebb0f73	Refactoring	2018-09-02 19:15:07 -06:00

1 2 3 4 5

210 commits