dan/oxc - BGit

mirror of https://github.com/danbulant/oxc synced 2026-05-23 06:08:47 +00:00

Author	SHA1	Message	Date
leaysgur	f8e1907c4f	feat(regular_expression): Intro `ConstructorParser`(and `LiteralParser`) to handle escape sequence in RegExp('pat') (#6635) Preparation for #6141. `oxc_regular_expression` can already parse and validate both `/regexp-literal/` and `new RegExp("string-literal")`. But one thing that was not well supported was reporting `Span` for the `RegExp("string-literal-with-\\escape")` case. For example, these two cases produce the same `RegExp` instances in JavaScript: - `/\d+/` - `new RegExp("\\d+")` For now, mainly in `oxc_linter`, the latter case is parsed with `oxc_parser` -> `ast::literal::StringLiteral` AST node -> `value` property. At this point, escape sequences are resolved(!), so `oxc_regular_expression` can handle the aligned `&str` as an argument without any problem in both cases. However, in terms of `Span` representation, these cases should be handled differently because of the `\\` in string literals... As a result, the parsed AST's `Span` for `new RegExp("string-literal")` is not accurate if it contains escape sequences. e.g. `a01a5dfdaf/crates/oxc_linter/src/snapshots/no_invalid_regexp.snap (L118-L122)` Each time the `\` appears, the subsequent position is shifted. `_` should be placed under `*` in this case. So... to resolve this issue, we need to implement `string_literal_parser` first, and use them as reading units of `oxc_regular_expression`.	2024-10-21 07:07:27 +00:00
leaysgur	5a73a663dc	refactor(regular_expression)!: Simplify public APIs (#6262) This PR makes 2 changes to improve parts of the existing API that were not very useful. - Remove `(Literal)Parser` and `FlagsParser` and their ASTs - Add `with_flags(flags_text)` helper to `ParserOptions` Here are the details. > Remove `(Literal)Parser` and `FlagsParser` and their ASTs Previously, the `oxc_regular_expression` crate exposed 3 parsers. - `(Literal)Parser`: assumes `/pattern/flags` format - `PatternParser`: assumes `pattern` part only - `FlagsParser`: assumes `flags` part only However, it turns out that in actual use cases, the `PatternParser` alone is sufficient, as the pattern and flags are validated and sliced in advance on the `oxc_parser` side. The current use case for `(Literal)Parser` is mostly internal testing. There were also some misuses of `(Literal)Parser` that rebuilt `format!("/{pattern}/{flags}")` and then parsed it with `(Literal)Parser`. Therefore, only `PatternParser` is now published, and unnecessary ASTs have been removed. (This also obsoletes #5592.) > Added `with_flags(flags_text)` helper to `ParserOptions` Strictly speaking, there was a subtle difference between the "flag" strings that users were aware of and the "mode" recognised by the parser. As a result, it was a common mistake to forget to enable `unicode_mode` when using the `v` flag. With this helper, crate users no longer need to distinguish between flags and modes.	2024-10-03 02:47:08 +00:00
Boshen	2da9a4d298	chore(regular_expression): rename visitor example to regex_visitor closes #6116 To avoid name collision with parser/visitor.rs.	2024-09-28 00:33:11 +08:00
camchenry	8d026e1dd9	feat(regular_expression): implement `GetSpan` for RegExp AST nodes (#6056) To make it easier to get the `Span` for some node in the Regex AST, I've implemented the `GetSpan` trait for all necessary structs.	2024-09-26 05:51:35 +00:00
camchenry	77647931e4	feat(regular_expression): implement visitor pattern trait for regex AST (#6055) - resolves https://github.com/oxc-project/oxc/issues/5977 - supersedes https://github.com/oxc-project/oxc/pull/5951 To facilitate easier traversal of the Regex AST, this PR defines a `Visit` trait with default implementations that will walk the entirety of the Regex AST. Methods in the `Visit` trait can be overridden with custom implementations to do things like analyzing only certain nodes in a regular expression, which will be useful for regex-related `oxc_linter` rules. In the future, we should consider automatically generating this code as it is very repetitive, but for now a handwritten visitor is sufficient.	2024-09-26 05:04:46 +00:00
Boshen	dd3ad4d68e	chore(regular_expression): remove circular dependency Error: Circular dependency detected: oxc_parser -> oxc_regular_expression	2024-08-23 16:16:10 +08:00
leaysgur	c7b81f5762	chore(regular_expression): Update example to support RegExp constructor (#5106) - Fix example to handle `new RegExp()` too - Update NOTE comments Until I tried interacting with the actual AST parsed by `oxc_parser`, I thought that the current `oxc_regular_expression` lacked support for the `RegExp` constructor due to escape sequences. This was because `"\""` remained `"\""` after reading the source text from `.js` files. However, once it was parsed by `oxc_parser`, I found that everything was [resolved](`8ef85a43c0/crates/oxc_parser/src/lexer/string.rs`)! (Wonderful work as usual. 👏🏻) Now there is nothing to worry about. 😌	2024-08-23 04:57:32 +00:00
leaysgur	96f57984eb	refactor(regular_expression): Misc refactoring for body_parser (#5062) - Add examples to list all `RegExp`s in source code - Refactor `MayContainStrings` related part	2024-08-22 11:21:41 +00:00
Boshen	081e2a37d9	refactor(regular_expression): s/`RegExpLiteral`/`RegularExpression`	2024-08-20 14:26:32 +08:00
Boshen	8d3f61bb54	chore(oxc_regular_expression): rename crate	2024-08-20 10:59:00 +08:00

10 commits
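
The span shift described in the f8e1907c4f message can be illustrated with a small, self-contained sketch. It is not code from `oxc_regular_expression`: the function `resolve_with_offsets` is hypothetical, and it assumes a simplified string literal body in which every escape is a backslash followed by exactly one character (no `\xHH`, `\u{...}`, or line continuations). It resolves the escapes and records, for each resolved character, where it started in the raw source text, which is the kind of mapping needed to report an accurate `Span`.

```rust
/// Hypothetical helper, not part of oxc: resolves single-character escapes
/// in the raw body of a string literal and records, for each character of
/// the resolved value, the byte offset in `raw` where it began.
fn resolve_with_offsets(raw: &str) -> (String, Vec<usize>) {
    let mut value = String::new();
    let mut offsets = Vec::new();
    let mut chars = raw.char_indices();

    while let Some((start, ch)) = chars.next() {
        let resolved = if ch == '\\' {
            // Consume the escaped character; only a few escapes are modelled.
            match chars.next() {
                Some((_, 'n')) => '\n',
                Some((_, 't')) => '\t',
                Some((_, other)) => other, // e.g. `\\` -> `\`, `\"` -> `"`
                None => '\\',              // lone trailing backslash
            }
        } else {
            ch
        };
        value.push(resolved);
        offsets.push(start);
    }

    (value, offsets)
}

fn main() {
    // Raw body of the string literal in `new RegExp("\\d+*")`.
    let raw = r"\\d+*";
    let (value, offsets) = resolve_with_offsets(raw);

    // The regex parser sees the resolved pattern, one character shorter.
    assert_eq!(value, r"\d+*");

    // `*` is the 4th character of the pattern (index 3), but it sits at
    // offset 4 in the source: every earlier escape shifts later positions.
    let star_index = value.chars().position(|c| c == '*').unwrap();
    println!("pattern index {} -> source offset {}", star_index, offsets[star_index]);
}
```

A diagnostic that uses the pattern index directly would point one column too early for each preceding escape; looking the index up in the offsets table avoids that.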