dan/oxc - BGit

dan/oxc

mirror of https://github.com/danbulant/oxc synced 2026-05-25 04:42:10 +00:00

Author	SHA1	Message	Date
DonIsaac	d5c4b190aa	fix(parser): fix enum member parsing (#4543 ) Closes #4449	2024-07-30 10:43:09 +00:00
Boshen	7446e986c3	feat(codegen): align more esbuild implementations (#4510 )	2024-07-28 13:35:37 +00:00
Boshen	35654e665c	feat(codegen): align operator precedence with esbuild (#4509 ) closes #4339	2024-07-28 11:48:51 +00:00
Dunqing	f68b659713	refactor(ast)!: reorder fields of `ArrowFunctionExpression` (#4364 ) Found in #4328 Same as #4248	2024-07-19 14:12:05 +00:00
Dunqing	ace4f1ff77	refactor(semantic): update the order of `visit_function` and `Visit` fields in the builder to be consistent (#4248 ) Same as #4195	2024-07-14 11:39:15 +00:00
Dunqing	20cdb1fe0a	feat(semantic): align class scope with typescript (#4195 ) ```ts class Klass <T> extends Root <R> {} // ^^^^^ ^^^ ^^^^ ^^^ ^^ // id type_paramter super_class super_type_parameters body ``` I reorder fields according to the order above The class scope is not defined in the spec. But we need to create a scope for `class` to store `TypeParamters`	2024-07-14 03:19:30 +00:00
rzvxa	48947a26d2	fix(ast): put `decorators` before everything else. (#4143 ) Won't fix #4142 It is similar to #3994 but for those types that weren't relying on this order. It seems to be the right order. technically speaking it is a breaking change but I know as a fact that it won't have a big difference on our downstream. If you want it to be chronically correct feel free to merge as a breaking change.	2024-07-10 02:03:05 +00:00
rzvxa	b936162093	refactor(ast/ast_builder)!: shorter allocator utility method names. (#4122 ) This PR serves two purposes, First off it would lower the amount of characters we have to type in for a simple operation such as wrapping an expression in a vector. Secondly, it would follow the generated names more closely since nowhere else in the builder we do have `new_xxx`, We always say `xxx` since a builder always constructs something. ``` new_vec -> vec new_vec_single -> vec1* new_vec_from_iter -> vec_from_iter new_vec_with_capacity -> vec_with_capacity new_str -> str new_atom -> atom ``` `*` This one is the main motivation behind this PR, It saves 10 characters!	2024-07-09 12:16:38 +00:00
rzvxa	d347aedfda	feat(ast)!: generate `ast_builder.rs`. (#3890 ) ### Every structure has 2 builder methods: 1. `xxx` e.g. `block_statement` ```rust #[inline] pub fn block_statement(self, span: Span, body: Vec<'a, Statement<'a>>) -> BlockStatement<'a> { BlockStatement { span, body, scope_id: Default::default() } } ``` 2. `alloc_xxx` e.g. `alloc_block_statement` ```rust #[inline] pub fn alloc_block_statement( self, span: Span, body: Vec<'a, Statement<'a>>, ) -> Box<'a, BlockStatement<'a>> { self.block_statement(span, body).into_in(self.allocator) } ``` ### We generate 3 types of methods for enums: 1. `yyy_xxx` e.g. `statement_block` ```rust #[inline] pub fn statement_block(self, span: Span, body: Vec<'a, Statement<'a>>) -> Statement<'a> { Statement::BlockStatement(self.alloc(self.block_statement(span, body))) } ``` 2. `yyy_from_xxx` e.g. `statement_from_block` ```rust #[inline] pub fn statement_from_block<T>(self, inner: T) -> Statement<'a> where T: IntoIn<'a, Box<'a, BlockStatement<'a>>>, { Statement::BlockStatement(inner.into_in(self.allocator)) } ``` 3. `yyy_xxx` where `xxx` is inherited e.g. `statement_declaration` ```rust #[inline] pub fn statement_declaration(self, inner: Declaration<'a>) -> Statement<'a> { Statement::from(inner) } ``` ------------ ### Generic parameters: We no longer accept `Box<'a, ADT>`, `Atom` or `&'a str`, Instead we use `IntoIn<'a, Box<'a, ADT>>`, `IntoIn<'a, Atom<'a>>` and `IntoIn<'a, &'a str>` respectively. It allows us to rewrite things like this: ```rust let ident = IdentifierReference::new(SPAN, Atom::from("require")); let number_literal_expr = self.ast.expression_numeric_literal( right_expr.span(), num, raw, self.ast.new_str(num.to_string().as_str()), NumberBase::Decimal, ); ``` As this: ```rust let ident = IdentifierReference::new(SPAN, "require"); let number_literal_expr = self.ast.expression_numeric_literal( right_expr.span(), num, raw, num.to_string(), NumberBase::Decimal, ); ```	2024-07-09 00:57:26 +00:00
Boshen	243c9f35b0	refactor(parser): use function instead of trait to parse list with rest element (#4028 ) closes #3887	2024-07-02 13:43:14 +00:00
Boshen	1dacb1fc5b	refactor(parser): use function instead of trait to parse delimited lists (#4014 ) relates #3887 The rest of the list parsing trait implementations involves ... parsing `rest`, which I'll refactor in another PR.	2024-07-02 14:47:56 +08:00
Boshen	d0eac46fc8	refactor(parser): use function instead of trait to parse normal lists (#4003 ) To reduce boilerplate and code noise. relates #3887	2024-07-01 15:57:36 +00:00
DonIsaac	63f36daae0	feat(parser): parse modifiers with `parse_modifiers` (take 2) (#3977 ) Same as #3948, with fixes for bugs found by @Boshen.	2024-06-30 03:46:34 +00:00
Boshen	14bc31ee74	Revert "feat(parser): parse modifiers with `parse_modifiers` (#3948 )" This reverts commit `7b38bde073`.	2024-06-29 14:16:26 +08:00
DonIsaac	7b38bde073	feat(parser): parse modifiers with `parse_modifiers` (#3948 ) Closes #3929	2024-06-29 05:29:47 +00:00
Boshen	a471e62e2d	refactor(parser): clean up `try_parse` (#3925 )	2024-06-26 11:18:02 +00:00
Boshen	3db2553dc2	refactor(parser): improve parsing of TypeScript type arguments (#3923 )	2024-06-26 07:16:18 +00:00
Boshen	4cf3c7645f	refactor(parser): improve parsing of TypeScript types (#3903 ) - [x] fix everything	2024-06-26 05:58:16 +00:00
rzvxa	1f85f1a5f7	refactor(ast)!: revert adding `span` field to the `BindingPattern` type. (#3899 ) Since this is a temporary solution in the time that we are waiting for the `#[span]` hint, And there are already other workarounds used in our `ast_codegen` I propose removing it right away - sorry in my opinion adding it in the first place was a mistake - in favor of adding an edge case in the codegen. It is better to do the refactoring in the codegen instead of the production code which people may depend on.	2024-06-25 09:43:48 +00:00
Boshen	187f0782c1	refactor(parser): improve parsing of `parse_function_or_constructor_type` (#3892 ) part of #3502	2024-06-25 03:43:02 +00:00
Dunqing	ef82c78a72	fix(parser): trailing comma is not allowed in ParenthesizedExpression (#3885 ) close: #3878 The implementation is copied from `SeparatedList`'s `print_list`. ```diff fn parse_list(&mut self, p: &mut ParserImpl<'a>) -> Result<()> { p.expect(self.open())?; let mut first = true; while !p.at(self.close()) && !p.at(Kind::Eof) { if first { first = false; } else { p.expect(self.separator())?; - if p.at(self.close()) { - break; - } } self.parse_element(p)?; } p.expect(self.close())?; Ok(()) } ```	2024-06-24 14:56:32 +00:00
rzvxa	363d3d57d7	refactor(ast): add span field to the `BindingPattern` type. (#3855 ) So we don't have to introduce a special case while generating `GetSpan` implementations for all of our Ast types.	2024-06-23 16:00:40 +00:00
Boshen	13754cbd3d	fix(parser): change diagnostic to "modifier cannot be used here" (#3853 )	2024-06-23 15:14:27 +00:00
Boshen	97d59fc2f3	refactor(parser): move code around for parsing `Modifiers` (#3849 )	2024-06-23 12:46:42 +00:00
Boshen	ae09a97a09	refactor(ast)!: remove `Modifiers` from ts nodes (#3846 )	2024-06-23 19:44:35 +08:00
Boshen	0673677317	refactor(ast)!: replace `Modifiers` with `declare` on `Function` (#3844 )	2024-06-23 10:34:54 +00:00
Boshen	ee6ec4ee57	refactor(ast)!: replace `Modifiers` with `declare` and `abstract` on `Class` (#3841 )	2024-06-23 10:34:53 +00:00
Boshen	9b38119ec9	refactor(ast)!: replace `Modifiers` with `declare` on `VariableDeclaration` (#3839 ) part of #2958	2024-06-23 10:34:52 +00:00
Boshen	cfcef241db	feat(ast)!: add `directives` field to `TSModuleBlock` (#3830 ) closes #3564	2024-06-22 18:14:08 +00:00
Boshen	445603444f	feat(ast)!: add `IdentifierReference` to `ExportSpecifier` (#3820 ) closes #3795 closes #3796	2024-06-22 11:43:41 +00:00
overlookmotel	4f7ff7e3ad	perf: do not pass `&Atom` to functions (#3818 ) `Atom` is just a wrapper around `&str`, so better not to pass `&Atom` to functions, as that's a double-reference. Prefer `Atom` or `&str` instead to avoid indirection.	2024-06-22 04:48:00 +00:00
Boshen	dd540c8f0f	feat(minifier): add skeleton for ReplaceGlobalDefines ast pass (#3803 )	2024-06-21 13:53:59 +00:00
Boshen	051ceb6539	chore: improve some format by running `cargo +nightly fmt`	2024-06-19 00:48:30 +08:00
Dunqing	350cd9158a	fix(parser): should parser error when function declaration has no name (#3461 ) https://oxc-project.github.io/oxc/playground/?code=3YCAAICNgICAgICAgICzncl%2FKeF7k4Y7upgY2l43c79%2FYxaAgA%3D%3D	2024-05-30 19:58:50 +08:00
Dunqing	cf41513d28	fix(parser): parse const extends in arrow functions correctly (#3450 ) close: #3443	2024-05-28 21:34:52 +08:00
Boshen	6078a6d292	fix(parser): fix lexer error while parsing parenthesized arrow expressions (#3400 ) closes #3399	2024-05-24 09:59:17 +00:00
Boshen	89a1f97320	refactor(parser): improve expression parsing (#3352 )	2024-05-19 21:28:16 +08:00
Boshen	e818fba21c	refactor(parser): improve `parse_simple_arrow_function_expression` (#3349 )	2024-05-19 06:05:39 +00:00
Boshen	1e802c71d5	refactor(parser): clean up `ParserState` (#3345 )	2024-05-19 01:30:16 +08:00
Boshen	0742081921	refactor(parser): improve is_parenthesized_arrow_function_expression (#3343 )	2024-05-18 16:39:51 +00:00
Boshen	6b3d019631	refactor(paresr): move some structs to js module (#3341 )	2024-05-18 14:41:32 +00:00
Boshen	9ced605487	refactor(parser): start porting arrow function parsing from tsc (#3340 ) relates #3320	2024-05-18 22:35:29 +08:00
Don Isaac	27030b9eb4	perf(lexer): use bitshifting when parsing known integers (#3296 ) ## What This PR Does - perf(lexer): use bit shifting when parsing hex, octal, and binary integers instead of `mul_add`-ing on `f64`s. Check out the difference in assembly generated [here](https://godbolt.org/z/zMEKaeYzh) - perf(lexer): skip redundant utf8 check when parsing BigInts - refactor(lexer): remove `unsafe` usage (as per @overlookmotel's request [here](https://github.com/oxc-project/oxc/pull/3283#issuecomment-2111814598)) - test(lexer): add numeric parsing unit tests I don't expect this PR to have a large performance improvement, since the most common case (`Kind::Decimal`) is not affected. We could do this, however, by splitting `Kind::Decimal` into `Kind::DecimalFloat` and `Kind::DecimalInt` when the lexer encounters a `.`	2024-05-17 09:39:10 +08:00
Don Isaac	508dae6f8f	perf(lexer): dedupe numeric separator check (#3283 ) ## What This PR Does Updates numeric literal token lexing to record when separator characters (`_`) are found in a new `Token` flag. This then gets passed to `parse_int` and `parse_float`, removing the need for a second `_` check in those two functions. When run locally, I see no change to lexer benchmarks and minor improvements to codegen benchmarks. For some reason, semantic and source map benches seem to be doing slightly worse. Note that I attempted to implement this with `bitflags!` (making `escaped` and `is_on_newline` flags as well) and this caused performance degradation. My best guess is that it turned reads on these flags from a `mov` to a `mov` + a binary and. --------- Co-authored-by: Boshen <boshenc@gmail.com>	2024-05-15 01:48:21 +00:00
Boshen	b27a905958	refactor(parser): simplify `Context` passing (#3266 )	2024-05-14 12:22:27 +08:00
Boshen	c4ccf9f4d8	fix(parser): parse `DecoratorCallExpression` when `Arguments` contains `MemberExpression` (#3265 ) closes #3261 closes #3262 ``` DecoratorCallExpression[Yield, Await] : DecoratorMemberExpression[?Yield, ?Await] Arguments[?Yield, ?Await] ```	2024-05-14 03:18:41 +00:00
Dunqing	eefb66f750	feat(ast): add type to AccessorProperty to support TSAbractAccessorProperty (#3256 )	2024-05-13 12:35:58 +00:00
Boshen	2064ae9e0a	refactor(parser,diagnostic): one diagnostic struct to eliminate monomorphization of generic types (#3214 ) part of #3213 We should only have one diagnostic struct instead 353 copies of them, so we don't end up choking LLVM with 50k lines of the same code due to monomorphization. If the proposed approach is good, then I'll start writing a codemod to turn all the existing structs to plain functions. --- Background: Using `--timings`, we see `oxc_linter` is slow on codegen (the purple part). ![image](https://github.com/zkat/miette/assets/1430279/c1df4f7d-90ef-4c0f-9956-2ec3194db7ca) The crate currently contains 353 miette errors. [cargo-llvm-lines](https://github.com/dtolnay/cargo-llvm-lines) displays ``` cargo llvm-lines -p oxc_linter --lib --release Lines Copies Function name ----- ------ ------------- 830350 33438 (TOTAL) 29252 (3.5%, 3.5%) 808 (2.4%, 2.4%) <alloc::boxed::Box<T,A> as core::ops::drop::Drop>::drop 23298 (2.8%, 6.3%) 353 (1.1%, 3.5%) miette::eyreish::error::object_downcast 19062 (2.3%, 8.6%) 706 (2.1%, 5.6%) core::error::Error::type_id 12610 (1.5%, 10.1%) 65 (0.2%, 5.8%) alloc::raw_vec::RawVec<T,A>::grow_amortized 12002 (1.4%, 11.6%) 706 (2.1%, 7.9%) miette::eyreish::ptr::Own<T>::boxed 9215 (1.1%, 12.7%) 115 (0.3%, 8.2%) core::iter::traits::iterator::Iterator::try_fold 9150 (1.1%, 13.8%) 1 (0.0%, 8.2%) oxc_linter::rules::RuleEnum::read_json 8825 (1.1%, 14.9%) 353 (1.1%, 9.3%) <miette::eyreish::error::ErrorImpl<E> as core::error::Error>::source 8822 (1.1%, 15.9%) 353 (1.1%, 10.3%) miette::eyreish::error::<impl miette::eyreish::Report>::construct 8119 (1.0%, 16.9%) 353 (1.1%, 11.4%) miette::eyreish::error::object_ref 8119 (1.0%, 17.9%) 353 (1.1%, 12.5%) miette::eyreish::error::object_ref_stderr 7413 (0.9%, 18.8%) 353 (1.1%, 13.5%) <miette::eyreish::error::ErrorImpl<E> as core::fmt::Display>::fmt 7413 (0.9%, 19.7%) 353 (1.1%, 14.6%) miette::eyreish::ptr::Own<T>::new 6669 (0.8%, 20.5%) 39 (0.1%, 14.7%) alloc::raw_vec::RawVec<T,A>::try_allocate_in 6173 (0.7%, 21.2%) 353 (1.1%, 15.7%) miette::eyreish::error::<impl miette::eyreish::Report>::from_std 6027 (0.7%, 21.9%) 70 (0.2%, 16.0%) <alloc::vec::Vec<T> as alloc::vec::spec_from_iter_nested::SpecFromIterNested<T,I>>::from_iter 6001 (0.7%, 22.7%) 353 (1.1%, 17.0%) miette::eyreish::error::object_drop 6001 (0.7%, 23.4%) 353 (1.1%, 18.1%) miette::eyreish::error::object_drop_front 5648 (0.7%, 24.1%) 353 (1.1%, 19.1%) <miette::eyreish::error::ErrorImpl<E> as core::fmt::Debug>::fmt ``` It's totalling more than 50k llvm lines, and is putting pressure on rustc codegen (the purple part on `oxc_linter` in the image above. --- It's pretty obvious by looking at https://github.com/zkat/miette/blob/main/src/eyreish/error.rs, the generics can expand out to lots of code.	2024-05-11 04:56:22 +00:00
Dunqing	0ba7778e5e	fix(parser): correctly parse cls.fn<C> = x (#3208 ) close: #3206	2024-05-09 10:23:45 +08:00
Boshen	a8af5de8f5	refactor(syntax): move number related functions to number module (#3130 )	2024-04-29 18:54:35 +08:00

1 2 3 4

193 commits