dan/oxc - BGit

mirror of https://github.com/danbulant/oxc synced 2026-05-25 12:51:57 +00:00

Author	SHA1	Message	Date
rzvxa	60f89d1fcf	improvement(semantic/cfg): better throw control flow. (#3473 ) https://github.com/rzvxa/oxlint-ecosystem-ci/actions/runs/9306698136	2024-06-06 07:55:50 +00:00
rzvxa	61bae74f76	improvement(semantic/cfg): better CFG API (#3472 )	2024-06-06 07:55:37 +00:00
rzvxa	3c7ee85ce4	improvement(semantic/cfg): better `break` and `continue` flow. (#3462 ) This PR adds a new edge type called `Jump` to distinguish between normal edges and jumps. There is also a control flow context which is used to keep track of cfg scopes and labels. It replaces the old `preserve_state` and `restore_state`. It corrects some mistakes - such as labeled blocks especially labeled continue which wasn't easy to implement with the old approach - in the old control flow but other than that it is mostly refactored to have a more declarative API instead of a procedural approach.	2024-06-06 07:55:31 +00:00
rzvxa	ff3f37dbbd	improvement(semantic/cfg): better control flow for `ForStatement`s. (#3453 ) similar to #3451 and #3452	2024-06-06 05:41:08 +00:00
rzvxa	91c999546c	improvement(semantic/cfg): better control flow for `DoWhileStatement`s. (#3452 ) similar to #3451	2024-06-06 05:41:06 +00:00
rzvxa	0012094188	improvement(semantic/cfg): better control flow for `WhileStatement`s. (#3451 ) Fixes a simple cfg issue, previously it wouldn't follow the body subgraph correctly.	2024-06-06 05:41:04 +00:00
rzvxa	209a99d49f	improvement(semantic/cfg): rework basic blocks. (#3381 ) I've replaced the `BasicBlockElement` with an `Instruction` type which would keep both the instruction kind and its associated AstNodeId. I also removed the register scheme in the control flow in favor of a simpler approach using explicit enums. https://github.com/oxc-project/oxc/pull/3381#issuecomment-2126622774	2024-06-06 05:41:01 +00:00
Dunqing	6978269be0	refactor(transformer/typescript): replace reference collector with symbols references (#3533 ) https://github.com/oxc-project/oxc/pull/3524 handled the references correctly, now we can remove the reference collector.	2024-06-05 09:02:51 +00:00
overlookmotel	c00598b9d4	fix(transformer): JSX set `reference_id` on refs to imports (#3524 ) Set `reference_id` for references to new imported bindings. e.g. `_jsx` in `_jsx(Foo, {})` where JSX transform has inserted `import {jsx as _jsx} from "react/jsx-runtime";`.	2024-06-05 10:57:05 +08:00
overlookmotel	7bbd3da0f2	refactor(traverse): `generate_uid` return `SymbolId` (#3520 ) Preparatory step for setting correct `SymbolId` for identifiers added to AST in transformer.	2024-06-04 03:02:37 +00:00
overlookmotel	6f71541db0	fix(semantic): set program scope_id for TS definition files (#3496 ) Semantic does not visit the AST for TS definition files, but it does create a root scope. Record this scope ID in `Program`'s `scope_id` field.	2024-06-01 10:13:45 +00:00
overlookmotel	55bbde2888	refactor(ast): move scope from `TSModuleBlock` to `TSModuleDeclaration` (#3488 ) Closes #3471. Remove scope from `TSModuleBlock` and add scope to `TSModuleDeclaration` instead.	2024-05-31 17:33:05 +00:00
Dunqing	9c58231c62	refactor(semantic): use a simpler way to resolve reference for ReferenceFlag::Type (#3430 )	2024-05-27 22:43:52 +08:00
overlookmotel	d4371e8f95	fix(transformer): use UIDs in TS namespace transforms (#3395 ) Use the `TraverseCtx::generate_uid` method introduced in #3394 to fix some of the TS namespace test cases. But... I honestly have no idea what I'm doing here! I am running up against a combination of 3 different areas where I know very little: 1. I am unfamiliar with `Semantic`. 2. I am unfamiliar with Oxc's conventions for writing transforms. 3. I don't "speak" TypeScript! This PR should not be merged as is. I'm pretty sure it's wrong. I've done it mostly just to test out `generate_uid`. In particular: 1. Is `SymbolFlags::FunctionScopedVariable` the right flags to use in all cases here? 2. `generate_uid` creates a symbol, and registers a binding for it in the scope tree. So if there are any branches in this logic where `name` doesn't actually get used after `generate_uid` is called, then it's not right. 3. Identifiers which are added to AST should also have corresponding `ReferenceId`s created for them (but I imagine this is also missing in many other places in the transformer). On point 2: We could split up `generate_uid` into 2 functions. One function would find a valid UID name but not register a binding for it. If you want to actually use that name, you'd call a 2nd function to generate the binding. Would that be a better API? Could someone help me out to progress this please? (Sorry my lack of knowledge is a bit useless here. I will learn all these things in time, but just trying to be honest about where I'm at right now. I'm sure I could figure it out myself, but it would take me hours, whereas others will probably look at it and know what to do in about 5 mins.)	2024-05-27 08:53:13 +08:00
mysteryven	5e06298ec2	fix(linter): memorize visited block id in `neighbors_filtered_by_edge_weight` (#3407 ) closes: #3396 We visit the same node too many times. Here are some of the visit counts from running the require-render-return rule on [timeseriesexplorer.js](https://github.com/elastic/kibana/blob/main/x-pack/plugins/ml/public/application/timeseriesexplorer/timeseriesexplorer.js): ```bash NodeIndex(64): 368640, NodeIndex(67): 737280, NodeIndex(70): 2949120, NodeIndex(73): 5971968, NodeIndex(76): 11943936, NodeIndex(43): 184320, NodeIndex(71): 2985984, NodeIndex(65): 368640, NodeIndex(68): 1474560, NodeIndex(74): 5971968, NodeIndex(77): 23887872, NodeIndex(44): 184320, NodeIndex(41): 73728, NodeIndex(35): 36864, NodeIndex(66): 737280, NodeIndex(69): 1474560, NodeIndex(72): 2985984, NodeIndex(75): 11943936, ```	2024-05-26 08:11:48 +00:00
overlookmotel	bcdc658bbb	feat(transformer): add `TraverseCtx::generate_uid` (#3394 ) Add `TraverseCtx::generate_uid` method. This is modelled on Babel's `scope.generateUid()` method. As discussed in https://github.com/oxc-project/oxc/discussions/3251#discussioncomment-9416826, this is required to fix most of the remaining failing tests in transformer Milestone 1. I have implemented this to work as closely as possible to Babel, so that it will generate same output as Babel for our tests. However, as mentioned in the code comments, this means it's a pretty expensive function to call. Those code comments suggest 2 ways in which we could make it much more efficient, but we'd need to decide how we're going to handle divergence from Babel before we can decide which route to go. I've left it as a `TODO(improve-on-babel)` for now.	2024-05-24 17:19:02 +08:00
Yuji Sugiura	3a5f088ca3	feat(linter/jsdoc): Implement require-returns rule (#3218 ) Part of #1170 > https://github.com/gajus/eslint-plugin-jsdoc/blob/main/docs/rules/require-returns.md	2024-05-23 21:12:10 +08:00
rzvxa	78e6326e48	refactor(semantic/cfg): alias petgraph's `NodeIndex` as `BasicBlockId`. (#3380 ) Hides petgraph's general `NodeIndex` type behind `BasicBlockId`.	2024-05-22 03:09:38 +00:00
Ali Rezvani	bb25c54b94	improvement(semantic/cfg): add the `ControlFlowGraphBuilder`. (#3372 ) It is a simple change. Before this, we had a lot of fields in our control flow graph that were only used during the build process. This PR aims to simplify the ControlFlowGraph. PS: Sorry for the long branch name. Apparently graphite gets confused when you write the commit body using the `gt create` command.	2024-05-21 19:36:42 +08:00
Dunqing	8a30a98465	fix(semantic) incorrect reference flag caused by MemberExpression (#3354 ) fix: https://github.com/oxc-project/oxc/issues/3326#issuecomment-2118858837	2024-05-19 14:55:18 +00:00
overlookmotel	6f3b1c8724	refactor(semantic): semantic populate `scope_id` fields in AST (#3303 ) `oxc_semantic` populate `scope_id` fields in AST nodes as it walks the tree. This does produce some duplication - scope IDs are stored both in the AST itself, and in `AstNode`. Will clean this up later on.	2024-05-16 16:21:19 +00:00
Jelle van der Waa	44b16ef79d	feat(linter/eslint): Implement max-classes-per-file (#3241 ) Rule Detail: [link](https://eslint.org/docs/latest/rules/max-classes-per-file)	2024-05-13 17:23:01 +08:00
rzvxa	1f135ce4ce	feat(linter/react): add the `rules_of_hooks` rule. (#3071 ) [RulesOfHooks](https://github.com/facebook/react/blob/main/packages/eslint-plugin-react-hooks/src/RulesOfHooks.js), [docs](https://react.dev/reference/rules/rules-of-hooks), [npm](https://www.npmjs.com/package/eslint-plugin-react-hooks) This one sounds like something straight out of `Mortal Kombat`!	2024-05-12 19:20:38 +00:00
Boshen	893af230c0	refactor: clean up more diagnostics usages	2024-05-12 21:46:03 +08:00
Boshen	d8173e1000	refactor: remove all usages of `Into<Error>`	2024-05-12 12:01:26 +08:00
Boshen	c5588c9b1c	refactor(semantic): clean up redeclaration diagnostic	2024-05-12 01:18:30 +08:00
Boshen	312f74bb63	refactor(diagnostics): s/OxcDiagnostic::new/OxcDiagnostic::error	2024-05-12 01:08:54 +08:00
Boshen	09f34fc942	refactor(semantic): unify diagnostic in checker	2024-05-12 01:07:28 +08:00
Ali Rezvani	20f643063e	improvement(semantic/cfg): add explicit break block element. (#3223 ) Works similar to how `throw` is working but is done for `break` statements.	2024-05-10 22:22:30 +08:00
Ali Rezvani	5e36e0d575	fix(semantic): add `cfg` nodes for `ConditionalExpression`s. (#3127 ) It is done similarly to how `IfStatement`s are structured at the moment.	2024-05-10 22:16:55 +08:00
Ali Rezvani	c91d26129c	fix(semantic): connect `test` expression of `for` statements to the cfg. (#3122 ) I don't know whether this is correct or not, but it fixes my issues with dangling cfg nodes created in for statements. #3071	2024-05-10 22:11:25 +08:00
renovate[bot]	56a0db8621	chore(deps): update dependency rust to v1.78.0 (#3151 )	2024-05-03 00:01:42 +08:00
Boshen	a8af5de8f5	refactor(syntax): move number related functions to number module (#3130 )	2024-04-29 18:54:35 +08:00
overlookmotel	7e1fe36c68	refactor(ast): squash nested enums (#3115 ) OK, this is a big one... I have done this as part of work on Traversable AST, but I believe it has wider benefits, so thought better to spin it off into its own PR. ## What this PR does This PR squashes all nested AST enum types (#2685). e.g.: Previously: ```rs pub enum Statement<'a> { BlockStatement(Box<'a, BlockStatement<'a>>), /* ...other Statement variants... */ Declaration(Declaration<'a>), } pub enum Declaration<'a> { VariableDeclaration(Box<'a, VariableDeclaration<'a>>), /* ...other Declaration variants... */ } ``` After this PR: ```rs #[repr(C, u8)] pub enum Statement<'a> { BlockStatement(Box<'a, BlockStatement<'a>>) = 0, /* ...other Statement variants... */ VariableDeclaration(Box<'a, VariableDeclaration<'a>>) = 32, /* ...other Declaration variants... */ } #[repr(C, u8)] pub enum Declaration<'a> { VariableDeclaration(Box<'a, VariableDeclaration<'a>>) = 32, /* ...other Declaration variants... */ } ``` All `Declaration`'s variants are combined into `Statement`, but `Declaration` type still exists. As both types are `#[repr(C, u8)]`, and the discriminants are aligned, a `Declaration` can be transmuted to a `Statement` at zero cost. This is the same thing as #2847, but here applied to *all* nested enums in the AST, and with improved helper methods. No enums increase in size, and a few get smaller. Indirection is reduced for some types (this removes multiple levels of boxing). ## Why? 1. It is a prerequisite for Traversable AST (#2987). 2. It would help a lot with AST Transfer (#2409) - it solves the only remaining blocker for this. 3. It is a step closer to making the whole AST `#[repr(C)]`. ## Why is it a good thing for the AST to be `#[repr(C)]`? Oxc's direction appears to be increasingly to build up control over the fundamental primitives we use, in order to unlock performance and features. We have our own allocator, our own custom implementations for `Box` and `Vec`, our own `IndexVec` (TBC). The AST is the central building block of Oxc, and taking control of its memory layout feels like a step in this same direction. Oxc has a major advantage over other similar libraries in that it keeps all the AST data in an arena. This opens the door to treating the AST either as Rust types or as pure data (just bytes). That data can be moved around and manipulated beyond what Rust natively allows. However, to enable that, the types need to be well-specified, with completely stable layouts. `#[repr(C)]` is the only tool Rust provides to do this. Once the types are `#[repr(C)]`, various features become possible: 1. Cheap transfer of the AST across boundaries without ser/deser - the property used by AST Transfer. 2. Having multiple versions of the AST (standard, read-only, traversable), and these AST representations can be converted to one another at zero cost via transmute - the property used by Traversable AST scheme. 3. Caching AST data on disk (#3079) or transferring across network. 4. Stuff we haven't thought of yet! Allowing the AST to be treated as pure data will likely unlock other "next level" features further down the track (caching for "edge bundling" comes to mind). ## The problem with `#[repr(C)]` It's not required to squash nested enums to make the AST `#[repr(C)]`. But the problem with `#[repr(C)]` is that it disables some compiler optimizations. Without `#[repr(C)]`, the compiler squashes enums itself in some cases (which is how `Statement` is currently 16 bytes). But making the types `#[repr(C)]` as they are currently disables this optimization. So this PR essentially makes explicit what the compiler is already doing - and in fact goes a bit further with the optimization than the compiler is able to, in squashing 3 or 4 layers of nested enums (the compiler only does up to 2 layers). ## Implementation One enum "inheriting" variants from another is implemented with `inherit_variants!` macro. ```rs inherit_variants! { #[repr(C, u8)] pub enum Statement<'a> { BlockStatement(Box<'a, BlockStatement<'a>>), /* ...other Statement variants... */ // `Declaration` variants added here by `inherit_variants!` macro @inherit Declaration // `ModuleDeclaration` variants added here by `inherit_variants!` macro @inherit ModuleDeclaration } } ``` The macro is *fairly* lightweight, and I think the above is quite easy to understand. No proc macros. The macro also implements utility methods for converting between enums e.g. `Statement::as_declaration`. These methods are all zero-cost (essentially transmutes). New patterns for dealing with nested enums are introduced: Creation: ```rs // Old let stmt = Statement::Declaration(Declaration::VariableDeclaration(var_decl)); // New let stmt = Statement::VariableDeclaration(var_decl); ``` Conversion: ```rs // Old let stmt = Statement::Declaration(decl); // New let stmt = Statement::from(decl); ``` Testing: ```rs // Old if matches!(stmt, Statement::Declaration(_)) { } if matches!(stmt, Statement::ModuleDeclaration(m) if m.is_import()) { } // New if stmt.is_declaration() { } if matches!(stmt, Statement::ImportDeclaration(_)) { } ``` Branching: ```rs // Old if let Statement::Declaration(decl) = &stmt { decl.do_stuff() }; // New if let Some(decl) = stmt.as_declaration() { decl.do_stuff() }; ``` Matching: ```rs // Old match stmt { Statement::Declaration(decl) => visitor.visit(decl), } // New (exhaustive match) match stmt { match_declaration!(Statement) => visitor.visit(stmt.to_declaration()), } // New (alternative) match stmt { _ if stmt.is_declaration() => visitor.visit(stmt.to_declaration()), } ``` New syntax has pluses and minuses vs the old. `match` syntax is worse, but when working with a deeply nested enum, the code is much nicer - it's shorter and easier to read. This PR removes 200 lines from the linter with changes like this: https://github.com/oxc-project/oxc/pull/3115/files#diff-dc417ff57352da6727a760ec6dee22de6816f8231fb69dbef1bf05d478699103L92-R95 ```diff - let AssignmentTarget::SimpleAssignmentTarget(simple_assignment_target) = - &assignment_expr.left - else { - return; - }; - let SimpleAssignmentTarget::AssignmentTargetIdentifier(ident) = - simple_assignment_target + let AssignmentTarget::AssignmentTargetIdentifier(ident) = &assignment_expr.left else { return; }; ```	2024-04-28 20:40:37 +08:00
Dunqing	c3d8a85eb5	feat(semantic): report that enum member must have initializer (#3113 ) See https://www.typescriptlang.org/play/?target=99#code/KYOwrgtgBAglDeAoKKoEMoF4oFk0BcALAOgGcBLEACgEYBKAGmVQCMmBfRRUSKAIQTMUGbACIaAJgDMopqihtEnRABtg+KADMwIAMZYoVOlgB8USVIDc3cNADCg+SK07dRuaw6IgA	2024-04-27 22:08:02 +08:00
Yuji Sugiura	5866086d17	feat(linter/jsdoc): Implement no-defaults rule (#3098 ) Part of #1170 > https://github.com/gajus/eslint-plugin-jsdoc/blob/main/docs/rules/no-defaults.md#repos-sticky-header	2024-04-26 00:03:21 +08:00
Dunqing	2dd96df847	feat(semantic): report namespace related errors (#3093 ) See https://www.typescriptlang.org/play/?target=99#code/GYVwdgxgLglg9mABAIzgEwJ4AoCUiDeAUIomAIYC2ApgM4AOZEViAIgQL6HtA	2024-04-25 14:01:58 +08:00
Ali Rezvani	dcb2528861	fix(semantic): revert test code pushed to the main by accident. (#3085 ) I'm terribly sorry, I've reverted the wrong commit in #3084.	2024-04-25 08:43:49 +08:00
Ali Rezvani	8d17ab36c3	fix(semantic): allow `root_node` to be empty for empty trees. (#3084 ) related to #3082, #3030 and #3069	2024-04-25 01:10:59 +08:00
Boshen	84c43c8282	fix(semantic): correctly resolve identifiers inside catch parameter initializers (#3050 ) closes #2499	2024-04-21 23:53:38 +08:00
Boshen	92d709bf21	feat(ast): add `CatchParameter` node (#3049 )	2024-04-21 23:43:39 +08:00
Boshen	1f7033e7ab	fix(semantic): correctly resolve identifiers inside parameter initializers (#3046 ) close: #2644 This fixes function parameters. I think we need an extra AST node to fix catch parameters, which will be the next PR.	2024-04-21 23:38:31 +08:00
Ali Rezvani	57ad6c4aa7	feat(semantic): add root node to the `AstNodes` structure. (#3032 ) Adds a way to fetch the root node without iterating over all ancestors which has a nondeterministic time - best case O(1) worst case O(n!) - It is only possible to set this field in the semantic builder.	2024-04-20 13:36:20 +08:00
Yuji Sugiura	2c325ef3d6	fix(semantic/jsdoc): Skip parsing `@` inside of backticks (#3017 ) This PR aims to support these cases. ````js /** * This is a normal comment, `@xxx` should not be parsed as a tag. * * @example ```ts // @comment @decoratorInComment class Foo { } ``` */ ```` Only `@example` should be parsed as a tag.	2024-04-18 19:18:46 +08:00
Yuji Sugiura	40af2b1662	feat(semantic/jsdoc): Handle optional type syntax for type name part (#2960 ) It seems `JSDocTypeNamePart` can contain whitespace like... ```js /** @property [cfg.n12="default value"] Config... */ ```	2024-04-15 10:18:39 +08:00
branchseer	f159f60084	Make ast types covariant over the allocator lifetime. (#2943 ) ## Why Due to the usage of `&'alloc mut T` in `oxc_allocator::Box`, and `bumpalo::collections::Vec` in `oxc_allocator::Vec`, ast types are currently invariant over their allocator lifetime `'a`. This prevents `ouroboros` from generating `borrow_` on ast type fields, leading to the unfriendly `with_` api: `c250b288ef/crates/oxc_parser/examples/multi-thread.rs (L82-L84)` ## How - For `oxc_allocator::Vec`, switch to `allocator_api2::vec::Vec`, which has a covariant relationship with the allocator lifetime. - For `oxc_allocator::Box`, use `std::ptr::NonNull` which is specifically designed to be covariant. I don't use `allocator_api2::boxed::Box` because it holds the allocator for dropping, so the size is bigger. ## Downside Now that `oxc_allocator::Box` uses the unsafe `NonNull`, it has to be a private field to be safe. This makes it impossible to do `Box(....)` pattern matching.	2024-04-12 18:12:18 +08:00
Yuji Sugiura	0a77d621e2	refactor(semantic/jsdoc): Rework JSDoc struct for better Span handling (#2917 ) > The error message emphasizes "empty text" so I would put the span on the extra text. > https://github.com/oxc-project/oxc/pull/2893#discussion_r1548843621 To address this, special `Span` handling should be implemented for comment part. So, this PR introduces: - `JSDocCommentPart` struct holds raw `.span` and special `.span_trimmed_first_line()` - Add `JSDocKindPart`, `JSDocTypePart` and `JSDocTypeNamePart` in the same manner - `JSDocTag` uses these depending on the purpose	2024-04-10 14:32:51 +08:00
Don Isaac	1ea24ea0a7	fix(semantic): symbols inside functions and classes incorrectly flagged as exported (#2896 ) # What This PR Does Symbols declared inside exported functions and classes were being incorrectly flagged with `SymbolFlags::Export`. For example, ```ts export function foo<T>(a: T) { let b = String(a) return b } ``` `T`, `a`, and `b` were all flagged as exported. ## Further Work It doesn't seem like exported enums and interfaces are being included in `ModuleRecord`. Am I looking in the wrong place, or are they actually missing?	2024-04-05 23:01:27 +08:00
Yuji Sugiura	aa63b6491f	feat(linter): Implement jsdoc/check-access (#2642 ) Part of #1170, Finally... 🗻 Some preparation PRs may be needed in advance. - [x] settings.jsdoc #2706 - [x] new struct design #2765 - [x] handle `Span` for diagnostics #2815 Implement plugin itself. - https://github.com/gajus/eslint-plugin-jsdoc/blob/main/docs/rules/check-access.md - https://github.com/gajus/eslint-plugin-jsdoc/blob/main/src/rules/checkAccess.js I'll send a PR to make this plugin public after confirming that a few more rules can be implemented without any problems.	2024-04-04 22:36:39 +08:00
Don Isaac	d3eb1c3318	fix(semantic): flag function expressions with `SymbolFlags::Function` (#2891 ) # What This PR Does Consider the following code snippet ```js const x = function y () {{ ``` `y` will now be flagged with `SymbolFlags::Function`. This follows tsc's behavior. - [Example in OXC Playground](https://oxc-project.github.io/oxc/playground/?code=3YCAAICagICAgICAgICxG0qZRraXZOpcCHVsSRwDq2kRR0HprsTfRRT5WMw%2Ff2epoIA%3D) - [Example in TypeScript AST Viewer](https://ts-ast-viewer.com/#code/MYewdgzgLgBAHjAvDAZgVzMKBLcMCeAFAJQwDeAvgFBA)	2024-04-03 10:42:11 +08:00
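
The enum squashing described in commit 7e1fe36c68 can be illustrated with a much smaller stand-alone sketch. Everything here is a simplified assumption for illustration: the payload structs (`Block`, `VarDecl`, `FnDecl`) are hypothetical, the enums use `std` boxes instead of the arena `Box<'a, T>`, and the helper methods are written by hand rather than generated by the `inherit_variants!` macro. It only shows the core idea that two `#[repr(C, u8)]` enums with aligned discriminants and identical payload types can be viewed as one another without copying.

```rust
// Hypothetical payload types standing in for real AST nodes.
#[derive(Debug, PartialEq)]
pub struct Block(pub u32);
#[derive(Debug, PartialEq)]
pub struct VarDecl(pub u32);
#[derive(Debug, PartialEq)]
pub struct FnDecl(pub u32);

/// The "inherited" enum. Discriminants start at 32, mirroring the commit's example.
#[repr(C, u8)]
#[derive(Debug, PartialEq)]
pub enum Declaration {
    VariableDeclaration(Box<VarDecl>) = 32,
    FunctionDeclaration(Box<FnDecl>) = 33,
}

/// The squashed parent enum: it repeats `Declaration`'s variants
/// with the same discriminants and the same payload types.
#[repr(C, u8)]
#[derive(Debug, PartialEq)]
pub enum Statement {
    BlockStatement(Box<Block>) = 0,
    VariableDeclaration(Box<VarDecl>) = 32,
    FunctionDeclaration(Box<FnDecl>) = 33,
}

// Compile-time guard: the reference cast below relies on both enums
// having the same size and alignment.
const _: () = assert!(std::mem::size_of::<Statement>() == std::mem::size_of::<Declaration>());
const _: () = assert!(std::mem::align_of::<Statement>() == std::mem::align_of::<Declaration>());

impl Statement {
    /// True when the current variant is one shared with `Declaration`.
    pub fn is_declaration(&self) -> bool {
        matches!(self, Self::VariableDeclaration(_) | Self::FunctionDeclaration(_))
    }

    /// Views a `Statement` as a `Declaration` without copying.
    pub fn as_declaration(&self) -> Option<&Declaration> {
        if self.is_declaration() {
            // SAFETY: both enums are `#[repr(C, u8)]`, every payload is a `Box`
            // (same size and alignment), and the shared variants have identical
            // discriminants and payload types. The check above guarantees the
            // tag is a valid `Declaration` discriminant.
            Some(unsafe { &*(self as *const Statement as *const Declaration) })
        } else {
            None
        }
    }
}

impl From<Declaration> for Statement {
    /// Safe, explicit version of the upward conversion. The commit describes
    /// this direction as a zero-cost transmute; a `match` is used here to keep
    /// the sketch free of further `unsafe`.
    fn from(decl: Declaration) -> Self {
        match decl {
            Declaration::VariableDeclaration(d) => Statement::VariableDeclaration(d),
            Declaration::FunctionDeclaration(d) => Statement::FunctionDeclaration(d),
        }
    }
}
```

With these definitions, `Statement::from(decl)` replaces the old `Statement::Declaration(decl)` wrapping, and `stmt.as_declaration()` replaces `if let Statement::Declaration(decl) = &stmt`, matching the "Conversion" and "Branching" patterns listed in the commit message.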

269 commits
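
The visit counts quoted in commit 5e06298ec2 roughly double from block to block, which is what happens when a graph walk re-enters shared successors once per incoming path. The sketch below is not the linter's `neighbors_filtered_by_edge_weight`; it is a hypothetical, `std`-only walk over a made-up chain of diamond-shaped branches, written to show how remembering visited block ids turns exponential revisits into a linear pass.

```rust
use std::collections::HashSet;

/// A tiny adjacency-list graph; node ids are indices into `edges`.
/// (Hypothetical stand-in for a control flow graph of basic blocks.)
pub struct Graph {
    pub edges: Vec<Vec<usize>>,
}

/// Walks everything reachable from `start` and returns how many times
/// a node was processed. With `memoize` set, each node is processed once.
/// Without it, the walk only terminates on acyclic graphs.
pub fn count_visits(graph: &Graph, start: usize, memoize: bool) -> usize {
    let mut visited = HashSet::new();
    let mut stack = vec![start];
    let mut visits = 0;
    while let Some(node) = stack.pop() {
        // `insert` returns false when the id was already recorded.
        if memoize && !visited.insert(node) {
            continue;
        }
        visits += 1;
        stack.extend(graph.edges[node].iter().copied());
    }
    visits
}

/// Builds `n` consecutive if/else-like diamonds:
/// join -> (left, right) -> next join, repeated `n` times.
pub fn diamond_chain(n: usize) -> Graph {
    let mut edges = vec![Vec::new(); 3 * n + 1];
    for i in 0..n {
        let (join, left, right, next) = (3 * i, 3 * i + 1, 3 * i + 2, 3 * i + 3);
        edges[join] = vec![left, right];
        edges[left] = vec![next];
        edges[right] = vec![next];
    }
    Graph { edges }
}
```

For a chain of 10 diamonds (31 nodes), the memoized walk processes 31 nodes, while the unmemoized walk processes 4093, because each join node is reached once per path leading into it.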