dan/oxc - BGit

dan/oxc

mirror of https://github.com/danbulant/oxc synced 2026-05-25 04:42:10 +00:00

Author	SHA1	Message	Date
overlookmotel	0bdecb5043	refactor(parser): wrapper type for parser (#2339 ) Split parser into public interface `Parser` and internal implementation `ParserImpl`. This involves no changes to public API. This change is a bit annoying, but justification is that it's required for #2341, which I believe to be very worthwhile. The `ParserOptions` type also makes it a bit clearer what the defaults for `allow_return_outside_function` and `preserve_parens` are. It came as a surprise to me that `preserve_parens` defaults to `true`, and this refactor makes that a bit more obvious when reading the code. All the real changes are in [oxc_parser/src/lib.rs](https://github.com/oxc-project/oxc/pull/2339/files#diff-8e59dfd35fc50b6ac9a9ccd991e25c8b5d30826e006d565a2e01f3d15dc5f7cb). The rest of the diff is basically replacing `Parser` with `ParserImpl` everywhere else.	2024-02-07 23:22:08 +08:00
Dunqing	55011e2793	feat(codegen): avoid printing comma in ArrayAssignmentTarget if the elements is empty (#2331 )	2024-02-06 22:45:19 +08:00
Boshen	721f6cb74e	fix(codegen): format new expression + call expression with the correct parentheses (#2330 ) closes #2328	2024-02-06 22:06:12 +08:00
Dunqing	40e9541cec	feat(semantic): add export binding for ExportDefaultDeclarations in module record (#2329 )	2024-02-06 22:01:16 +08:00
luhc228	8771c6410f	feat: add typescript-eslint rule array-type (#2292 ) Ref: https://github.com/oxc-project/oxc/issues/2180	2024-02-06 11:35:29 +08:00
Boshen	6fe9300880	chore(linter); add regression case for require-yield (#2326 ) closes #2323 closes #2324	2024-02-05 22:57:28 +08:00
Boshen	1db780960c	Revert "refactor(semantic): get function by scope_id in set_function_node_flag (#2208 )" This reverts commit `c62495d23f`.	2024-02-05 22:49:10 +08:00
overlookmotel	cdef41d552	refactor(parser): lexer replace `Chars` with `Source` (#2288 ) This PR replaces the `Chars` iterator in the lexer with a new structure `Source`. ## What it does `Source` holds the source text, and allows: * Iterating through source text char-by-char (same as `Chars` did). * Iterating byte-by-byte. * Getting a `SourcePosition` for current position, which can be used later to rewind to that position, without having to clone the entire `Source` struct. `Source` has the same invariants as `Chars` - cursor must always be positioned on a UTF-8 character boundary (i.e. not in the middle of a multi-byte Unicode character). However, unsafe APIs are provided to allow a caller to temporarily break that invariant, as long as they satisfy it again before they pass control back to safe code. This will be useful for processing batches of bytes. ## Why I envisage most of the Lexer migrating to byte-by-byte iteration, and I believe it'll make a significant impact on performance. It will allow efficiently processing batches of bytes (e.g. consuming identifiers or whitespace) without the overhead of calculating code points for every character. It should also make all the many `peek()`, `next_char()` and `next_eq()` calls faster. `Source` is also more performant than `Chars` in itself. This wasn't my intent, but seems to be a pleasant side-effect of it being less opaque to the compiler than `Chars`, so it can apply more optimizations. In addition, because checkpoints don't need to store the entire `Source` struct, but only a `SourcePosition` (8 bytes), was able to reduce the size of `LexerCheckpoint` and `ParserCheckpoint`, and make them both `Copy`. ## Notes on implementation `Source` is heavily based on Rust's `std::str::Chars` and `std::slice::Iter` iterators and I've copied the code/concepts from them as much as possible. As it's a low-level primitive, it uses raw pointers and contains a lot of unsafe code. I think I've crossed the T's and dotted the I's, and I've commented the code extensively, but I'd appreciate a close review if anyone has time. I've split it into 2 commits. * First commit is all the substantive changes. * 2nd commit just does away with `lexer.current` which is no longer needed, and replaces `lexer.current.token` with `lexer.token` everywhere. Hopefully looking just at the 1st commit will reduce the noise and make it easier to review. ### `SourcePosition` There is one annoyance with the API which I haven't been able solve: `SourcePosition` is a wrapper around a pointer, which can only be created from the current position of `Source`. Due to the invariant mentioned above, therefore `SourcePosition` is always in bounds of the source text, and points to a UTF-8 character boundary. So `Source` can be rewound to a `SourcePosition` cheaply, without any checks. I had originally envisaged `Source::set_position` being a safe function, as `SourcePosition` enforces the necessary invariants itself. The fly in the ointment is that a `SourcePosition` could theoretically have been created from another `Source`. If that was the case, it would be out of bounds, and it would be instant UB. Consequently, `Source::set_position` has to be an unsafe function. This feels rather ridiculous. Of course the parser won't create 2 Lexers at the same time. But still it's possible, so I think better to take the strict approach and make it unsafe until can find a way to statically prove the safety by some other means. Any ideas? ## Oddity in the benchmarks There's something really odd going on with the semantic benchmark for `pdf.mjs`. While I was developing this, small and seemingly irrelevant changes would flip that benchmark from +0.5% or so to -4%, and then another small change would flip it back. What I don't understand is that parsing happens outside of the measurement loop in the semantic benchmark, so the parser shouldn't have any effect either way on semantic's benchmarks. If CodSpeed's flame graph is to be believed, most of the negative effect appears to be a large Vec reallocation happening somewhere in semantic. I've ruled out a few things: The AST produced by the parser for `pdf.mjs` after this PR is identical to what it was before. And semantic's `nodes` and `scopes` Vecs are same length as they were before. Nothing seems to have changed! I really am at a loss to explain it. Have you seen anything like this before? One possibility is a fault in my unsafe code which is manifesting only with `pdf.mjs`, and it's triggering UB, which I guess could explain the weird effects. I'm running the parser on `pdf.mjs` in Miri now and will see if it finds anything (Miri doesn't find any problem running the tests). It's been running for over an hour now. Hopefully it'll be done by morning! I feel like this shouldn't merged until that question is resolved, so marking this as draft in the meantime.	2024-02-05 13:51:46 +00:00
Yuji Sugiura	b27079cf8e	chore(linter): Add more tests for ESLintConfig (#2284 ) Before trying #2258 , I'd like to prevent regression. 🦺 ### Overview - Rename `ESLintConfig::new(path)` -> `from_file(path)` - Split `from_file()` implementation into 2 parts - Parse path, strip json comment, check `.json` ext part - `from_value()`: Read +parse JSON contents part - ☝🏻used in tests - Add tests for parsing rules, settings, env ### TODOs found, for next PR - `rules` parser should handle `"no-debugger": 1` form - `settings.xxx_components` should go under `settings.react.` ### Notes - `rules`'s type - https://github.com/eslint/eslint/blob/main/lib/shared/types.js#L12 - `settings`'s type is `Object` 😅 - https://github.com/eslint/eslint/blob/main/lib/shared/types.js#L53 - and its usage is extended by each plugin - https://github.com/jsx-eslint/eslint-plugin-react?tab=readme-ov-file#configuration-legacy-eslintrc- - https://github.com/jsx-eslint/eslint-plugin-jsx-a11y/?tab=readme-ov-file#configurations - https://nextjs.org/docs/pages/building-your-application/configuring/eslint#eslint-plugin - `env`'s type is just a `Record<string, boolean>` - https://github.com/eslint/eslint/blob/main/lib/shared/types.js#L40	2024-02-05 20:42:03 +08:00
magic-akari	577d7ab72f	feat(prettier): Support TSImportEqualsDeclaration (#2321 )	2024-02-05 20:37:26 +08:00
magic-akari	c6273732f6	feat(prettier): Support TSExportAssignment (#2320 )	2024-02-05 20:33:03 +08:00
Dunqing	d571839ab8	feat(ast): enter AstKind::ExportDefaultDeclaration, AstKind::ExportNamedDeclaration and AstKind::ExportAllDeclaration (#2317 )	2024-02-05 17:43:30 +08:00
Dunqing	a3570d41f0	feat(semantic): report parameter related errors for setter/getter (#2316 )	2024-02-05 17:38:43 +08:00
Dunqing	9ca13d040d	feat(semantic): report type parameter list cannot be empty (#2315 )	2024-02-05 16:05:51 +08:00
Boshen	a762d17603	feat(linter): promote `no-this-before-super` to correctness (#2313 ) I've tested this in all real world test repos and found no false positives. Thank you so much @u9g @TzviPM for making this happen!	2024-02-05 16:01:09 +08:00
renovate[bot]	41d1876650	chore(deps): update rust crates (#2302 )	2024-02-05 14:36:53 +08:00
Dunqing	540b2a0396	fix(semantic): remove unnecessary SymbolFlags::Import (#2311 )	2024-02-05 14:16:29 +08:00
Dunqing	f53c54ced9	feat(semantic): report unexpected type annotation in ArrayPattern (#2309 )	2024-02-05 13:45:52 +08:00
Dunqing	f3035f1bbe	feat(semantic): apply ImportSpecifier's binder and remove ModuleDeclaration's binder (#2307 ) Added in #2230, But i forgot to call.	2024-02-05 13:16:05 +08:00
overlookmotel	9811c3a2c3	refactor(parser): name byte handler functions (#2301 ) This PR solves the problem of lexer byte handlers all being called `core::ops::function::FnOnce::call_once` in the flame graphs on CodSpeed, by defining them as named functions instead of closures. Pure refactor, no substantive changes.	2024-02-05 13:06:09 +08:00
Dunqing	cb17a83f4f	fix(semantic): remove ignore cases (#2300 )	2024-02-04 22:40:41 +08:00
Boshen	6002560fa1	feat(span): fix memory leak by implementing inlineable string for oxc_allocator (#2294 ) closes #1803 This string is currently unsafe, but I want to get miri working before introducing more changes. I want to make a progress from memory leak to unsafe then to safety. It's harder to do the steps in one go.	2024-02-04 19:28:23 +08:00
Boshen	1822cfe18d	refactor(ast): fix BigInt memory leak by removing it (#2293 ) relates We'll need to evaluate the value by other means.	2024-02-04 16:47:00 +08:00
Boshen	b5e43fbc5d	fix(linter): fix no_dupe_keys false postive on similar key names (#2291 ) closes #2287	2024-02-04 14:54:09 +08:00
Tzvi Melamed	0060d6a730	feat(linter): Implement no_this_before_super with cfg (#2254 ) Implements `eslint/no-this-before-super` in #479. Closes #2279	2024-02-04 13:51:04 +08:00
Boshen	d2b304b1f8	Publish crates v0.6.0	2024-02-03 22:35:30 +08:00
Wenzhe Wang	0c225a49aa	fix(codegen): print space before with clause in import (#2278 )	2024-02-02 14:52:32 +00:00
Dunqing	37a2676e1e	fix(linter): AllowFunction doesn't support generator (#2277 )	2024-02-02 21:53:44 +08:00
Boshen	28daf83b19	feat(semantic): report no class name error (#2273 ) closes #2144	2024-02-02 19:05:00 +08:00
Boshen	6849c047ef	chore(parser): add visitor example (#2271 ) closes #2256	2024-02-02 17:08:00 +08:00
Boshen	63dbac764f	chore(wasm): remove `console_error_panic_hook` We don't really need it.	2024-02-02 17:02:01 +08:00
Dunqing	8ac0202c9a	feat(codegen): keep shorthand in ObjectPattern and ObjectProperty (#2265 ) close: #2262 Do I need to add a test for this?	2024-02-02 08:32:51 +00:00
Boshen	650f6c942f	refactor: use our forked version of miette::Reporter for tests (#2266 )	2024-02-02 16:15:31 +08:00
Dunqing	da2ffdf7a0	feat(semantic): check parameters property (#2264 )	2024-02-02 15:58:32 +08:00
Dunqing	d71175e712	feat(semantic): check optional parameters (#2263 )	2024-02-02 15:54:04 +08:00
Boshen	8d99a15ac9	feat(semantic): report error on optional variable declaration in TypeScript (#2261 ) closes #2253 closes #2255	2024-02-02 14:13:10 +08:00
Dunqing	2578bb3d64	feat(ast): remove generator property from ArrowFunction (#2260 ) ArrowFunction doesn't support generator. https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Statements/function*	2024-02-02 04:01:19 +00:00
hjio	a95a16c2ae	feat(linter): complete custom components setting (#2234 ) - add custom components setting - let tasks/rulegen generate settings --------- Co-authored-by: huangjunjie.coder <huangjunjie.coder@bytedance.com>	2024-02-01 23:28:11 +08:00
Dunqing	de6d2f5dc5	refactor(transformer/decorators): optimizing code with ast.private_field (#2249 )	2024-02-01 22:30:48 +08:00
Tzvi Melamed	f4674f33b2	fix(oxc_semantic): Handle short-circuiting operators in CFG (#2252 ) Closes #2239	2024-02-01 21:04:28 +08:00
Tzvi Melamed	27681951e1	feat(oxc_semantic): Improve sample visualization (#2251 ) 1. add a `test.js` file to the project root: ```js class A extends B { constructor() { try { super(); } finally { this.a; } } } ``` 2. run: ```bash $ cargo run -p oxc_semantic --example simple Compiling oxc_semantic v0.5.0 (/home/tzvipm/src/github.com/tzvipm/oxc/crates/oxc_semantic) Finished dev [unoptimized + debuginfo] target(s) in 32.07s Running `target/debug/examples/simple` Wrote AST to: test.ast.txt Wrote CFG blocks to: test.cfg.txt Wrote CFG dot diagram to: test.dot ``` 3. resulting graph from .dot file: ![image](https://github.com/TzviPM/oxc/assets/1950680/7163deaa-ab75-4bed-a093-946e2d6d2206)	2024-02-01 12:55:56 +00:00
Tzvi Melamed	73ccf8a4da	fix(oxc_semantic): proper traversal of try statements (#2250 ) Closes #2227	2024-02-01 20:46:38 +08:00
Dunqing	165f948227	feat(ast): remove expression property from Function (#2247 )	2024-02-01 15:23:27 +08:00
Dunqing	02c18d8506	feat(transformer/decorators): support for static and private member decorators (#2246 )	2024-02-01 15:19:14 +08:00
Boshen	2beacd3f4d	fix(lexer): correct the span for irregular whitespaces (#2245 ) closes #2236	2024-02-01 14:18:47 +08:00
Boshen	589fd0cdd1	chore: omit warning for unused `TS_APPEND_CONTENT`	2024-02-01 14:07:40 +08:00
Tzvi Melamed	e561457683	feat(semantic): track cfg index per ast node (#2210 ) This allows looking up a cfg index from an ast node in a semantics return. This allows later passes to better make use of the cfg.	2024-02-01 13:27:20 +08:00
Maurice Nicholson	6b2150f3b3	Better report source line and col for multiline annotations (#2242 ) Improves diagnostic report source line- and column-number for multiline annotations Requires #2241	2024-02-01 11:37:28 +08:00
Dunqing	ba85b097e0	feat(transformer/decorators): support method decorator and is not static (#2238 )	2024-02-01 11:36:22 +08:00
overlookmotel	d0d708295b	refactor(parser): consume chars when parsing surrogate pair escape (#2243 ) This fixes a mistake I made in #2237. I was confused by the `!(...)` wrapping of the preceding `if` test and missed that there are definitely 2 chars to consume, so can use `consume_char()` instead of `next_char()`. This makes no difference to behavior, but it follows the convention to always prefer `consume_char()` when possible. I've also refactored the code which confused me, so hopefully others won't be confused too!	2024-02-01 11:34:26 +08:00

1 2 3 4 5 ...

1933 commits