Commit graph

2667 commits

Author SHA1 Message Date
Boshen
43a9cb5522
chore: update snapshot 2024-02-06 22:17:59 +08:00
Boshen
721f6cb74e
fix(codegen): format new expression + call expression with the correct parentheses (#2330)
closes #2328
2024-02-06 22:06:12 +08:00
Dunqing
40e9541cec
feat(semantic): add export binding for ExportDefaultDeclarations in module record (#2329) 2024-02-06 22:01:16 +08:00
Boshen
3b7dfb9bd1
Release oxlint and vscode extension v0.2.8 2024-02-06 15:57:00 +08:00
Boshen
80010f5b9c
Release napi oxc-parser@v0.3.0 2024-02-06 15:45:55 +08:00
luhc228
8771c6410f
feat: add typescript-eslint rule array-type (#2292)
Ref: https://github.com/oxc-project/oxc/issues/2180
2024-02-06 11:35:29 +08:00
Boshen
839e7c5c22
feat(napi/parser): add more linux-musl targets 2024-02-06 00:04:25 +08:00
Boshen
6fe9300880
chore(linter); add regression case for require-yield (#2326)
closes #2323
closes #2324
2024-02-05 22:57:28 +08:00
Boshen
1db780960c
Revert "refactor(semantic): get function by scope_id in set_function_node_flag (#2208)"
This reverts commit c62495d23f.
2024-02-05 22:49:10 +08:00
overlookmotel
cdef41d552
refactor(parser): lexer replace Chars with Source (#2288)
This PR replaces the `Chars` iterator in the lexer with a new structure
`Source`.

## What it does

`Source` holds the source text, and allows:

* Iterating through source text char-by-char (same as `Chars` did).
* Iterating byte-by-byte.
* Getting a `SourcePosition` for current position, which can be used
later to rewind to that position, without having to clone the entire
`Source` struct.

`Source` has the same invariants as `Chars` - cursor must always be
positioned on a UTF-8 character boundary (i.e. not in the middle of a
multi-byte Unicode character).

However, unsafe APIs are provided to allow a caller to temporarily break
that invariant, as long as they satisfy it again before they pass
control back to safe code. This will be useful for processing batches of
bytes.

## Why

I envisage most of the Lexer migrating to byte-by-byte iteration, and I
believe it'll make a significant impact on performance.

It will allow efficiently processing batches of bytes (e.g. consuming
identifiers or whitespace) without the overhead of calculating code
points for every character. It should also make all the many `peek()`,
`next_char()` and `next_eq()` calls faster.

`Source` is also more performant than `Chars` in itself. This wasn't my
intent, but seems to be a pleasant side-effect of it being less opaque
to the compiler than `Chars`, so it can apply more optimizations.

In addition, because checkpoints don't need to store the entire `Source`
struct, but only a `SourcePosition` (8 bytes), was able to reduce the
size of `LexerCheckpoint` and `ParserCheckpoint`, and make them both
`Copy`.

## Notes on implementation

`Source` is heavily based on Rust's `std::str::Chars` and
`std::slice::Iter` iterators and I've copied the code/concepts from them
as much as possible.

As it's a low-level primitive, it uses raw pointers and contains a *lot*
of unsafe code. I *think* I've crossed the T's and dotted the I's, and
I've commented the code extensively, but I'd appreciate a close review
if anyone has time.

I've split it into 2 commits.

* First commit is all the substantive changes.
* 2nd commit just does away with `lexer.current` which is no longer
needed, and replaces `lexer.current.token` with `lexer.token`
everywhere.

Hopefully looking just at the 1st commit will reduce the noise and make
it easier to review.

### `SourcePosition`

There is one annoyance with the API which I haven't been able solve:

`SourcePosition` is a wrapper around a pointer, which can only be
created from the current position of `Source`. Due to the invariant
mentioned above, therefore `SourcePosition` is always in bounds of the
source text, and points to a UTF-8 character boundary. So `Source` can
be rewound to a `SourcePosition` cheaply, without any checks. I had
originally envisaged `Source::set_position` being a safe function, as
`SourcePosition` enforces the necessary invariants itself.

The fly in the ointment is that a `SourcePosition` could theoretically
have been created from *another* `Source`. If that was the case, it
would be out of bounds, and it would be instant UB. Consequently,
`Source::set_position` has to be an unsafe function.

This feels rather ridiculous. *Of course* the parser won't create 2
Lexers at the same time. But still it's *possible*, so I think better to
take the strict approach and make it unsafe until can find a way to
statically prove the safety by some other means. Any ideas?

## Oddity in the benchmarks

There's something really odd going on with the semantic benchmark for
`pdf.mjs`.

While I was developing this, small and seemingly irrelevant changes
would flip that benchmark from +0.5% or so to -4%, and then another
small change would flip it back.

What I don't understand is that parsing happens outside of the
measurement loop in the semantic benchmark, so the parser shouldn't have
*any* effect either way on semantic's benchmarks.

If CodSpeed's flame graph is to be believed, most of the negative effect
appears to be a large Vec reallocation happening somewhere in semantic.

I've ruled out a few things: The AST produced by the parser for
`pdf.mjs` after this PR is identical to what it was before. And
semantic's `nodes` and `scopes` Vecs are same length as they were
before. Nothing seems to have changed!

I really am at a loss to explain it. Have you seen anything like this
before?

One possibility is a fault in my unsafe code which is manifesting only
with `pdf.mjs`, and it's triggering UB, which I guess could explain the
weird effects. I'm running the parser on `pdf.mjs` in Miri now and will
see if it finds anything (Miri doesn't find any problem running the
tests). It's been running for over an hour now. Hopefully it'll be done
by morning!

I feel like this shouldn't merged until that question is resolved, so
marking this as draft in the meantime.
2024-02-05 13:51:46 +00:00
Boshen
d9bea520ea
Release @oxc-parser/wasm v0.0.5 2024-02-05 21:10:11 +08:00
Yuji Sugiura
b27079cf8e
chore(linter): Add more tests for ESLintConfig (#2284)
Before trying  #2258 , I'd like to prevent regression. 🦺 

### Overview

- Rename `ESLintConfig::new(path)` -> `from_file(path)`
- Split `from_file()` implementation into 2 parts
  - Parse path, strip json comment, check `.json` ext part
  - `from_value()`: Read +parse JSON contents part
    - ☝🏻used in tests
- Add tests for parsing rules, settings, env

### TODOs found, for next PR
- `rules` parser should handle `"no-debugger": 1` form
- `settings.xxx_components` should go under `settings.react.`

### Notes

- `rules`'s type
  - https://github.com/eslint/eslint/blob/main/lib/shared/types.js#L12
- `settings`'s type is `Object` 😅 
  - https://github.com/eslint/eslint/blob/main/lib/shared/types.js#L53
  - and its usage is extended by each plugin
-
https://github.com/jsx-eslint/eslint-plugin-react?tab=readme-ov-file#configuration-legacy-eslintrc-
-
https://github.com/jsx-eslint/eslint-plugin-jsx-a11y/?tab=readme-ov-file#configurations
-
https://nextjs.org/docs/pages/building-your-application/configuring/eslint#eslint-plugin
- `env`'s type is just a `Record<string, boolean>`
  - https://github.com/eslint/eslint/blob/main/lib/shared/types.js#L40
2024-02-05 20:42:03 +08:00
magic-akari
577d7ab72f
feat(prettier): Support TSImportEqualsDeclaration (#2321) 2024-02-05 20:37:26 +08:00
magic-akari
c6273732f6
feat(prettier): Support TSExportAssignment (#2320) 2024-02-05 20:33:03 +08:00
Dunqing
ed29207781
chore(clippy): disable nursery group rules (#2319)
#2318
2024-02-05 18:43:15 +08:00
Dunqing
d571839ab8
feat(ast): enter AstKind::ExportDefaultDeclaration, AstKind::ExportNamedDeclaration and AstKind::ExportAllDeclaration (#2317) 2024-02-05 17:43:30 +08:00
Dunqing
a3570d41f0
feat(semantic): report parameter related errors for setter/getter (#2316) 2024-02-05 17:38:43 +08:00
Dunqing
9ca13d040d
feat(semantic): report type parameter list cannot be empty (#2315) 2024-02-05 16:05:51 +08:00
Boshen
a762d17603
feat(linter): promote no-this-before-super to correctness (#2313)
I've tested this in all real world test repos and found no false
positives. Thank you so much @u9g @TzviPM for making this happen!
2024-02-05 16:01:09 +08:00
Dunqing
d6d931cd80
fix(tasks/coverage): when providing filter, do not run just submodules (#2310) 2024-02-05 15:06:37 +08:00
Boshen
0948e675c5
chore(fuzz): add a timeout command 2024-02-05 14:41:14 +08:00
renovate[bot]
41d1876650
chore(deps): update rust crates (#2302) 2024-02-05 14:36:53 +08:00
Boshen
90205db21e
fix(fuzz): fix README name 2024-02-05 14:21:56 +08:00
Dunqing
540b2a0396
fix(semantic): remove unnecessary SymbolFlags::Import (#2311) 2024-02-05 14:16:29 +08:00
Boshen
ad64e0b395
chore(renovate): ignore miette and ureq 2024-02-05 14:14:29 +08:00
Dunqing
f53c54ced9
feat(semantic): report unexpected type annotation in ArrayPattern (#2309) 2024-02-05 13:45:52 +08:00
overlookmotel
73ec4df61c
chore(benchmark): add radix-ui.jsx as a real world jsx example (#2304)
Following #2297, this adds another benchmark.

This one is from radix-ui website. I've chosen this particular file
because it differs from the other benchmark sources in 3 ways:

1. JSX not TSX (despite the file extension).
2. Contains no logic, only JSX component hierarchy, and content text.
3. Very small (60 LOC).

The last is particularly important, I think. Often developers will be
working on small files (single component per file convention). And some
possible directions for the parser (SIMD etc) involve optimizing chewing
through chunks of text, with a de-opt at the end to process the final
batch of bytes. If that imposes a penalty on short files, this benchmark
will surface it.

---------

Co-authored-by: Boshen <boshenc@gmail.com>
2024-02-05 13:42:45 +08:00
Dunqing
f3035f1bbe
feat(semantic): apply ImportSpecifier's binder and remove ModuleDeclaration's binder (#2307)
Added in #2230, But i forgot to call.
2024-02-05 13:16:05 +08:00
renovate[bot]
bb9cf484e9
chore(deps): update website npm packages (#2303) 2024-02-05 13:11:40 +08:00
renovate[bot]
7138432541
chore(deps): update codecov/codecov-action action to v4 (#2305)
[![Mend
Renovate](https://app.renovatebot.com/images/banner.svg)](https://renovatebot.com)

This PR contains the following updates:

| Package | Type | Update | Change |
|---|---|---|---|
| [codecov/codecov-action](https://togithub.com/codecov/codecov-action)
| action | major | `v3` -> `v4` |

---

### Release Notes

<details>
<summary>codecov/codecov-action (codecov/codecov-action)</summary>

### [`v4`](https://togithub.com/codecov/codecov-action/compare/v3...v4)

[Compare
Source](https://togithub.com/codecov/codecov-action/compare/v3...v4)

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 8am on monday" in timezone
Asia/Shanghai, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Disabled by config. Please merge this manually once you
are satisfied.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Mend
Renovate](https://www.mend.io/free-developer-tools/renovate/). View
repository job log
[here](https://developer.mend.io/github/oxc-project/oxc).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xNzAuMCIsInVwZGF0ZWRJblZlciI6IjM3LjE3MC4wIiwidGFyZ2V0QnJhbmNoIjoibWFpbiJ9-->

Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
2024-02-05 13:08:56 +08:00
overlookmotel
9811c3a2c3
refactor(parser): name byte handler functions (#2301)
This PR solves the problem of lexer byte handlers all being called
`core::ops::function::FnOnce::call_once` in the flame graphs on
CodSpeed, by defining them as named functions instead of closures.

Pure refactor, no substantive changes.
2024-02-05 13:06:09 +08:00
Boshen
018674c56c
chore(benchmark): add cal.com.tsx as a realword tsx example (#2297)
closes #1884
2024-02-04 22:40:55 +08:00
Dunqing
cb17a83f4f
fix(semantic): remove ignore cases (#2300) 2024-02-04 22:40:41 +08:00
Boshen
6002560fa1
feat(span): fix memory leak by implementing inlineable string for oxc_allocator (#2294)
closes #1803

This string is currently unsafe, but I want to get miri working before
introducing more changes.

I want to make a progress from memory leak to unsafe then to safety.
It's harder to do the steps in one go.
2024-02-04 19:28:23 +08:00
Boshen
1822cfe18d
refactor(ast): fix BigInt memory leak by removing it (#2293)
relates

We'll need to evaluate the value by other means.
2024-02-04 16:47:00 +08:00
Boshen
a574f2852e
chore: ignore git submodules 2024-02-04 16:15:01 +08:00
Boshen
b5e43fbc5d
fix(linter): fix no_dupe_keys false postive on similar key names (#2291)
closes #2287
2024-02-04 14:54:09 +08:00
underfin
2f97b332bd
chore: codegen test skip v8 failed test (#2283)
Because v8 isn't pass some test262 tests, we also need to skip them, you
can find it at
https://chromium.googlesource.com/v8/v8/+/refs/heads/main/test/test262/test262.status.
2024-02-04 14:02:39 +08:00
Tzvi Melamed
0060d6a730
feat(linter): Implement no_this_before_super with cfg (#2254)
Implements `eslint/no-this-before-super` in #479.

Closes #2279
2024-02-04 13:51:04 +08:00
Boshen
d2b304b1f8
Publish crates v0.6.0 2024-02-03 22:35:30 +08:00
Boshen
52e32449b3
Release oxlint and vscode extension v0.2.7 2024-02-03 21:21:23 +08:00
Wenzhe Wang
0c225a49aa
fix(codegen): print space before with clause in import (#2278) 2024-02-02 14:52:32 +00:00
Boshen
3569e42475
ci: install just in conformance 2024-02-02 22:49:15 +08:00
Boshen
b2040ed2ec
ci: use node.js 20 2024-02-02 22:28:23 +08:00
Boshen
54dfd91729
ci: temporary fix git clones 2024-02-02 22:27:15 +08:00
Dunqing
37a2676e1e
fix(linter): AllowFunction doesn't support generator (#2277) 2024-02-02 21:53:44 +08:00
Dunqing
3cb85778bc
ci: incorrect uses value (#2276) 2024-02-02 21:53:10 +08:00
Boshen
d2f7ea2923
chore: manually clone git modules instead of using submodules (#2274)
closes #2231

Submodules are a blocker for beginners, we should make it clone on
demand.

It is also a blocker for people who wants to target this repo as a crate
for testing purposes, cargo will do a full clone if you specify
oxc_parser = { git = "this repo" } in Cargo.toml
2024-02-02 11:56:18 +00:00
Boshen
28daf83b19
feat(semantic): report no class name error (#2273)
closes #2144
2024-02-02 19:05:00 +08:00
kaykdm
2ceba79dcd
feat(tasks): add eslint-plugin-n rulegen (#2272)
related: https://github.com/oxc-project/oxc/issues/493

Support generate rules from
[eslint-plugin-n](https://github.com/eslint-community/eslint-plugin-n).
2024-02-02 18:50:58 +08:00