fix: correct UTF-16 index handling in native MagicString#8693
fix: correct UTF-16 index handling in native MagicString#8693graphite-app[bot] merged 1 commit intomainfrom
Conversation
How to use the Graphite Merge QueueAdd the label graphite: merge-when-ready to this PR to add it to the merge queue. You must have a Graphite account in order to use the merge queue. Sign up using this link. An organization admin has enabled the Graphite Merge Queue in this repository. Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue. This stack of pull requests is managed by Graphite. Learn more about stacking. |
✅ Deploy Preview for rolldown-rs canceled.
|
6ed7f47 to
e03a0d1
Compare
Merge activity
|
There was a problem hiding this comment.
Pull request overview
Fixes native MagicString index handling so JS-facing indices behave correctly with Unicode (UTF-16 code unit indexing), including exact surrogate-boundary slicing behavior to match magic-string.
Changes:
- Reworked the native
CharToByteMapperto map UTF-16 code unit indices (JS string indices) to UTF-8 byte offsets. - Updated
sliceto correctly handle indices that land inside surrogate pairs by emitting lone surrogates via UTF-16 N-API string creation. - Added a new unicode-focused test suite covering emoji, CJK, mixed scripts, negative indices, and surrogate-boundary slicing.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| packages/rolldown/tests/magic-string/magic-string-unicode.test.ts | Adds coverage for unicode + surrogate-pair boundary behavior across MagicString APIs. |
| packages/rolldown/src/binding.d.cts | Extends slice docs to document surrogate-boundary behavior and UTF-16 string creation details. |
| crates/rolldown_binding/src/types/binding_magic_string.rs | Implements UTF-16-index → UTF-8-byte mapping and surrogate-aware slice returning UTF-16 JS strings when needed. |
Benchmarks Rust |
Merging this PR will not alter performance
Comparing Footnotes
|
e03a0d1 to
7668692
Compare
198f27f to
560b408
Compare
There was a problem hiding this comment.
Pull request overview
Fixes native MagicString index handling to align with JavaScript UTF-16 code unit indices (including surrogate-pair boundary behavior), preventing panics and matching magic-string semantics.
Changes:
- Replace the char-based index→byte mapper with a UTF-16 code-unit index→UTF-8 byte offset mapper.
- Update
sliceto correctly handle surrogate-pair boundary indices by emitting lone surrogates via UTF-16 N-API string creation. - Add a dedicated unicode regression test suite covering emoji/CJK/mixed scripts/negative indices/surrogate boundary slicing.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| packages/rolldown/tests/magic-string/magic-string-unicode.test.ts | Adds unicode-focused regression tests for UTF-16 indexing and surrogate boundary slicing. |
| packages/rolldown/src/binding.d.cts | Updates slice documentation to explicitly describe UTF-16 index semantics and lone-surrogate behavior. |
| crates/rolldown_binding/src/types/binding_magic_string.rs | Implements UTF-16 index→byte mapping and updates binding methods (notably slice) to avoid panics and match JS behavior. |
560b408 to
ce90866
Compare
## Summary - Fix `CharToByteMapper` to index by UTF-16 code unit position (matching JS string indices) instead of Rust char position, and accumulate UTF-8 byte offsets instead of UTF-16 lengths - When `slice` indices fall mid-surrogate-pair, emit lone surrogates via `napi_create_string_utf16` to match original magic-string behavior exactly - Add comprehensive unicode test suite covering emoji, CJK, mixed scripts, negative indices, and surrogate pair boundary slicing Fixes #8685 ## Test plan - [x] New `magic-string-unicode.test.ts` covers emoji slice/overwrite/remove, CJK, mixed scripts, negative indices, and lone surrogate emission - [x] All 157 existing magic-string tests continue to pass - [x] Exact repro from issue (`slice` on `"some 🤷♂️ string"`) no longer panics 🤖 Generated with [Claude Code](https://claude.com/claude-code)
ce90866 to
f8be84a
Compare
## [1.0.0-rc.10] - 2026-03-18 ### 🚀 Features - add indentExclusionRanges property to MagicString (#8746) by @IWANABETHATGUY - expose `oxcRuntimePlugin` (#8654) by @sapphi-red - rust: make bundler generic over FileSystem for in-memory benchmarks (#8652) by @Boshen ### 🐛 Bug Fixes - rolldown_plugin_vite_dynamic_import_vars: align dynamic import fast check with Vite (#8760) by @shulaoda - renamer: handle existing bindings in nested scopes when finding unique names (#8741) by @drewolson - pass `yarn_pnp` option where needed (#8736) by @sapphi-red - preserve optional chaining in namespace member expr rewrite (#8712) by @Copilot - correct UTF-16 index handling in native MagicString (#8693) by @IWANABETHATGUY - mark failing doctests as ignore (#8700) by @Boshen - prevent may_partial_namespace from leaking through include_module (#8682) by @IWANABETHATGUY - ci: bump native-build cache key to invalidate stale napi-rs artifacts (#8678) by @Boshen - `comments.annotation: false` breaking tree-shaking (#8657) by @IWANABETHATGUY - validate filenames for NUL bytes from chunkFileNames/entryFileNames (#8644) by @IWANABETHATGUY - dce-only minify should not set NODE_ENV to production (#8651) by @IWANABETHATGUY ### 🚜 Refactor - rust: remove dead `CrossModuleOptimizationConfig::side_effects_free_function_optimization` (#8673) by @Dunqing - rust: simplify `cross_module_optimization` by removing redundant scope tracking (#8672) by @Dunqing - simplify string repeat in guess_indentor (#8753) by @IWANABETHATGUY - consolidate custom magic-string tests into one file (#8696) by @IWANABETHATGUY - extract CJS bailout checks from include_symbol (#8683) by @IWANABETHATGUY - rust: remove `BindingIdentifierExt` to use `BindingIdentifier::symbol_id()` instead (#8667) by @Dunqing - bench: add bench_preset helper and inline presets (#8658) by @Boshen - rust: filter external modules from entries instead of mapping bit positions (#8637) by @Dunqing ### 📚 Documentation - clarify watch mode behavior and its limitations (#8751) by @sapphi-red - add external link icon to GitHub button in Hero section (#8731) by @thisisnkc - guide: clarify that `inject` option is only conceptually similar to esbuild's one (#8743) by @sapphi-red - meta/design: add `devtools.md` (#8663) by @hyf0 - add viteplus alpha announcement banner (#8668) by @shulaoda ### ⚡ Performance - rolldown: some minor perf optimization found by autoresearch (#8730) by @Brooooooklyn - replace Vec allocation with lazy iterator in find_hash_placeholders (#8703) by @Boshen - replace TypedDashMap with TypedMap in CustomField (#8708) by @Boshen - bench: remove scan benchmark binary to halve LTO link time (#8694) by @Boshen ### 🧪 Testing - watch: increase timeout for error output (#8766) by @sapphi-red - vite-tests: remove JS plugin tests (#8767) by @sapphi-red - watch: add CLI exit code test (#8752) by @sapphi-red - normalize paths on Windows even if `resolve.symlinks` is false (#8483) by @sapphi-red ### ⚙️ Miscellaneous Tasks - correct comment in bundle-analyzer-plugin.ts (#8770) by @origami-z - upgrade oxc to 0.120.0 (#8764) by @Boshen - enable all test for `reset` category in MagicString.test.ts (#8749) by @IWANABETHATGUY - deps: update test262 submodule for tests (#8742) by @sapphi-red - deps: update oxc apps (#8734) by @renovate[bot] - deps: update softprops/action-gh-release action to v2.6.1 (#8724) by @renovate[bot] - deps: update npm packages (major) (#8722) by @renovate[bot] - deps: update github-actions (major) (#8721) by @renovate[bot] - deps: update softprops/action-gh-release action to v2.6.0 (#8720) by @renovate[bot] - deps: update npm packages (#8718) by @renovate[bot] - deps: update rust crates (#8717) by @renovate[bot] - deps: update github-actions (#8716) by @renovate[bot] - deps: update dependency oxlint-tsgolint to v0.17.0 (#8713) by @renovate[bot] - deps: bump cargo-shear to v1.11.2 (#8711) by @Boshen - use org level `CODE_OF_CONDUCT.md` (#8706) by @sapphi-red - fix cache key mismatch and remove redundant cache saves (#8695) by @Boshen - deps: update oxc apps (#8692) by @renovate[bot] - deps: update oxc apps (#8649) by @renovate[bot] - should do matrix out side of reusable workflows 2 (#8691) by @hyf0 - should do matrix out side of reusable workflows (#8690) by @hyf0 - deps: update dependency rolldown-plugin-dts to v0.22.5 (#8689) by @renovate[bot] - upgrade oxc to 0.119.0 and oxc_resolver to 11.19.1 (#8686) by @Boshen - correct if condition of `type-check` job (#8677) by @hyf0 - Gate CI type-check job on node changes (#8669) by @Copilot - benchmark: improve codspeed build (#8665) by @Boshen - deps: update oxc to v0.118.0 (#8650) by @renovate[bot] - deps: update crate-ci/typos action to v1.44.0 (#8647) by @renovate[bot] - deps: update oxc resolver to v11.19.1 (#8646) by @renovate[bot] - deps: update dependency rust to v1.94.0 (#8648) by @renovate[bot] - deps: update dependency rolldown-plugin-dts to v0.22.4 (#8645) by @renovate[bot] ###◀️ Revert - Revert "ci: Gate CI type-check job on node changes" (#8674) by @hyf0 - "chore(deps): update dependency rust to v1.94.0 (#8648)" (#8660) by @shulaoda ### ❤️ New Contributors * @origami-z made their first contribution in [#8770](#8770) * @drewolson made their first contribution in [#8741](#8741) * @thisisnkc made their first contribution in [#8731](#8731) Co-authored-by: shulaoda <165626830+shulaoda@users.noreply.github.com>

Summary
CharToByteMapperto index by UTF-16 code unit position (matching JS string indices) instead of Rust char position, and accumulate UTF-8 byte offsets instead of UTF-16 lengthssliceindices fall mid-surrogate-pair, emit lone surrogates vianapi_create_string_utf16to match original magic-string behavior exactlyFixes #8685
Test plan
magic-string-unicode.test.tscovers emoji slice/overwrite/remove, CJK, mixed scripts, negative indices, and lone surrogate emissionsliceon"some 🤷♂️ string") no longer panics🤖 Generated with Claude Code