perf(html): reduce allocations and speed up the experimental HTML parser by alexander-akait · Pull Request #21152 · webpack/webpack

alexander-akait · 2026-06-09T17:57:32Z

Summary

Behavior-preserving allocation- and CPU-focused optimizations to the experimental HTML parser (the walkHtmlTokens tokenizer and the buildHtmlAst tree builder). The changes: replace the per-token input.slice(...).toLowerCase() end-tag checks in the RCDATA/RAWTEXT/script states with the allocation-free rangeEqualsLower helper; swap inline array-literal .includes() in the insertion-mode handlers for module-level Set lookups (reusing the existing sets) and hoist the repeated "is there an open HTML <template>?" predicate; make the open-stack scope checks (inScope/inButtonScope/inListItemScope/inTableScope/inScopeEl) closure-free; add a findAttr helper to drop per-call Array#find closures; skip the redundant per-text-token framesetOk whitespace scan once the flag is false (and rewrite isAllWs as a charCodeAt loop); and fast-forward the tag-name and attribute-name tokenizer states like the data/value states already do.

On 16 MB inputs this is ~17% faster on indented/pretty-printed markup, ~9% on attribute-heavy markup, and ~2% on a balanced page (where AST construction dominates total time). Retained heap is unchanged — the wins come from removing transient allocations and redundant scans, not from changing AST node shapes.

What kind of change does this PR introduce?

perf

Did you add tests for your changes?

No new tests — these are behavior-preserving performance changes. Correctness is covered by the existing test/walkHtmlTokens.unittest.js (253 tests), test/buildHtmlAst.unittest.js + test/HtmlParser.unittest.js (48 tests), and the full WHATWG test/html5lib.spectest.js conformance corpus (15,161 cases); all pass unchanged.

Does this PR introduce a breaking change?

No.

If relevant, what needs to be documented once your changes are merged or what have you already documented?

n/a

Use of AI

Yes. Implemented with Claude Code: it located the allocation/CPU hotspots, made the edits, and verified them against the existing unit tests, the html5lib conformance suite, and before/after benchmarks. Reviewed before submitting.

Generated by Claude Code

Replace the per-check input.slice(...).toLowerCase() in the RCDATA / RAWTEXT / SCRIPT_DATA / SCRIPT_DATA_ESCAPED end-tag-name states with the existing allocation-free rangeEqualsLower helper, and add an exact-match fast path to decodeHtmlEntities so the common single-entity case skips a full-length prefix slice. https://claude.ai/code/session_01RbPceANkJXa5R9WQWCfH6q

Replace inline array-literal `.includes()` checks in the per-token insertion-mode handlers with module-level `Set.has()` lookups (reusing TABLE_CONTEXT / HEAD_ELEMENTS and adding a few small sets), and hoist the repeated open-stack "is there an HTML <template>?" predicate so the eight `open.some(...)` calls share one function instead of allocating an arrow each time. https://claude.ai/code/session_01RbPceANkJXa5R9WQWCfH6q

The have-an-element-in-scope helpers (inScope / inButtonScope / inListItemScope / inTableScope / inScopeEl) passed one or two arrow predicates into a shared matcher, allocating closures on every call -- and these run several times per body tag. Rewrite them to walk the open stack directly with the boundary kind selected by a small int constant, and add a hoisted findAttr helper to replace the per-call `Array#find` closures used for the <input> type and annotation-xml encoding lookups. https://claude.ai/code/session_01RbPceANkJXa5R9WQWCfH6q

`framesetOk` only ever transitions true→false, so once it is false the per-character-token `isAllWs` check in "in body" is wasted work -- guard it with `framesetOk &&` so the scan stops running after the flag flips (which happens very early in real documents). Also rewrite `isAllWs` with a charCodeAt loop, dropping the `for…of` code-point iterator, per-char string, and Set lookup; this speeds up the remaining whitespace checks in the head/table/after-body modes too. https://claude.ai/code/session_01RbPceANkJXa5R9WQWCfH6q

The tag-name and attribute-name tokenizer states stepped one character per outer-state-machine iteration, unlike the data / RAWTEXT / attribute-value states which fast-forward the ordinary run in a tight inner loop. Add the same inner loop to both states so a run of ordinary name characters is consumed without re-entering the big state switch each character; the loop stops on every terminator and on the chars that need a per-occurrence parse error, which the outer switch re-handles, so behavior is unchanged. https://claude.ai/code/session_01RbPceANkJXa5R9WQWCfH6q

changeset-bot · 2026-06-09T17:57:41Z

🦋 Changeset detected

Latest commit: 81e37d6

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
webpack	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

github-actions · 2026-06-09T17:59:04Z

This PR is packaged and the instant preview is available (d323aee).

Install it locally:

npm

npm i -D webpack@https://pkg.pr.new/webpack@d323aee

yarn

yarn add -D webpack@https://pkg.pr.new/webpack@d323aee

pnpm

pnpm add -D webpack@https://pkg.pr.new/webpack@d323aee

codecov · 2026-06-09T18:01:03Z

Codecov Report

❌ Patch coverage is 97.95918% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 92.33%. Comparing base (87394de) to head (81e37d6).
⚠️ Report is 4 commits behind head on main.

Files with missing lines	Patch %	Lines
lib/html/buildHtmlAst.js	97.14%	2 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main   #21152   +/-   ##
=======================================
  Coverage   92.32%   92.33%           
=======================================
  Files         581      581           
  Lines       63288    63349   +61     
  Branches    17507    17518   +11     
=======================================
+ Hits        58431    58491   +60     
- Misses       4857     4858    +1

Flag	Coverage Δ
css-parsing	`28.64% <ø> (-0.02%)`	⬇️
html5lib	`31.05% <97.95%> (+0.02%)`	⬆️
integration	`88.51% <78.57%> (+0.01%)`	⬆️
test262	`45.36% <ø> (+0.04%)`	⬆️
unit	`41.13% <81.63%> (+0.04%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

github-actions · 2026-06-09T18:15:19Z

Types Coverage

Coverage after merging perf/html-parser-perf into main will be

99.33%

Coverage Report

File	Stmts	Branches	Funcs	Lines	Uncovered Lines
bin
webpack.js	98.77%	100%	100%	98.77%	91
examples
build-common.js	100%	100%	100%	100%
buildAll.js	100%	100%	100%	100%
examples.js	100%	100%	100%	100%
template-common.js	98.21%	100%	100%	98.21%	72
examples/custom-javascript-parser
test.filter.js	100%	100%	100%	100%
examples/custom-javascript-parser/internals
acorn-parse.js	100%	100%	100%	100%
meriyah-parse.js	100%	100%	100%	100%
oxc-parse.js	91.30%	100%	100%	91.30%	140, 142–143, 145, 147, 153–154, 161, 168, 90
examples/markdown
webpack.config.mjs	100%	100%	100%	100%
examples/typescript
test.filter.js	100%	100%	100%	100%
examples/typescript-non-erasable
test.filter.js	50%	100%	100%	50%	5
examples/virtual-modules
test.filter.js	100%	100%	100%	100%
examples/wasm-bindgen-esm
test.filter.js	100%	100%	100%	100%
examples/wasm-complex
test.filter.js	100%	100%	100%	100%
examples/wasm-simple
test.filter.js	100%	100%	100%	100%
examples/wasm-simple-source-phase
test.filter.js	100%	100%	100%	100%
lib
APIPlugin.js	100%	100%	100%	100%
AsyncDependenciesBlock.js	100%	100%	100%	100%
AutomaticPrefetchPlugin.js	100%	100%	100%	100%
BannerPlugin.js	100%	100%	100%	100%
Cache.js	98.21%	100%	100%	98.21%	101
CacheFacade.js	100%	100%	100%	100%
Chunk.js	99.72%	100%	100%	99.72%	39
ChunkGraph.js	100%	100%	100%	100%
ChunkGroup.js	100%	100%	100%	100%
ChunkTemplate.js	100%	100%	100%	100%
CleanPlugin.js	99.15%	100%	100%	99.15%	206, 226
CodeGenerationResults.js	100%	100%	100%	100%
CompatibilityPlugin.js	100%	100%	100%	100%
Compilation.js	98.49%	100%	100%	98.49%	1577, 1873, 1880, 1888, 1910, 2806, 3249, 3924, 3954, 4007–4008, 4012, 4017, 4033–4034, 4048–4049, 4054–4055, 4532, 4558, 512, 517, 5366, 5398, 5415, 5431, 5447, 5462, 5487–5488, 5490, 5818, 5823, 5829, 5832, 5844, 5846, 5850, 5866, 5881, 5913, 5967, 5991, 6105, 731–732
Compiler.js	99.56%	100%	100%	99.56%	1135–1136, 1144
ConcatenationScope.js	98.59%	100%	100%	98.59%	189
ConditionalInitFragment.js	100%	100%	100%	100%
ConstPlugin.js	100%	100%	100%	100%
ContextExclusionPlugin.js	100%	100%	100%	100%
ContextModule.js	100%	100%	100%	100%
ContextModuleFactory.js	97.40%	100%	100%	97.40%	258, 395, 418, 420, 424, 433–434
ContextReplacementPlugin.js	100%	100%	100%	100%
DefinePlugin.js	99%	100%	100%	99%	170–171, 187, 206, 280
DependenciesBlock.js	100%	100%	100%	100%
Dependency.js	98.15%	100%	100%	98.15%	379, 425
DependencyTemplate.js	100%	100%	100%	100%
DependencyTemplates.js	100%	100%	100%	100%
DotenvPlugin.js	98.41%	100%	100%	98.41%	378, 391–392
DynamicEntryPlugin.js	100%	100%	100%	100%
EntryOptionPlugin.js	100%	100%	100%	100%
EntryPlugin.js	100%	100%	100%	100%
Entrypoint.js	100%	100%	100%	100%
EnvironmentPlugin.js	97.14%	100%	100%	97.14%	49
ErrorHelpers.js	100%	100%	100%	100%
EvalDevToolModulePlugin.js	100%	100%	100%	100%
EvalSourceMapDevToolPlugin.js	100%	100%	100%	100%
ExportsInfo.js	100%	100%	100%	100%
ExportsInfoApiPlugin.js	100%	100%	100%	100%
ExternalModule.js	98.97%	100%	100%	98.97%	425–429, 577
ExternalModuleFactoryPlugin.js	100%	100%	100%	100%
ExternalsPlugin.js	100%	100%	100%	100%
FileSystemInfo.js	99.50%	100%	100%	99.50%	182, 2252–2253, 2256, 2267, 2278, 2289, 278, 3693, 3708, 3732
FlagAllModulesAsUsedPlugin.js	100%	100%	100%	100%
FlagDependencyExportsPlugin.js	98.85%	100%	100%	98.85%	434, 436, 440
FlagDependencyUsagePlugin.js	100%	100%	100%	100%
FlagEntryExportAsUsedPlugin.js	100%	100%	100%	100%
Generator.js	100%	100%	100%	100%
HotModuleReplacementPlugin.js	100%	100%	100%	100%
HotUpdateChunk.js	100%	100%	100%	100%
IgnorePlugin.js	100%	100%	100%	100%
IgnoreWarningsPlugin.js	100%	100%	100%	100%
InitFragment.js	100%	100%	100%	100%
JavascriptMetaInfoPlugin.js	100%	100%	100%	100%
LibraryTemplatePlugin.js	100%	100%	100%	100%
LoaderOptionsPlugin.js	100%	100%	100%	100%
LoaderTargetPlugin.js	100%	100%	100%	100%
MainTemplate.js	100%	100%	100%	100%
ManifestPlugin.js	100%	100%	100%	100%
Module.js	98.50%	100%	100%	98.50%	1311, 1316, 1376, 1390, 1452, 1461
ModuleFactory.js	100%	100%	100%	100%
ModuleFilenameHelpers.js	98.85%	100%	100%	98.85%	106, 108
ModuleGraph.js	99.73%	100%	100%	99.73%	1005
ModuleGraphConnection.js	100%	100%	100%	100%
ModuleInfoHeaderPlugin.js	100%	100%	100%	100%
ModuleNotFoundError.js	100%	100%	100%	100%
ModuleProfile.js	100%	100%	100%	100%
ModuleSourceTypeConstants.js	100%	100%	100%	100%
ModuleTemplate.js	100%	100%	100%	100%
ModuleTypeConstants.js	100%	100%	100%	100%
MultiCompiler.js	99.69%	100%	100%	99.69%	659
MultiStats.js	100%	100%	100%	100%
MultiWatching.js	100%	100%	100%	100%
NoEmitOnErrorsPlugin.js	100%	100%	100%	100%
NodeStuffPlugin.js	100%	100%	100%	100%
NormalModule.js	97.90%	100%	100%	97.90%	1219, 1222, 1239, 1256, 1503, 1537, 1553, 1640, 1994, 2292, 2297–2307, 417, 421, 575
NormalModuleFactory.js	99.47%	100%	100%	99.47%	1083, 1392, 486, 498
NormalModuleReplacementPlugin.js	100%	100%	100%	100%
NullFactory.js	100%	100%	100%	100%
OptimizationStages.js	100%	100%	100%	100%
OptionsApply.js	100%	100%	100%	100%
Parser.js	100%	100%	100%	100%
PlatformPlugin.js	100%	100%	100%	100%
PrefetchPlugin.js	100%	100%	100%	100%
ProgressPlugin.js	98.85%	100%	100%	98.85%	519–520, 525, 527, 591
ProvidePlugin.js	100%	100%	100%	100%
RawModule.js	100%	100%	100%	100%
RecordIdsPlugin.js	100%	100%	100%	100%
RequestShortener.js	100%	100%	100%	100%
ResolverFactory.js	100%	100%	100%	100%
RuntimeGlobals.js	100%	100%	100%	100%
RuntimeModule.js	100%	100%	100%	100%
RuntimePlugin.js	100%	100%	100%	100%
RuntimeTemplate.js	100%	100%	100%	100%
SelfModuleFactory.js	100%	100%	100%	100%
SingleEntryPlugin.js	100%	100%	100%	100%
SourceMapDevToolModuleOptionsPlugin.js	100%	100%	100%	100%
SourceMapDevToolPlugin.js	98.62%	100%	100%	98.62%	220, 224, 226, 419, 430, 891
Stats.js	100%	100%	100%	100%
Template.js	100%	100%	100%	100%
TemplatedPathPlugin.js	99.13%	100%	100%	99.13%	176–177
UseStrictPlugin.js	100%	100%	100%	100%
WarnCaseSensitiveModulesPlugin.js	100%	100%	100%	100%
WarnDeprecatedOptionPlugin.js	100%	100%	100%	100%
WarnNoModeSetPlugin.js	100%	100%	100%	100%
WatchIgnorePlugin.js	100%	100%	100%	100%
Watching.js	100%	100%	100%	100%
WebpackError.js	100%	100%	100%	100%
WebpackIsIncludedPlugin.js	100%	100%	100%	100%
WebpackOptionsApply.js	100%	100%	100%	100%
WebpackOptionsDefaulter.js	100%	100%	100%	100%
buildChunkGraph.js	99.87%	100%	100%	99.87%	326
cli.js	98.62%	100%	100%	98.62%	10, 119, 545, 577, 627, 897
index.js	99.72%	100%	100%	99.72%	165
validateSchema.js	94.67%	100%	100%	94.67%	100, 87, 89, 98
webpack.js	96.33%	100%	100%	96.33%	10, 198, 220, 222
lib/asset
AssetBytesGenerator.js	100%	100%	100%	100%
AssetBytesParser.js	100%	100%	100%	100%
AssetGenerator.js	100%	100%	100%	100%
AssetModulesPlugin.js	97.32%	100%	100%	97.32%	283, 307, 310, 36, 362, 41
AssetParser.js	100%	100%	100%	100%
AssetSourceGenerator.js	100%	100%	100%	100%
AssetSourceParser.js	100%	100%	100%	100%
RawDataUrlModule.js	100%	100%	100%	100%
lib/async-modules
AsyncModuleHelpers.js	100%	100%	100%	100%
AwaitDependenciesInitFragment.js	100%	100%	100%	100%
InferAsyncModulesPlugin.js	100%	100%	100%	100%
lib/cache
AddBuildDependenciesPlugin.js	100%	100%	100%	100%
AddManagedPathsPlugin.js	100%	100%	100%	100%
IdleFileCachePlugin.js	97.92%	100%	100%	97.92%	71, 83, 91
MemoryCachePlugin.js	95.83%	100%	100%	95.83%	33
MemoryWithGcCachePlugin.js	93.15%	100%	100%	93.15%	106, 113–114, 122, 89
PackFileCacheStrategy.js	96.40%	100%	100%	96.40%	1250, 1350, 1354, 1416, 628, 647, 657–659, 661, 677–678, 683, 686, 688, 693, 698, 722, 728, 762, 768, 774, 779, 790, 799, 804–805, 807, 824, 830–831, 833
ResolverCachePlugin.js	100%	100%	100%	100%
getLazyHashedEtag.js	100%	100%	100%	100%
mergeEtags.js	100%	100%	100%	100%
lib/config
browserslistTargetHandler.js	100%	100%	100%	100%
defaults.js	99.30%	100%	100%	99.30%	1429–1431, 1439, 274, 277, 282, 286
normalization.js	99.01%	100%	100%	99.01%	191–192, 258, 273
target.js	100%	100%	100%	100%
lib/container
ContainerEntryDependency.js	100%	100%	100%	100%
ContainerEntryModule.js	100%	100%	100%	100%
ContainerEntryModuleFactory.js	100%	100%	100%	100%
ContainerExposedDependency.js	100%	100%	100%	100%
ContainerPlugin.js	100%	100%	100%	100%
ContainerReferencePlugin.js	100%	100%	100%	100%

codspeed-hq · 2026-06-09T18:39:47Z

Merging this PR will improve performance by 37.56%

⚠️

Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.

Open the report in CodSpeed to investigate

⚡ 2 improved benchmarks
❌ 1 regressed benchmark
✅ 141 untouched benchmarks

Warning

Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

	Mode	Benchmark	`BASE`	`HEAD`	Efficiency
❌	Memory	`benchmark "asset-modules-bytes", scenario '{"name":"mode-development-rebuild","mode":"development","watch":true}'`	246.7 KB	859.1 KB	-71.28%
⚡	Memory	`benchmark "lodash", scenario '{"name":"mode-development-rebuild","mode":"development","watch":true}'`	858.6 KB	126.6 KB	×6.8
⚡	Memory	`benchmark "many-chunks-esm", scenario '{"name":"mode-production","mode":"production"}'`	10 MB	7.5 MB	+33.63%

Tip

Investigate this regression by commenting @codspeedbot fix this regression on this PR, or directly use the CodSpeed MCP with your agent.

_{Comparing perf/html-parser-perf (81e37d6) with main (9bd0b91)}

alexander-akait added 5 commits June 9, 2026 15:38

alexander-akait merged commit d323aee into main Jun 9, 2026
126 of 127 checks passed

alexander-akait deleted the perf/html-parser-perf branch June 9, 2026 19:20

github-actions Bot mentioned this pull request Jun 9, 2026

chore(release): new release #21037

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

perf(html): reduce allocations and speed up the experimental HTML parser#21152

perf(html): reduce allocations and speed up the experimental HTML parser#21152
alexander-akait merged 5 commits into
mainfrom
perf/html-parser-perf

alexander-akait commented Jun 9, 2026

Uh oh!

changeset-bot Bot commented Jun 9, 2026

Uh oh!

github-actions Bot commented Jun 9, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Jun 9, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 9, 2026

Uh oh!

codspeed-hq Bot commented Jun 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Uh oh!

Conversation

alexander-akait commented Jun 9, 2026

Uh oh!

changeset-bot Bot commented Jun 9, 2026

🦋 Changeset detected

Uh oh!

github-actions Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions Bot commented Jun 9, 2026

Types Coverage

Uh oh!

codspeed-hq Bot commented Jun 9, 2026

Merging this PR will improve performance by 37.56%

Performance Changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions Bot commented Jun 9, 2026 •

edited

Loading

codecov Bot commented Jun 9, 2026 •

edited

Loading