Fix parsing of bogus comments after end tags by AcqRel · Pull Request #507 · servo/html5ever

AcqRel · 2023-08-25T17:02:52Z

This fixes a bug in the tokenizer where the tag name was included in a bogus comment after an appropriate end tag.

For example, this:

<style></style ><!a>

is incorrectly parsed as the following:

<style></style><!--stylea-->

instead of the expected:

<style></style><!--a-->

For this bug to trigger, the end tag needs to be parsed in one of the raw text states (RCDATA, RAWTEXT, or Script data) and have whitespace or a slash after the tag name. I don't know how the contents of the temporary buffer end up inside the comment, but clearing the temporary buffer when exiting the RawEndTagName state seems to be enough to fix it.

jdm

This looks reasonable! Thank you!

Fix parsing of bogus comments after end tags

086160d

jdm approved these changes Aug 26, 2023

View reviewed changes

jdm enabled auto-merge August 26, 2023 14:03

jdm added this pull request to the merge queue Aug 26, 2023

Merged via the queue into servo:master with commit aa11b3b Aug 26, 2023

AcqRel deleted the tokenizer-bugfix branch August 26, 2023 14:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix parsing of bogus comments after end tags#507

Fix parsing of bogus comments after end tags#507
jdm merged 1 commit intoservo:masterfrom
AcqRel:tokenizer-bugfix

AcqRel commented Aug 25, 2023

Uh oh!

jdm left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AcqRel commented Aug 25, 2023

Uh oh!

jdm left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants