Update/block support settings use tag processor by dmsnell · Pull Request #46625 · WordPress/gutenberg

dmsnell · 2022-12-16T20:50:52Z

What?

Use WP_HTML_Tag_Processor to add new class name to wrapping elements when rendering block supports.

Why?

This class was built to quickly and reliably modify HTML tag attributes. It circumvents specific problems, such as matching on the wrong attributes (such as data-custom-class="some value"), overlooking matches (such as class=blue or class='wp-block-group'), writing updates in a way that get overlooked by the browser (by writing to the end of the tag instead of before any potential duplicate attributes), and by writing invalid content to the HTML (such as through a bug in a PCRE pattern greedily matching more than it should).

A side perk here is that application code becomes more focused on the semantic operations it's performing rather than the mechanisms through which it does it.

How?

Utilizes the API provided by the Tag Processor to do the heavy lifting for us.

Testing

Hopefully the unit tests cover this.
Otherwise try to use block supports and make sure that the expected classes appear where they are and not where they shouldn't be.
I'm not that familiar with how this system works so I hope you can help figure out what needs to be tested.

adamziel

I love how simpler is the final code! Way to go ❤️

…pper. When we introduced #42124 new block supports behavior we did so with a PCRE replacement that opened the possibility for a few bugs related to processing the HTML attributes. It was noted in that PR that this would be a good candidate for the `WP_HTML_Tag_Processor`. In this patch we're performing that replacement as follow-up work. This should improve the reliability and hopefully the readability of what is being done to the HTML as it renders.

github-actions · 2023-01-12T00:12:31Z

Flaky tests detected in b187df8.
Some tests passed with failed attempts. The failures may not be related to this commit but are still reported for visibility. See the documentation for more information.

🔍 Workflow run URL: https://github.com/WordPress/gutenberg/actions/runs/3897623939
📝 Reported issues:

[Flaky Test] should always expand single line selection #39787 in specs/editor/various/multi-block-selection.test.js

This commit pulls in the HTML Tag Processor from the Gutenbeg repository. The Tag Processor attempts to be an HTML5-spec-compliant parser that provides the ability in PHP to find specific HTML tags and then add, remove, or update attributes on that tag. It provides a safe and reliable way to modify the attribute on HTML tags. ```php // Add missing `rel` attribute to links. $p = new WP_HTML_Tag_Processor( $block_content ); if ( $p->next_tag( 'A' ) && empty( $p->get_attribute( 'rel' ) ) ) { $p->set_attribute( 'noopener nofollow' ); } return $p->get_updated_html(); ``` Introduced originally in WordPress/gutenberg#42485 and developed within the Gutenberg repository, this HTML parsing system was built in order to address a persistent need (properly modifying HTML tag attributes) and was motivated after a sequence of block editor defects which stemmed from mismatches between actual HTML code and expectectations for HTML input running through existing naive string-search-based solutions. The Tag Processor is intended to operate fast enough to avoid being an obstacle on page render while using as little memory overhead as possible. It is practically a zero-memory-overhead system, and only allocates memory as changes to the input HTML document are enqueued, releasing that memory when flushing those changes to the document, moving on to find the next tag, or flushing its entire output via `get_updated_html()`. Rigor has been taken to ensure that the Tag Processor will not be consfused by unexpected or non-normative HTML input, including issues arising from quoting, from different syntax rules within `<title>`, `<textarea>`, and `<script>` tags, from the appearance of rare but legitimate comment and XML-like regions, and from a variety of syntax abnormalities such as unbalanced tags, incomplete syntax, and overlapping tags. The Tag Processor is constrained to parsing an HTML document as a stream of tokens. It will not build an HTML tree or generate a DOM representation of a document. It is designed to start at the beginning of an HTML document and linearly scan through it, potentially modifying that document as it scans. It has no access to the markup inside or around tags and it has no ability to determine which tag openers and tag closers belong to each other, or determine the nesting depth of a given tag. It includes a primitive bookmarking system to remember tags it has previously visited. These bookmarks refer to specific tags, not to string offsets, and continue to point to the same place in the document as edits are applied. By asking the Tag Processor to seek to a given bookmark it's possible to back up and continue processsing again content that has already been traversed. Attribute values are sanitized with `esc_attr()` and rendered as double-quoted attributes. On read they are unescaped and unquoted. Authors wishing to rely on the Tag Processor therefore are free to pass around data as normal strings. Convenience methods for adding and removing CSS class names exist in order to remove the need to process the `class` attribute. ```php // Update heading block class names $p = new WP_HTML_Tag_Processor( $html ); while ( $p->next_tag() ) { switch ( $p->get_tag() ) { case 'H1': case 'H2': case 'H3': case 'H4': case 'H5': case 'H6': $p->remove_class( 'wp-heading' ); $p->add_class( 'wp-block-heading' ); break; } return $p->get_updated_html(); ``` The Tag Processor is intended to be a reliable low-level library for traversing HTML documents and higher-level APIs are to be built upon it. Immediately, and in Core Gutenberg blocks it is meant to replace HTML modification that currently relies on RegExp patterns and simpler string replacements. See the following for examples of such replacement: WordPress/gutenberg@1315784 https://github.com/WordPress/gutenberg/pull/45469/files#diff-dcd9e1f9b87ca63efe9f1e834b4d3048778d3eca41aa39c636f8b16a5bb452d2L46 WordPress/gutenberg#46625 Co-Authored-By: Adam Zielinski <adam@adamziel.com> Co-Authored-By: Bernie Reiter <ockham@raz.or.at> Co-Authored-By: Grzegorz Ziolkowski <grzegorz@gziolo.pl>

Porting part of WordPress/gutenberg#46625 Replace use of fragile `preg_match` with Tag Processor when adding an element class name to its wrapper.

dmsnell requested review from adamziel, georgeh and jorgefilipecosta December 16, 2022 20:50

dmsnell requested a review from spacedmonkey as a code owner December 16, 2022 20:50

adamziel approved these changes Dec 22, 2022

View reviewed changes

dmsnell force-pushed the update/block-support-settings-use-tag-processor branch from 7e7b3b8 to 2f40651 Compare January 11, 2023 23:22

dmsnell force-pushed the update/block-support-settings-use-tag-processor branch from 2f40651 to b187df8 Compare January 11, 2023 23:27

dmsnell merged commit b187df8 into trunk Jan 12, 2023

dmsnell deleted the update/block-support-settings-use-tag-processor branch January 12, 2023 00:09

Mamaduka mentioned this pull request Jan 17, 2023

Plugin: Backport PHP changes for WordPress 6.2 release #47187

Closed

85 tasks

ockham mentioned this pull request Jan 23, 2023

Revert "Block Settings/Support: Use Tag Processor to inject class name on wrapper." #47350

Merged

dmsnell mentioned this pull request Jan 26, 2023

Editor: Introduce HTML Tag Processor WordPress/wordpress-develop#3920

Closed

dmsnell mentioned this pull request Feb 1, 2023

Block Supports: Use Tag Processor for adding class-name to wrapper dmsnell/wordpress-develop#1

Closed

ntsekouras mentioned this pull request Feb 6, 2023

Update wp_render_elements_support to use html API WordPress/wordpress-develop#4007

Closed

jorgefilipecosta mentioned this pull request Feb 6, 2023

Update: Backport block settings to core. WordPress/wordpress-develop#4013

Closed

ajlende mentioned this pull request Mar 21, 2023

Replace regex with tag processor for duotone class render #49212

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update/block support settings use tag processor#46625

Update/block support settings use tag processor#46625
dmsnell merged 1 commit intotrunkfrom
update/block-support-settings-use-tag-processor

dmsnell commented Dec 16, 2022

Uh oh!

adamziel left a comment

Uh oh!

github-actions bot commented Jan 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dmsnell commented Dec 16, 2022

What?

Why?

How?

Testing

Uh oh!

adamziel left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jan 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants