[lexical-markdown] Bug Fix: Replace regex-based format matching with … by kimseongyu · Pull Request #8093 · facebook/lexical

kimseongyu · 2026-01-25T12:12:47Z

Description

Current Behavior

Currently, Markdown text is converted into Lexical nodes using regex. As reported in issue #8073, this approach fails to correctly identify nested or complex delimiter pairs.

For example, in *text**text***, the opening * should pair with its corresponding closing *. Because the regex searches for a standalone closing *, it cannot find the correct match.

Changes in This PR

To resolve these inconsistencies, I have implemented the CommonMark Delimiter Algorithm for processing emphasis and strong emphasis.

The algorithm proceeds as follows:

1. Scan text to build a delimiter stack with canOpen/canClose properties
2. Process delimiters to find matching pairs using flanking rules and the rule of 3
3. Return the outermost matched emphasis
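
The three steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code merged in this PR: it handles only `*` (not `_`), uses ASCII punctuation instead of Unicode punctuation, and every name in it (`Delimiter`, `EmphasisMatch`, `scanDelimiters`, `blockedByRuleOf3`, `findOutermostEmphasis`) is hypothetical.

```typescript
// Hypothetical types for illustration; these are not Lexical's internal types.
interface Delimiter {
  index: number; // position of the run's first character
  length: number; // delimiter characters not yet consumed
  originalLength: number;
  canOpen: boolean;
  canClose: boolean;
}

interface EmphasisMatch {
  start: number; // index of the first opening delimiter character
  end: number; // index just past the last closing delimiter character
  strong: boolean; // true for a ** pair, false for a * pair
}

const isWhitespace = (ch: string): boolean => /\s/.test(ch);
// Simplification: ASCII punctuation only (CommonMark uses Unicode punctuation).
const isPunctuation = (ch: string): boolean => /[!-\/:-@\[-`{-~]/.test(ch);

// Step 1: collect runs of '*' and classify them with the flanking rules.
function scanDelimiters(text: string): Delimiter[] {
  const delimiters: Delimiter[] = [];
  let i = 0;
  while (i < text.length) {
    if (text.charAt(i) !== '*') {
      i++;
      continue;
    }
    let end = i;
    while (end < text.length && text.charAt(end) === '*') end++;
    // The start and end of the text count as whitespace.
    const before = i > 0 ? text.charAt(i - 1) : ' ';
    const after = end < text.length ? text.charAt(end) : ' ';
    const leftFlanking =
      !isWhitespace(after) &&
      (!isPunctuation(after) || isWhitespace(before) || isPunctuation(before));
    const rightFlanking =
      !isWhitespace(before) &&
      (!isPunctuation(before) || isWhitespace(after) || isPunctuation(after));
    delimiters.push({
      index: i,
      length: end - i,
      originalLength: end - i,
      canOpen: leftFlanking,
      canClose: rightFlanking,
    });
    i = end;
  }
  return delimiters;
}

// Rule of 3: if either run can both open and close, the pair is rejected when
// the sum of the original lengths is a multiple of 3, unless both lengths are.
function blockedByRuleOf3(opener: Delimiter, closer: Delimiter): boolean {
  return (
    (opener.canClose || closer.canOpen) &&
    closer.originalLength % 3 !== 0 &&
    (opener.originalLength + closer.originalLength) % 3 === 0
  );
}

// Steps 2 and 3: pair each closer with the nearest eligible opener, then
// return the outermost pair.
function findOutermostEmphasis(text: string): EmphasisMatch | null {
  const delimiters = scanDelimiters(text);
  const matches: EmphasisMatch[] = [];
  delimiters.forEach((closer, closerIdx) => {
    if (!closer.canClose) return;
    while (closer.length > 0) {
      // Nearest preceding run that can still open and passes the rule of 3.
      const opener = delimiters
        .slice(0, closerIdx)
        .reverse()
        .filter(d => d.canOpen && d.length > 0 && !blockedByRuleOf3(d, closer))[0];
      if (!opener) break;
      // Two characters on both sides form strong emphasis, otherwise regular.
      const used = opener.length >= 2 && closer.length >= 2 ? 2 : 1;
      const closerStart = closer.index + closer.originalLength - closer.length;
      matches.push({
        start: opener.index + opener.length - used,
        end: closerStart + used,
        strong: used === 2,
      });
      opener.length -= used;
      closer.length -= used;
      // Runs between the matched pair can no longer take part in a match.
      delimiters
        .slice(delimiters.indexOf(opener) + 1, closerIdx)
        .forEach(d => {
          d.length = 0;
        });
    }
  });
  // Outermost: earliest start, and the widest span among ties.
  matches.sort((a, b) => a.start - b.start || b.end - a.end);
  return matches[0] ?? null;
}
```

For the example from the description, `findOutermostEmphasis('*text**text***')` returns `{start: 0, end: 14, strong: false}`: the leading `*` pairs with the last `*` of the closing run, and the inner `**...**` pair is left for a later pass over the inner text.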

Additionally, this PR fixes the issue where formats inside code spans were incorrectly processed. Some issues remain, however: inline elements other than code spans (e.g., links, raw HTML) are handled by text match transformers, which makes them difficult to address with the current implementation. A new conversion approach may be needed to fully resolve these cases.

As a result, all tests pass. One previously incorrect test was fixed, and 5 new test cases were added.

Closes #8073

Test plan

Before

before.mov

After

after.mov

…CommonMark delimiter algorithm

Previously, the outer format detection relied on regex patterns to find matched formats. However, regex cannot cover all Markdown specification edge cases. This change implements the CommonMark delimiter algorithm to properly handle emphasis parsing:

1. Scan text to build a delimiter stack with canOpen/canClose properties
2. Process delimiters to find matching pairs using flanking rules and the rule of 3
3. Return the outermost matched emphasis

vercel · 2026-01-25T12:12:52Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Review	Updated (UTC)
lexical	Ready	Preview, Comment	Jan 26, 2026 11:39am
lexical-playground	Ready	Preview, Comment	Jan 26, 2026 11:39am

etrepum

I didn't carefully review the algorithm, but this does seem like a positive change! Do you have any thoughts on this before going forward @AlessioGr?

packages/lexical-markdown/src/importTextFormatTransformer.ts

packages/lexical-markdown/src/__tests__/unit/LexicalMarkdown.test.ts

AlessioGr · 2026-01-25T19:28:17Z

I think this is a good change!

kimseongyu · 2026-01-26T11:50:27Z

Thank you! I've addressed the feedback in the latest commit. I also fixed the case with *a `*` b `x`*.

Previously, the delimiter stack included delimiters inside code spans, causing incorrect matching. Now, I exclude code span ranges from delimiter scanning to ensure correct behavior.
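
One way to picture the code span exclusion is the sketch below. It is an assumption about the general approach, not the code in this PR: `findCodeSpanRanges` and `asteriskPositionsOutsideCode` are hypothetical names, and the matching rule used (a backtick run closes only on a later run of the same length) follows CommonMark's definition of code spans.

```typescript
// Find [start, end) ranges of code spans. An opening backtick run is closed
// by the next backtick run of exactly the same length.
function findCodeSpanRanges(text: string): Array<[number, number]> {
  const ranges: Array<[number, number]> = [];
  let i = 0;
  while (i < text.length) {
    if (text.charAt(i) !== '`') {
      i++;
      continue;
    }
    let openEnd = i;
    while (openEnd < text.length && text.charAt(openEnd) === '`') openEnd++;
    const runLength = openEnd - i;

    // Search for a closing run with the same number of backticks.
    let closeEnd = -1;
    let j = openEnd;
    while (j < text.length) {
      if (text.charAt(j) !== '`') {
        j++;
        continue;
      }
      let k = j;
      while (k < text.length && text.charAt(k) === '`') k++;
      if (k - j === runLength) {
        closeEnd = k;
        break;
      }
      j = k;
    }

    if (closeEnd === -1) {
      // No closing run: the backticks are literal text.
      i = openEnd;
    } else {
      ranges.push([i, closeEnd]);
      i = closeEnd;
    }
  }
  return ranges;
}

// Report only the '*' characters that a delimiter scan should consider,
// skipping any position that falls inside a code span range.
function asteriskPositionsOutsideCode(text: string): number[] {
  const ranges = findCodeSpanRanges(text);
  const positions: number[] = [];
  for (let i = 0; i < text.length; i++) {
    const insideCode = ranges.some(([start, end]) => i >= start && i < end);
    if (text.charAt(i) === '*' && !insideCode) positions.push(i);
  }
  return positions;
}
```

For the case mentioned above, `asteriskPositionsOutsideCode('*a `*` b `x`*')` returns `[0, 12]`, so only the first and last `*` would enter the delimiter stack and the `*` inside the first code span is ignored.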

playground

kimseongyu requested review from acywatson, etrepum, fantactuka, ivailop7, potatowagon, takuyakanbr and zurfyx as code owners January 25, 2026 12:12

meta-cla bot added the CLA Signed label Jan 25, 2026

vercel bot deployed to Preview – lexical January 25, 2026 12:13 View deployment

vercel bot deployed to Preview – lexical-playground January 25, 2026 12:14 View deployment

etrepum approved these changes Jan 25, 2026

packages/lexical-markdown/src/importTextFormatTransformer.ts (outdated, resolved)

packages/lexical-markdown/src/importTextFormatTransformer.ts (outdated, resolved)

etrepum added the extended-tests label Jan 25, 2026

AlessioGr reviewed Jan 25, 2026

packages/lexical-markdown/src/__tests__/unit/LexicalMarkdown.test.ts (outdated, resolved)

vercel bot deployed to Preview – lexical January 26, 2026 10:05 View deployment

vercel bot deployed to Preview – lexical-playground January 26, 2026 10:07 View deployment

[lexical-markdown] Fix: reset regex lastIndex

04ae8ba

kimseongyu force-pushed the fix-incorrect-tag-conversion-from-markdown-to-lexical branch from 0093256 to 04ae8ba Compare January 26, 2026 10:40

vercel bot deployed to Preview – lexical January 26, 2026 10:41 View deployment

vercel bot deployed to Preview – lexical-playground January 26, 2026 10:42 View deployment

[lexical-markdown] Fix: Ignore delimiters inside code spans

e39e025

vercel bot deployed to Preview – lexical January 26, 2026 11:38 View deployment

vercel bot deployed to Preview – lexical-playground January 26, 2026 11:39 View deployment

kimseongyu requested review from AlessioGr and etrepum January 26, 2026 15:55

etrepum approved these changes Jan 27, 2026

ivailop7 approved these changes Jan 27, 2026

etrepum added this pull request to the merge queue Jan 27, 2026

Merged via the queue into facebook:main with commit f1e4f66 Jan 27, 2026
42 checks passed

kimseongyu deleted the fix-incorrect-tag-conversion-from-markdown-to-lexical branch January 27, 2026 20:38

etrepum mentioned this pull request Jan 31, 2026

v0.40.0 #8104

Merged

etrepum merged 3 commits into facebook:main from kimseongyu:fix-incorrect-tag-conversion-from-markdown-to-lexical

4 participants
