Malformed HTML comment closer `-- >` causes unbounded parsing → OOM kill during `mkdocs build`

**Summary**
When a Markdown file contains an HTML comment with a malformed closer (`-- >` instead of `-->`), `mkdocs build` can consume unbounded memory and get killed by the OOM killer. The parser appears to treat the remainder of the file (and potentially subsequent files) as one giant open comment, leading to catastrophic backtracking / token growth.

**Environment**

* MkDocs: 1.6.1
* Python: 3.11.2
* OS: Ubuntu 22.04 (repro’d locally and in GitHub Actions `ubuntu-latest`)
* Theme: occurs even with built-in `mkdocs` theme (no plugins, no extensions)
* Plugins: none (explicitly `plugins: []`)
* Markdown extensions: none (explicitly `markdown_extensions: []`)

> Also reproduced with Material + typical extensions, but **root cause reproduces without any theme/plugins**.

**Minimal Reproducer**
Create these two files:

`mkdocs.yml`

```yaml
site_name: Repro
docs_dir: docs
theme:
  name: mkdocs
plugins: []
markdown_extensions: []
nav:
  - Home: index.md
```

`docs/index.md`

```md
# Hello

` immediately resolves the OOM.

**Why this likely happens (hypothesis)**

* The HTML comment tokenizer/regex expects `-->` and fails to find a terminator when it encounters `-- >`. The parser then treats the remainder as part of the same comment block, effectively creating an enormous single token/buffer. That can trigger catastrophic backtracking or simply allocate until OOM.

**Impact**

* Reliability: a single typo in a doc can take down CI and local builds.
* Security/DoS: a crafted Markdown line containing ``
  ```
* As a preventive CI check:

  ```bash
  grep -RIn "\-\- >" docs && { echo "Malformed HTML comment found"; exit 1; }
  ```

**Proposed Fix (ideas)**

* In the HTML comment parser:

  1. **Fail closed safely:** If a proper `-->` terminator isn’t found on a bounded lookahead, treat the sequence as plain text rather than an open comment block.
  2. **Tolerant close:** Optionally accept `--\s*>` as a terminator (normalize/trimming spaces) to be robust to the specific typo.
  3. **Guardrails:** Impose a maximum comment block length and abort-to-text mode once exceeded, preventing unbounded accumulation/backtracking.
* Add a regression test with the minimal repro above to ensure parsing does not OOM and produces deterministic output (ideally with a warning).

**Versions Tested**

* MkDocs 1.6.1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Malformed HTML comment closer `-- >` causes unbounded parsing → OOM kill during `mkdocs build` #4030

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Malformed HTML comment closer -- > causes unbounded parsing → OOM kill during mkdocs build #4030

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Malformed HTML comment closer `-- >` causes unbounded parsing → OOM kill during `mkdocs build` #4030