fix: normalize implicit cache tags for non-ASCII paths by swarnava · Pull Request #93139 · vercel/next.js

swarnava · 2026-04-22T17:19:20Z

Encode pathname-derived cache tags to ASCII-safe values before serializing x-next-cache-tags headers, and align revalidatePath normalization so invalidation still matches. Add unit and standalone ISR production coverage for unicode slugs to prevent ERR_INVALID_CHAR regressions.

When a cache tag contains a non-ASCII character (Hebrew, CJK, emoji, …) it gets written into the internal `x-next-cache-tags` HTTP header on ISR responses. Node's `validateHeaderValue` rejects any byte outside `\t\x20-\x7e`, so the response crashes with `ERR_INVALID_CHAR`. On Vercel deploys stale-if-error masks the 500 from clients, but revalidation itself keeps failing and the cache stops refreshing for affected routes. This change introduces a single `encodeCacheTag` helper and applies it at every public boundary — `validateTags` (which `cacheTag()`, `unstable_cache()`, and `fetch` tags all funnel through), `getImplicitTags` for path-derived tags, and `revalidatePath` / `revalidateTag` / `updateTag` for invalidation inputs. The encoder matches runs of out-of-class code units so surrogate pairs reach `encodeURIComponent` intact, and it is idempotent on already-encoded `%xx` sequences, so callers can pass either the raw or the encoded form interchangeably. PR #93139 already encodes path-derived tags at construction, but it misses every user-supplied tag entry point and uses a `decodeURIComponent` round-trip that silently mangles literal `%xx` characters in tag values. PR #93167 encodes only at the `setHeader` sites, which leaves storage and invalidation diverging and requires every new write site to remember the encoding step. The canonical-form-at-the-boundary approach taken here covers all entry points and keeps storage, comparison, and the wire in sync. fixes #93142 closes #93139 closes #93167 Co-authored-by: Swarnava Sengupta <swarnava.sengupta@vercel.com> Co-authored-by: Or Nakash <ornakash@gmail.com>

unstubbable mentioned this pull request May 7, 2026

Encode non-ASCII characters in cache tags at construction #93601

Merged

unstubbable closed this in #93601 May 7, 2026

github-actions Bot added the locked label May 22, 2026

github-actions Bot locked as resolved and limited conversation to collaborators May 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: normalize implicit cache tags for non-ASCII paths#93139

fix: normalize implicit cache tags for non-ASCII paths#93139
swarnava wants to merge 1 commit into
canaryfrom
fix/non-ascii-cache-tags-header

swarnava commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

swarnava commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant