Skip to content

fix(markdown): preserve unicode export paths#8

Merged
vincentkoc merged 1 commit into
mainfrom
fix/unicode-markdown-paths
Apr 27, 2026
Merged

fix(markdown): preserve unicode export paths#8
vincentkoc merged 1 commit into
mainfrom
fix/unicode-markdown-paths

Conversation

@vincentkoc

Copy link
Copy Markdown
Member

Summary

  • preserve Unicode letters, CJK text, and emoji in Markdown path slugs
  • continue replacing path separators and unsafe punctuation with dashes
  • make exported page filename suffixes collision-safe for Notion Desktop IDs
  • document Unicode-safe Markdown paths

Live-data check

  • synced a temporary Desktop snapshot into /tmp
  • old export: 4343 pages reported, 4190 Markdown files written
  • new export: 4343 pages reported, 4343 Markdown files written
  • new export preserved 31 non-ASCII paths that old export flattened

Testing

  • git diff --check
  • go test ./...
  • go build ./cmd/notcrawl
  • temp Desktop sync/export comparison

@github-actions github-actions Bot added the fix label Apr 27, 2026
@vincentkoc vincentkoc merged commit a906ed4 into main Apr 27, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant