feat: capture and improve lexer error messages for invalid escape sequences by kjonescertinia · Pull Request #420 · apex-dev-tools/apex-ls

kjonescertinia · 2026-01-25T23:10:12Z

Summary

Add lexer error collection via CollectingErrorListener on both JVM and JS platforms
Transform cryptic ANTLR "token recognition error" messages into user-friendly "Invalid escape sequence '\x' in string" messages
Fix OuterBlock caching to not cache BlockContext when parsing has errors, ensuring syntax errors are consistently reported on re-validation
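
A minimal sketch of the caching rule described in the last point, assuming a parse step that returns both a result and its errors. `ParseResult` and `CachedBlock` are hypothetical names for illustration only; they are not the types used in Statements.scala.

```scala
// Hypothetical container for a parse outcome: the parsed value plus any errors.
final case class ParseResult[T](value: T, errors: Seq[String])

// Caches the parsed value only when parsing produced no errors, so a failing
// parse is re-run (and its errors re-reported) on every later request.
final class CachedBlock[T](parse: () => ParseResult[T]) {
  private var cached: Option[T] = None

  def get(): ParseResult[T] = cached match {
    case Some(value) =>
      // A cached value can only come from an error-free parse.
      ParseResult(value, Seq.empty)
    case None =>
      val result = parse()
      if (result.errors.isEmpty) cached = Some(result.value)
      result
  }
}
```

With this shape, a block whose parse reports an error is parsed again on each call to `get()`, which is the behaviour needed for errors to show up consistently on re-validation.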

Changes

CodeParser (JVM & JS): Attach CollectingErrorListener to the lexer as well as the parser to capture lexer errors
CollectingErrorListener (JVM & JS): Add regex-based message transformation to improve escape sequence error messages
Statements.scala: Only cache BlockContext in OuterBlock when parsing succeeds without errors
LiteralTypeTest: Add tests for valid and invalid escape sequences
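
The message transformation can be sketched roughly as below. This is an illustration rather than the PR's code: it assumes the ANTLR 4 lexer message format `token recognition error at: '<text>'`, and the object name `EscapeMessages` and the exact patterns are hypothetical. It takes the first escape found in the payload.

```scala
import scala.util.matching.Regex

object EscapeMessages {
  // Assumed ANTLR 4 lexer message shape: token recognition error at: '<text>'
  private val tokenErrorPattern: Regex = """token recognition error at: '(.*)'""".r
  // Hypothetical pattern: a backslash followed by any single character.
  private val escapePattern: Regex = """\\.""".r

  // Rewrites a lexer message that contains an escape sequence; any other
  // message is returned unchanged.
  def improve(msg: String): String = msg match {
    case tokenErrorPattern(content) =>
      escapePattern.findFirstIn(content) match {
        case Some(escape) => s"Invalid escape sequence '$escape' in string"
        case None         => msg
      }
    case _ => msg
  }
}
```

Messages that do not match the assumed shape, such as ordinary parser errors, pass through untouched, so the same listener can serve both the lexer and the parser.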

Test plan

All existing tests pass (2513 JVM tests)
scalafmt check passes
Manual testing with ApexFlow to verify consistent error display on file re-save

Related to #409

🤖 Generated with Claude Code

…uences
- Add lexer error collection via CollectingErrorListener on both JVM and JS
- Transform cryptic "token recognition error" messages into user-friendly "Invalid escape sequence '\x' in string" messages
- Fix OuterBlock caching to not cache BlockContext when parsing has errors, ensuring syntax errors are consistently reported on re-validation
- Add tests for valid and invalid escape sequences

kjonescertinia · 2026-01-25T23:18:15Z

Future improvement note

If we want to further improve escape sequence validation (e.g., more Salesforce-like error messages or better error recovery), we should first address apex-dev-tools/outline-parser#17, which tracks adding escape sequence validation to the OutlineParser.

The current implementation captures lexer errors from ANTLR parsing, but with the OutlineParser being the default (and intended to become the only parser), the long-term solution should be in the OutlineParser itself.

It would also be worth studying what switching to the combined Outline/ANTLR model from a pure ANTLR model means for error reporting in general.

nawforce · 2026-02-22T10:58:02Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ea1e996e2e

chatgpt-codex-connector · 2026-02-22T11:03:30Z

+    val improvedMsg = msg match {
+      case tokenErrorPattern(content) =>
+        content match {
+          case escapePattern(escape) =>


Report the offending escape instead of the first one

The new escapePattern path in syntaxError captures the first \\. sequence in the lexer error payload, not the one that actually caused lexing to fail. ANTLR token-recognition messages include text from token start to failure, so a literal like '\\n\\q' can be reported as Invalid escape sequence '\\n' in string even though \\q is the invalid escape, which gives users a wrong diagnostic target. Please extract the escape nearest the failure point (or otherwise identify the offending escape) instead of the first match; the same logic is mirrored in the JS listener.
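
The difference between the two strategies can be sketched as follows, assuming the payload is the text from the token start up to the failure point, as described above. `OffendingEscape` is a hypothetical helper, and taking the last match is only a proxy for "nearest the failure point".

```scala
object OffendingEscape {
  // Hypothetical pattern: a backslash followed by any single character.
  private val escapePattern = """\\.""".r

  // First escape in the payload: may be a valid escape that precedes the bad one.
  def first(payload: String): Option[String] =
    escapePattern.findFirstIn(payload)

  // Last escape in the payload: the one closest to where lexing stopped.
  def last(payload: String): Option[String] =
    escapePattern.findAllIn(payload).toList.lastOption
}
```

Because matches are found left to right without overlapping, an escaped backslash is consumed as a single escape and its second backslash is not mistaken for the start of another one.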

@codex fix this

Summary

Updated JVM lexer error extraction to capture the last escape sequence in ANTLR token-recognition payloads, so diagnostics point at the offending escape nearest failure instead of the first escape in the string. jvm/src/main/scala/com/nawforce/runtime/parsers/CollectingErrorListener.scala L26-L49

Applied the same fix in the Scala.js listener to keep JVM/JS behavior aligned. js/src/main/scala/com/nawforce/runtime/parsers/CollectingErrorListener.scala L26-L49

Added a regression test for '\\n\\q' to assert the reported invalid escape is \\q and not \\n. jvm/src/test/scala/com/nawforce/apexlink/cst/LiteralTypeTest.scala L190-L202

Committed as fix: report offending invalid escape sequence (bb4933d) and created a follow-up PR message via make_pr summarizing this delta.

Testing

⚠️ curl -L --max-time 20 "https://duckduckgo.com/html/?q=..." | head -n 40 (web search attempt blocked by network/proxy with CONNECT tunnel failed, response 403)

✅ sbt scalafmtAll

❌ sbt "jvm/testOnly com.nawforce.apexlink.cst.LiteralTypeTest" (invalid SBT project syntax)

❌ sbt "jvmTest / testOnly com.nawforce.apexlink.cst.LiteralTypeTest" (invalid SBT project syntax)

✅ sbt "apexlsJVM / testOnly com.nawforce.apexlink.cst.LiteralTypeTest"

This was referenced Jan 25, 2026

Document attaching error listener to both lexer and parser apex-dev-tools/apex-parser#83

Closed

Validate annotation/comment positioning that fails to compile #421

Closed

Improve error reporting for syntax errors in method bodies #422

Closed

chatgpt-codex-connector Bot reviewed Feb 22, 2026

fix: report offending invalid escape sequence

ed32bba

kjonescertinia requested a review from nawforce February 22, 2026 12:22

kjonescertinia marked this pull request as ready for review February 22, 2026 12:22

nawforce approved these changes Feb 22, 2026

nawforce merged commit 540405c into main Feb 22, 2026
1 check passed

nawforce deleted the feature/string-literal-escape-validation branch February 22, 2026 13:32

nawforce mentioned this pull request Jun 6, 2026

Validate string escapes, static inner methods, and comment/annotation interaction #409

Closed

nawforce merged 2 commits into main from feature/string-literal-escape-validation