ESQL: Add error policy and configurable options for CSV format reader by costin · Pull Request #143779 · elastic/elasticsearch

costin · 2026-03-06T22:12:18Z

Adds resilient error handling and configurable parsing options to the ESQL CSV datasource format reader.

Error policy — three modes control how malformed rows are handled during CSV ingestion:

FAIL_FAST — abort on the first error (default, equivalent to Spark FAILFAST)
SKIP_ROW — drop the malformed row and continue (equivalent to Spark DROPMALFORMED, DuckDB ignore_errors)
NULL_FIELD — null-fill unparseable fields while keeping the row (equivalent to Spark PERMISSIVE)

An error budget (max_errors, max_error_ratio) caps how many errors are tolerated before aborting, giving operators fine-grained control over data quality vs. throughput.

Configurable format options: delimiter, quote/escape characters, comment prefix, null representation, encoding, datetime format, and max field size can all be set per-query via WITH parameters

Both features are wired through the existing FormatReader SPI via a new withConfig(Map) method, keeping the interface backward-compatible.

Developed using AI-assisted tooling

Introduce ErrorPolicy with three modes (FAIL_FAST, SKIP_ROW, NULL_FIELD) and an error budget (maxErrors, maxErrorRatio) for resilient CSV parsing. Add CsvFormatOptions for configurable delimiter, quote/escape characters, comment prefix, null representation, encoding, datetime format, and max field size. Extend FormatReader SPI with withConfig() for per-query configuration.

elasticsearchmachine · 2026-03-06T22:12:44Z

Hi @costin, I've created a changelog YAML for you.

elasticsearchmachine · 2026-03-06T22:12:44Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

Replace exception-based error flow with return-code approach using a lastFieldError field, eliminating exception allocation and stack trace filling on parse failures. Pre-compute projected column arrays (int[], DataType[], Attribute[]) at schema init to avoid autoboxing and list lookups per field. Hoist invariant checks (comment filter, null value, log flag) into constructor-time booleans. Reuse Object[] row buffer across rows. Replace division with multiplication in error budget ratio check.

bpintea

LG, left just one note.

bpintea · 2026-03-09T11:48:16Z

...atasource-csv/src/main/java/org/elasticsearch/xpack/esql/datasource/csv/CsvFormatReader.java

+        private void onFieldError(String message, String value, Attribute attr) {
+            errorCount++;
+            if (logErrors) {
+                logger.warn(


This is good, but I guess we might want to evolve how we inform the user about the error. I believe that unless the policy is FAIL_FAST, a user would need to check the logs to see what happened.
We might want to add warnings?

costin added >enhancement Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) :Analytics/ES|QL AKA ESQL v9.4.0 labels Mar 6, 2026

costin requested a review from bpintea March 6, 2026 22:12

costin added 2 commits March 7, 2026 00:12

Update docs/changelog/143779.yaml

8a57897

costin enabled auto-merge (squash) March 7, 2026 06:57

bpintea approved these changes Mar 9, 2026

View reviewed changes

costin merged commit 4e671c4 into elastic:main Mar 9, 2026
36 checks passed

costin deleted the esql/ds/format-error-policy branch March 9, 2026 11:48

prwhelan mentioned this pull request Mar 9, 2026

[Transform] Disable PIT for CPS #143876

Closed

costin mentioned this pull request Mar 9, 2026

ESQL: Add configurable bracket-based multi-value support for CSV reader #143890

Merged

tylerperk added the ES|QL|DS ES|QL datasources label Mar 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ESQL: Add error policy and configurable options for CSV format reader#143779

ESQL: Add error policy and configurable options for CSV format reader#143779
costin merged 3 commits intoelastic:mainfrom
costin:esql/ds/format-error-policy

costin commented Mar 6, 2026 •

edited

Loading

Uh oh!

elasticsearchmachine commented Mar 6, 2026

Uh oh!

elasticsearchmachine commented Mar 6, 2026

Uh oh!

bpintea left a comment

Uh oh!

bpintea Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

costin commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Mar 6, 2026

Uh oh!

elasticsearchmachine commented Mar 6, 2026

Uh oh!

bpintea left a comment

Choose a reason for hiding this comment

Uh oh!

bpintea Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

costin commented Mar 6, 2026 •

edited

Loading