Histogram field data type by not-napoleon · Pull Request #139457 · elastic/elasticsearch

not-napoleon · 2025-12-12T16:17:14Z

Add minimal support for a histogram field data type. This adds the type as "under construction" but not behind a feature flag; as with the previous PR, I don't see any need for that additional layer. As is tradition with new data types, most of this PR is test support.

elasticsearchmachine · 2025-12-12T16:17:51Z

Pinging @elastic/es-storage-engine (Team:StorageEngine)

nik9000

Requested one thing. The other "big" types don't have it, but should as well. I just hadn't noticed before.

nik9000 · 2025-12-12T16:29:32Z

x-pack/plugin/esql/qa/testFixtures/src/main/java/org/elasticsearch/xpack/esql/CsvTestUtils.java

+                throw new IllegalArgumentException("Expected START_OBJECT but found: " + parser.currentToken());
+            }
+            parser.nextToken();
+            // TODO: This is straight up copied from HistogramParser. There are even fewer good places to put that for reuse than


nik9000 · 2025-12-12T16:30:40Z

...ugin/esql/qa/testFixtures/src/main/java/org/elasticsearch/xpack/esql/CsvTestsDataLoader.java

        boolean exponentialHistogramFieldSupported,
-        boolean tDigestFieldSupported
+        boolean tDigestFieldSupported,
+        boolean histogramFieldSupported


TODO for who knows when: this is too many booleans for one method. It needs something kinder to read.

+1. I can try to get to it during the holiday lull, but no promises. I've already got a lot on my list for that, and I'm only working a couple of days.
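One possible shape for the "kinder to read" version is a set of optional field types in place of the positional booleans. This is only a sketch: `OptionalFieldType`, `DatasetFilter`, `fromFlags`, and `shouldLoad` are hypothetical names and are not part of `CsvTestsDataLoader`.

```java
import java.util.EnumSet;
import java.util.Set;

/** Hypothetical stand-in for the three positional booleans in the diff above. */
enum OptionalFieldType {
    EXPONENTIAL_HISTOGRAM,
    TDIGEST,
    HISTOGRAM
}

final class DatasetFilter {
    private DatasetFilter() {}

    /** Bridges the existing boolean flags to a set, so callers can migrate one at a time. */
    static Set<OptionalFieldType> fromFlags(boolean exponentialHistogram, boolean tDigest, boolean histogram) {
        EnumSet<OptionalFieldType> supported = EnumSet.noneOf(OptionalFieldType.class);
        if (exponentialHistogram) {
            supported.add(OptionalFieldType.EXPONENTIAL_HISTOGRAM);
        }
        if (tDigest) {
            supported.add(OptionalFieldType.TDIGEST);
        }
        if (histogram) {
            supported.add(OptionalFieldType.HISTOGRAM);
        }
        return supported;
    }

    /** A dataset is loaded only when every field type it needs is supported by the cluster under test. */
    static boolean shouldLoad(Set<OptionalFieldType> required, Set<OptionalFieldType> supported) {
        return supported.containsAll(required);
    }
}
```

With a set, adding a fourth type means adding one enum constant, and call sites name the types they care about instead of relying on argument order.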

Do we really need this check here for the classic histogram field?
I had to add these checks for exponential_histogram because mixed-cluster / bwc tests would create clusters without exponential_histogram support and would fail on test-setup when trying to ingest the test-data.
I'd assume that the histogram ES field type (!= ESQL type) is old enough to be supported in all versions we test with?

Whether we actually support it as an ES|QL type will be guarded via the capabilities and doesn't need this check.

That might be the case, but I couldn't prove it to my satisfaction in the time I had to work on this, so I added this check defensively. In my opinion, we need to rethink how we're doing this in general. It's not sustainable for us to keep touching this, and it functionally adds another poorly supported proxy for a version. Ideally, I think this would tie to a transport version, an index version, or something like that.

nik9000 · 2025-12-12T16:31:41Z

...ck/plugin/esql/qa/testFixtures/src/main/java/org/elasticsearch/xpack/esql/EsqlTestUtils.java

+            for (int i = 0; i < values.size(); i++) {
+                long count = counts.get(i);
+                assert count >= 0;
+                // we do not add elements with count == 0


That's just "compression"?

Honestly, this is bad copy pasta. There's no reason we should ever generate a zero in the random test data here, and I'll just update it to not do that. I believe the original code I copied this from, in the field mapper, does skip empty buckets as a space optimization, but I don't 100% remember the reasoning there.
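For reference, the "skip empty buckets" behaviour discussed in this thread can be sketched as below. This is an illustration under assumptions, not the code from `EsqlTestUtils` or the field mapper: `HistogramBuckets`, `Bucket`, and `nonEmpty` are hypothetical names, and the inputs are modelled as two parallel lists like the `values` and `counts` in the diff.

```java
import java.util.ArrayList;
import java.util.List;

final class HistogramBuckets {
    private HistogramBuckets() {}

    /** Hypothetical value/count pair; the real encoding is binary and not shown here. */
    static final class Bucket {
        final double value;
        final long count;

        Bucket(double value, long count) {
            this.value = value;
            this.count = count;
        }
    }

    /** Pairs up values and counts, dropping buckets whose count is zero. */
    static List<Bucket> nonEmpty(List<Double> values, List<Long> counts) {
        if (values.size() != counts.size()) {
            throw new IllegalArgumentException("values and counts must have the same length");
        }
        List<Bucket> buckets = new ArrayList<>(values.size());
        for (int i = 0; i < values.size(); i++) {
            long count = counts.get(i);
            if (count < 0) {
                throw new IllegalArgumentException("negative count at index " + i);
            }
            if (count == 0) {
                continue; // an empty bucket carries no information, so it is not stored
            }
            buckets.add(new Bucket(values.get(i), count));
        }
        return buckets;
    }
}
```

If the random test data never generates a zero count, as proposed in the reply above, the `count == 0` branch is simply never taken.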

nik9000 · 2025-12-12T16:46:14Z

...ql/src/main/java/org/elasticsearch/xpack/esql/expression/function/scalar/nulls/Coalesce.java

            case NULL -> EvalOperator.CONSTANT_NULL_FACTORY;
            case UNSUPPORTED, SHORT, BYTE, DATE_PERIOD, OBJECT, DOC_DATA_TYPE, SOURCE, TIME_DURATION, FLOAT, HALF_FLOAT, TSID_DATA_TYPE,
-                SCALED_FLOAT, AGGREGATE_METRIC_DOUBLE, TDIGEST, DENSE_VECTOR -> throw new UnsupportedOperationException(
+                SCALED_FLOAT, AGGREGATE_METRIC_DOUBLE, TDIGEST, HISTOGRAM, DENSE_VECTOR -> throw new UnsupportedOperationException(


We should get to stuff like this before long.

I wonder if there's a good way of "querying" the code base for the list of outstanding "stuff" for new data types. Like this.

Anything like that would need a way to differentiate between "real" types (like Histogram) and types that should not exist by this point in the plan, like Half Float.

That said, I intend to implement this for Histogram (and probably T-Digest, unless @JonasKunz gets to it first) soon. Just, not yet.
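One way to approximate the "query" idea from this thread is a probe that walks every type, calls the factory, and records which ones still throw `UnsupportedOperationException`, with an explicit exclusion set for types that should never reach that point. The sketch below is purely illustrative: `SketchType`, `OutstandingTypeSupport`, `coalesceEvaluator`, and `unsupported` are hypothetical names, and the enum is a tiny stand-in rather than the real ES|QL data type list.

```java
import java.util.EnumSet;
import java.util.Set;
import java.util.function.Function;

/** Tiny hypothetical stand-in for the real data type enum. */
enum SketchType {
    LONG,
    DOUBLE,
    TDIGEST,
    HISTOGRAM,
    HALF_FLOAT
}

final class OutstandingTypeSupport {
    private OutstandingTypeSupport() {}

    /** Mimics the shape of the switch in the diff: some types are not implemented yet. */
    static String coalesceEvaluator(SketchType type) {
        return switch (type) {
            case LONG -> "long evaluator";
            case DOUBLE -> "double evaluator";
            case TDIGEST, HISTOGRAM, HALF_FLOAT -> throw new UnsupportedOperationException(type + " can't be coalesced");
        };
    }

    /**
     * Returns the types for which the factory still throws, skipping types that
     * are not expected to exist at this point in the plan.
     */
    static Set<SketchType> unsupported(Function<SketchType, ?> factory, Set<SketchType> notExpectedHere) {
        EnumSet<SketchType> outstanding = EnumSet.noneOf(SketchType.class);
        for (SketchType type : SketchType.values()) {
            if (notExpectedHere.contains(type)) {
                continue;
            }
            try {
                factory.apply(type);
            } catch (UnsupportedOperationException e) {
                outstanding.add(type);
            }
        }
        return outstanding;
    }
}
```

The exclusion set is where the distinction between "real" types and types like Half Float would live; whether that list can be maintained centrally is an open question.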

nik9000 · 2025-12-12T16:47:46Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/type/EsqlDataTypeConverter.java

+
+    public static String histogramToString(BytesRef histogram) {
+        // TODO: reuse the nearly identical code from HistogramFieldMapper
+        try (XContentBuilder builder = JsonXContent.contentBuilder()) {


Would you kindly throw an IllegalArgumentException from this if the histogram is more than, like, 2 MB worth of buckets? And add IllegalArgumentException to the list of warnExceptions in the @ConvertEvaluator in ToString. Just out of paranoia.
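A minimal sketch of the requested guard, assuming the check is made on the length of the encoded histogram before any string is built. `HistogramToStringGuard`, `MAX_ENCODED_BYTES`, and `requireWithinLimit` are hypothetical names, the input is modelled as a plain `byte[]` rather than a `BytesRef`, and the 2 MB figure comes from the comment above, not from the merged code.

```java
final class HistogramToStringGuard {
    private HistogramToStringGuard() {}

    /** Assumed cap, taken from the "2 MB" figure in the review comment. */
    static final int MAX_ENCODED_BYTES = 2 * 1024 * 1024;

    /** Returns the input unchanged, or throws if it is too large to render as a string. */
    static byte[] requireWithinLimit(byte[] encodedHistogram) {
        if (encodedHistogram.length > MAX_ENCODED_BYTES) {
            throw new IllegalArgumentException(
                "histogram of ["
                    + encodedHistogram.length
                    + "] bytes is too large to convert to a string; the limit is ["
                    + MAX_ENCODED_BYTES
                    + "] bytes"
            );
        }
        return encodedHistogram;
    }
}
```

Throwing `IllegalArgumentException` matters for the second half of the request: once that exception type is listed in `warnExceptions`, an oversized histogram would be reported as a warning rather than failing the whole query.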

Conflicts: server/src/main/resources/transport/upper_bounds/9.3.csv x-pack/plugin/esql/qa/server/src/main/java/org/elasticsearch/xpack/esql/qa/rest/EsqlSpecTestCase.java x-pack/plugin/esql/qa/testFixtures/src/main/java/org/elasticsearch/xpack/esql/CsvTestsDataLoader.java

Add minimal support for a histogram field data type. This adds the type as "under construction" but not behind a feature flag; as with the previous PR, I don't see any need for that additional layer. As is tradition with new data types, most of this PR is test support. --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>

not-napoleon added 8 commits December 9, 2025 15:59

javadoc for the next time I have to do this

a5944b7

Block loader tests

efb2130

CSV data loading for Histogram fields

48ef0e3

Get the datatype to compile and run some tests

0b5247a

basic load test

4e7490b

fix expected value rendering for histograms

41038ca

Merge branch 'main' into histogram-field-data-type

d985c83

Conflicts: server/src/main/resources/transport/upper_bounds/9.3.csv x-pack/plugin/analytics/src/test/java/org/elasticsearch/xpack/analytics/mapper/HistogramFieldBlockLoaderTests.java

fix EsqlQueryResponseTests

476b55a

not-napoleon requested review from JonasKunz and nik9000 December 12, 2025 16:17

not-napoleon added >non-issue :StorageEngine/ES|QL Timeseries / metrics / PromQL / logsdb capabilities in ES|QL v9.3.0 labels Dec 12, 2025

elasticsearchmachine added the Team:StorageEngine label Dec 12, 2025

not-napoleon and others added 4 commits December 12, 2025 11:20

Merge branch 'main' into histogram-field-data-type

fb2b43d

Conflicts: server/src/main/resources/transport/upper_bounds/9.3.csv

[CI] Auto commit changes from spotless

18ea941

spotless apply

5a62732

Merge remote-tracking branch 'refs/remotes/not-napoleon/histogram-field-data-type' into histogram-field-data-type

aac3085

nik9000 requested changes Dec 12, 2025

nik9000 mentioned this pull request Dec 12, 2025

ESQL: Limit TO_STRING element size #139464

Open

4 tasks

not-napoleon added 2 commits December 12, 2025 13:36

Fix all supported fields test case

c9938af

response to PR feedback

8426af4

nik9000 approved these changes Dec 12, 2025

not-napoleon added 2 commits December 12, 2025 14:08

that was just dumb on my part

b7f26b4

Merge branch 'main' into histogram-field-data-type

c79a372

not-napoleon enabled auto-merge (squash) December 12, 2025 19:34

not-napoleon added 2 commits December 12, 2025 15:58

fix unsupported type yaml tests

7c19f91

Merge remote-tracking branch 'refs/remotes/not-napoleon/histogram-field-data-type' into histogram-field-data-type

207b725

not-napoleon disabled auto-merge December 12, 2025 20:59

not-napoleon added 3 commits December 12, 2025 15:59

Merge branch 'main' into histogram-field-data-type

cd299be

capability skip. We don't need to be BWC for unsupported fields

bcd48cf

Merge remote-tracking branch 'refs/remotes/not-napoleon/histogram-field-data-type' into histogram-field-data-type

b383a7f

JonasKunz approved these changes Dec 15, 2025

not-napoleon added 2 commits December 15, 2025 09:03

skip a few more tests

ccf5f0a

Merge branch 'main' into histogram-field-data-type

c9926ea

not-napoleon enabled auto-merge (squash) December 15, 2025 16:13

not-napoleon added 3 commits December 15, 2025 12:53

fix merge semantic conflict

777d19c

fix merge semantic conflict

7f11b96

not-napoleon merged commit 27feccd into elastic:main Dec 15, 2025
35 checks passed

kkrik-es mentioned this pull request Dec 17, 2025

[CI] XPackRestIT test {p0=esql/40_unsupported_types/unsupported with sort} failing #139702

Closed

not-napoleon mentioned this pull request Dec 17, 2025

Add support for storing and querying T-Digest sketches #137649

Closed

57 tasks

not-napoleon merged 26 commits into elastic:main from not-napoleon:histogram-field-data-type

4 participants
