LogsDB qa tests - add specific matcher for source by lkts · Pull Request #111568 · elastic/elasticsearch

lkts · 2024-08-02T21:41:41Z

This PR adds a new matcher that specifically handles the source of documents. We need a special implementation because of the complex structure of a source document (multiple layers of documents) and the differences between synthetic and stored source.

Such differences could in theory be expressed in generic terms (e.g. by adding a parameter to the list matcher to ignore nulls), but I think this is simpler. Moreover, as I discovered previously, we need special matchers for some fields due to field-specific differences in synthetic source. To do that we would need a special code path anyway.

elasticsearchmachine · 2024-08-02T21:42:04Z

Pinging @elastic/es-storage-engine (Team:StorageEngine)

dnhatn

Looks great. Thanks @lkts.

dnhatn · 2024-08-03T21:44:15Z

...tTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/FieldSpecificMatcher.java

+    boolean match(List<Object> actual, List<Object> expected);
+
+    class HalfFloatMatcher implements FieldSpecificMatcher {
+        public boolean match(List<Object> actual, List<Object> expected) {


nit: Override?

dnhatn · 2024-08-03T21:54:31Z

...aRestTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/SourceTransforms.java

+        if (currentField instanceof Map<?, ?> map) {
+            descend(pathToCurrentField, (Map<String, Object>) map, flattened);
+        } else {
+            flattened.putIfAbsent(pathToCurrentField, new ArrayList<>());


nit: Can we use computeIfAbsent().add()?
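
For illustration, here is a minimal sketch of the flattening step with the `computeIfAbsent().add()` idiom applied. It is not the code from this PR: the class name `FlattenSourceSketch` and the helper `handle` are hypothetical, and the handling of lists (each element is flattened under the same path) is an assumption.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not the SourceTransforms class from the PR.
final class FlattenSourceSketch {

    /** Flattens a nested source map into dotted path -> list of leaf values. */
    static Map<String, List<Object>> flatten(Map<String, Object> source) {
        Map<String, List<Object>> flattened = new HashMap<>();
        descend(null, source, flattened);
        return flattened;
    }

    private static void descend(String pathFromRoot, Map<String, Object> level, Map<String, List<Object>> flattened) {
        for (Map.Entry<String, Object> entry : level.entrySet()) {
            String path = pathFromRoot == null ? entry.getKey() : pathFromRoot + "." + entry.getKey();
            handle(path, entry.getValue(), flattened);
        }
    }

    @SuppressWarnings("unchecked")
    private static void handle(String path, Object value, Map<String, List<Object>> flattened) {
        if (value instanceof Map<?, ?>) {
            // Nested object: keep descending with the extended path.
            descend(path, (Map<String, Object>) value, flattened);
        } else if (value instanceof List<?>) {
            // Assumption: array elements are collected under the same path.
            for (Object element : (List<?>) value) {
                handle(path, element, flattened);
            }
        } else {
            // One call creates the list on first use and appends the value.
            flattened.computeIfAbsent(path, k -> new ArrayList<>()).add(value);
        }
    }
}
```

With this shape there is no separate `putIfAbsent` followed by `get`, so the path is looked up only once per leaf value.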

dnhatn · 2024-08-03T21:56:11Z

...javaRestTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/SourceMatcher.java

+        this.fieldSpecificMatchers = Map.of("half_float", new FieldSpecificMatcher.HalfFloatMatcher());
+    }
+
+    public MatchResult match() {


nit: override

dnhatn · 2024-08-03T22:04:26Z

...tTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/FieldSpecificMatcher.java

+import java.util.stream.Collectors;
+
+public interface FieldSpecificMatcher {
+    boolean match(List<Object> actual, List<Object> expected);


Can this method return a MatchResult instead of a boolean?

I wanted to do that, but to produce a full message in case of a mismatch you'll need to pass mapping and settings here, which ends up being a mouthful.

Actually it's not a big deal.
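
A minimal sketch of what a result-returning matcher interface could look like. Everything here is hypothetical: `Result` is a stand-in for the `MatchResult` type mentioned above (only `MatchResult.match()` appears in this thread, so `noMatch` is an assumed counterpart), and the `EXACT` matcher uses plain list equality purely for illustration rather than any field-specific logic.

```java
import java.util.List;

// Hypothetical sketch, not the FieldSpecificMatcher interface from the PR.
final class FieldMatcherSketch {

    /** Stand-in for MatchResult: a flag plus a message explaining a mismatch. */
    static final class Result {
        final boolean matched;
        final String message;

        private Result(boolean matched, String message) {
            this.matched = matched;
            this.message = message;
        }

        static Result match() {
            return new Result(true, "");
        }

        // Assumed counterpart to match(); not confirmed by the thread.
        static Result noMatch(String message) {
            return new Result(false, message);
        }
    }

    /** Matcher variant that reports why values differ instead of a bare boolean. */
    interface FieldMatcher {
        Result match(List<Object> actual, List<Object> expected);
    }

    /** Illustrative matcher: list equality with a descriptive message on mismatch. */
    static final FieldMatcher EXACT = (actual, expected) -> actual.equals(expected)
        ? Result.match()
        : Result.noMatch("Values do not match, actual: " + actual + ", expected: " + expected);
}
```

The message in this sketch only includes the values; a fuller message that also shows mapping and settings would need those passed in, which is the trade-off raised above.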

dnhatn · 2024-08-03T22:07:05Z

...javaRestTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/SourceMatcher.java

+        return MatchResult.match();
+    }
+
+    private Optional<MatchResult> matchWithFieldSpecificMatcher(String fieldName, List<Object> actualValues, List<Object> expectedValues) {


Can this method return the specific matcher instead of MatchResult?

There is no common interface between a field-specific matcher and a generic matcher. Maybe there should be, but I don't immediately see the benefit.

kkrik-es · 2024-08-05T13:19:28Z

...RestTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/MappingTransforms.java

+                        continue;
+                    }
+
+                    flattened.putIfAbsent(pathFromRoot, new HashMap<>());


Nit: combine the two lines:

flattened.computeIfAbsent(pathFromRoot, k -> new HashMap<>()).put(entry.getKey(), entry.getValue());

Neat, didn't expect this from Java.

kkrik-es · 2024-08-05T13:21:25Z

...RestTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/MappingTransforms.java

+     * @return
+     */
+    public static Map<String, Map<String, Object>> normalizeMapping(Map<String, Object> map) {
+        var flattened = new HashMap<String, Map<String, Object>>();


I'm somewhat puzzled here, why not just use HashMap<String, Object> to track just the leaf fields?

I see, you do that for source checking while you also want to compare the mappings. Need to think a bit about it..

This is a map from normalized field name (a.b.c) to a map of mapping parameters (like type) since there can be multiple.
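
To make that shape concrete, here is a minimal sketch of a mapping normalizer. It is not the `MappingTransforms` code from this PR: the class name is hypothetical, and it assumes the input is the content of a mapping's `properties` object, with nested objects expressed through further `properties` entries.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, not the MappingTransforms class from the PR.
final class NormalizeMappingSketch {

    /** Returns dotted field name (a.b.c) -> mapping parameters of that field (e.g. type). */
    static Map<String, Map<String, Object>> normalize(Map<String, Object> properties) {
        Map<String, Map<String, Object>> flattened = new HashMap<>();
        descend(null, properties, flattened);
        return flattened;
    }

    @SuppressWarnings("unchecked")
    private static void descend(String parentPath, Map<String, Object> properties, Map<String, Map<String, Object>> flattened) {
        for (Map.Entry<String, Object> field : properties.entrySet()) {
            String pathFromRoot = parentPath == null ? field.getKey() : parentPath + "." + field.getKey();
            // Assumption: every field definition is itself a map of parameters.
            Map<String, Object> fieldMapping = (Map<String, Object>) field.getValue();
            for (Map.Entry<String, Object> entry : fieldMapping.entrySet()) {
                if ("properties".equals(entry.getKey())) {
                    // Sub-fields are flattened under the extended path, not stored as a parameter.
                    descend(pathFromRoot, (Map<String, Object>) entry.getValue(), flattened);
                    continue;
                }
                flattened.computeIfAbsent(pathFromRoot, k -> new HashMap<>()).put(entry.getKey(), entry.getValue());
            }
        }
    }
}
```

A plain `Map<String, Object>` of leaf fields would lose parameters such as `type`, which a field-specific matcher needs in order to decide how values are compared.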

kkrik-es · 2024-08-05T13:25:59Z

...javaRestTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/SourceMatcher.java

+    private Optional<MatchResult> matchWithFieldSpecificMatcher(String fieldName, List<Object> actualValues, List<Object> expectedValues) {
+        var actualFieldMapping = actualNormalizedMapping.get(fieldName);
+        if (actualFieldMapping == null) {
+            // Dynamic mapping, nothing to do


Check that expectedNormalizedMapping.get(fieldName) returns null too?

kkrik-es · 2024-08-05T13:34:13Z

...aRestTest/java/org/elasticsearch/datastreams/logsdb/qa/matchers/source/SourceTransforms.java

+
+        // Synthetic source modifications:
+        // * null values are not present
+        // * duplicates are removed


We probably need to add an extension to randomization to inject duplicates.

In general, it'd be nice to have some values repeated randomly, especially for hostname.

That's a very good idea!
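
The two synthetic source modifications listed in the code comment above (nulls dropped, duplicates removed) can be sketched as a normalization applied to both sides before comparison. This is an illustration only: the class and method names are hypothetical, and preserving first-occurrence order is an assumption, since the thread does not say how ordering is treated.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch, not the SourceTransforms class from the PR.
final class SyntheticSourceNormalizationSketch {

    /** Drops nulls and duplicates so stored and synthetic source values become comparable. */
    static List<Object> normalize(List<Object> values) {
        // LinkedHashSet removes duplicates while keeping first-occurrence order (an assumption).
        Set<Object> unique = new LinkedHashSet<>();
        for (Object value : values) {
            if (value != null) {
                unique.add(value);
            }
        }
        return new ArrayList<>(unique);
    }
}
```

Randomized test data that repeats values, as suggested above, would exercise the duplicate-removal branch of such a normalization.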

* upstream/main: (132 commits)
  Fix compile after several merges
  Update docs with new behavior on skip conditions (elastic#111640)
  Skip on any instance of node or version features being present (elastic#111268)
  Skip on any node capability being present (elastic#111585)
  [DOCS] Publishes Anthropic inference service docs. (elastic#111619)
  Introduce `ChunkedZipResponse` (elastic#109820)
  [Gradle] fix esql compile cacheability (elastic#111651)
  Mute org.elasticsearch.datastreams.logsdb.qa.StandardVersusLogsIndexModeChallengeRestIT testTermsQuery elastic#111666
  Mute org.elasticsearch.datastreams.logsdb.qa.StandardVersusLogsIndexModeChallengeRestIT testMatchAllQuery elastic#111664
  Mute org.elasticsearch.xpack.esql.analysis.VerifierTests testMatchCommand elastic#111661
  Mute org.elasticsearch.xpack.esql.optimizer.LocalPhysicalPlanOptimizerTests testMatchCommandWithMultipleMatches {default} elastic#111660
  Mute org.elasticsearch.xpack.esql.optimizer.LocalPhysicalPlanOptimizerTests testMatchCommand {default} elastic#111659
  Mute org.elasticsearch.xpack.esql.optimizer.LocalPhysicalPlanOptimizerTests testMatchCommandWithWhereClause {default} elastic#111658
  LogsDB qa tests - add specific matcher for source (elastic#111568)
  ESQL: Move `randomLiteral` (elastic#111647)
  [ESQL] Clean up UNSUPPORTED type blocks (elastic#111648)
  ESQL: Remove the `NESTED` DataType (elastic#111495)
  ESQL: Move more out of esql-core (elastic#111604)
  Improve MvPSeriesWeightedSum edge case and add more tests (elastic#111552)
  Add link to flood-stage watermark exception message (elastic#111315)
  ...

# Conflicts:
# server/src/main/java/org/elasticsearch/TransportVersions.java

LogsDB qa tests - add specific matcher for source

e02b455

lkts added >test, :StorageEngine/Logs labels Aug 2, 2024

lkts requested review from dnhatn and kkrik-es August 2, 2024 21:41

elasticsearchmachine added v8.16.0 Team:StorageEngine labels Aug 2, 2024

dnhatn approved these changes Aug 3, 2024

kkrik-es reviewed Aug 5, 2024

Address feedback

269fa22

lkts merged commit aa1d2bc into elastic:main Aug 6, 2024

lkts deleted the logsdb_qa_tests_matcher_updates branch August 6, 2024 20:49

lkts mentioned this pull request Aug 6, 2024

Fix LogsDB challenge test #111665

Merged

rjernst pushed a commit to rjernst/elasticsearch that referenced this pull request Aug 7, 2024

LogsDB qa tests - add specific matcher for source (elastic#111568)

3ae42bf

mhl-b pushed a commit that referenced this pull request Aug 8, 2024

LogsDB qa tests - add specific matcher for source (#111568)

88e2fcc

cbuescher pushed a commit to cbuescher/elasticsearch that referenced this pull request Sep 4, 2024

LogsDB qa tests - add specific matcher for source (elastic#111568)

3017956
