Write column OID to the source by BaurzhanSakhariev · Pull Request #14636 · crate/crate

BaurzhanSakhariev · 2023-08-31T08:07:07Z

Next step after #14635

TODO:

(reads) Change LuceneReferenceResolver/Lucene reading logic to access data by oid instead of column name
(reads) Need to update query building to use oid instead of column name
Primary key lookup? (check PKLookupOperation)
(indexing) Change mapper implementations to use oid as field name
(indexing) Change source generation to use oid as field name
(optional) Change _raw lookup to rewrite oid to name and don't include dropped columns
(optional) select _raw must hide dropped columns
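
For illustration, a minimal sketch of the optional `_raw` item in the list above: rewriting an OID-keyed source back to column names while hiding dropped columns. This assumes the source keys are the OID rendered as a string; `Column`, `toNamedSource` and the flat map layout are hypothetical and not taken from the CrateDB code base.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class RawSourceRewrite {

    /** Hypothetical, simplified column metadata; not CrateDB's Reference class. */
    record Column(long oid, String name, boolean dropped) {}

    /**
     * Rewrites a flat source map keyed by OID strings into one keyed by column
     * names. Dropped columns and keys without a known column are skipped.
     */
    static Map<String, Object> toNamedSource(Map<String, Object> sourceByOid,
                                             Map<Long, Column> columnsByOid) {
        Map<String, Object> result = new LinkedHashMap<>();
        for (Map.Entry<String, Object> entry : sourceByOid.entrySet()) {
            // Assumption: every key is a numeric OID; parseLong throws otherwise.
            Column column = columnsByOid.get(Long.parseLong(entry.getKey()));
            if (column == null || column.dropped()) {
                continue;
            }
            result.put(column.name(), entry.getValue());
        }
        return result;
    }

    public static void main(String[] args) {
        Map<Long, Column> columns = Map.of(
            1L, new Column(1L, "id", false),
            2L, new Column(2L, "legacy", true));
        Map<String, Object> source = new LinkedHashMap<>();
        source.put("1", 42);
        source.put("2", "old");
        System.out.println(toNamedSource(source, columns)); // prints {id=42}
    }
}
```

Nested objects would need the same rewrite applied recursively; the sketch only covers top-level keys.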

Add flag to denote that a column is dropped.

This is a prerequisite for using column OID-s instead of names in the _source.

Phase 1: Collect new columns and do a schema update. Don't create a Lucene document with source at this point.

Phase 2: Index.

Target references might still have an unassigned OID, since this is only a preparation step.
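
The two phases could be sketched roughly as follows. This is a simplified model under stated assumptions, not the actual Indexer: `TwoPhaseIndexing`, `collectNewColumns`, `updateSchema` and `index` are hypothetical names, the schema is an in-memory map, and OIDs come from a local counter.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class TwoPhaseIndexing {

    // Hypothetical in-memory schema: column name -> assigned OID.
    private final Map<String, Long> oidByColumn = new LinkedHashMap<>();
    private long nextOid = 1;

    /** Phase 1: collect columns missing from the schema. No document is built here. */
    List<String> collectNewColumns(Map<String, Object> row) {
        List<String> newColumns = new ArrayList<>();
        for (String name : row.keySet()) {
            if (!oidByColumn.containsKey(name)) {
                newColumns.add(name);
            }
        }
        return newColumns;
    }

    /** Stand-in for the schema update that assigns an OID to every new column. */
    void updateSchema(List<String> newColumns) {
        for (String name : newColumns) {
            if (!oidByColumn.containsKey(name)) {
                oidByColumn.put(name, nextOid++);
            }
        }
    }

    /** Phase 2: build the source keyed by OID; every column must have an OID by now. */
    Map<String, Object> index(Map<String, Object> row) {
        Map<String, Object> source = new LinkedHashMap<>();
        for (Map.Entry<String, Object> entry : row.entrySet()) {
            Long oid = oidByColumn.get(entry.getKey());
            if (oid == null) {
                throw new IllegalStateException("No OID assigned for column " + entry.getKey());
            }
            source.put(Long.toString(oid), entry.getValue());
        }
        return source;
    }

    public static void main(String[] args) {
        TwoPhaseIndexing indexer = new TwoPhaseIndexing();
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("a", 1);
        row.put("b", "x");
        indexer.updateSchema(indexer.collectNewColumns(row)); // phase 1
        System.out.println(indexer.index(row));               // phase 2, prints {1=1, 2=x}
    }
}
```

Splitting the work this way means the source is only generated once every column it mentions has an OID.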

server/src/main/java/io/crate/execution/dml/ObjectIndexer.java

seut · 2023-09-05T14:10:20Z

server/src/main/java/io/crate/types/DataType.java


    @Nullable
-    public final ValueIndexer<? super T> valueIndexer(RelationName table,
+    public final ValueIndexer<? super T> valueIndexer(DocTableInfo table,


Why is this change needed?

The whole second commit was needed only to make ObjectIndexer aware of table.versionCreated. As you noted in the previous comment, we need to pass sourceKeyWriter instead.

I mean, I didn't want to inject the version into the indexer like

    if (valueIndexer instanceof ObjectIndexer objectIndexer) {
        // only ObjectIndexer cares about Version
        objectIndexer.someNewMethodToInjectVersion(version);
    }

as I thought it's not nice, but it looks shorter. I will drop the second commit and do direct injection instead. Will pass sourceKeyWriter as suggested above.

Ah OK, I recalled why I didn't want to do "direct injection": in the case of nested indexers, it needs to be propagated to the Object/Array/Dynamic indexers. I will think about how to propagate that shortly.

I added sourceKeyWriter to indexValue, and it now gets propagated to all nested (or newly created by DynamicIndexer) ObjectIndexers. Thanks for the suggestion!
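
A rough sketch of that propagation, for readers following along. The real signatures are not shown in this thread, so everything here is an assumption: `sourceKeyWriter` is modeled as a `Function<String, String>` from column name to source key, and `ValueIndexer`, `LeafIndexer` and `ObjectIndexer` are simplified stand-ins for the classes of the same name.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

public class SourceKeyPropagation {

    /** Hypothetical, simplified indexer contract: the key writer is a call argument. */
    interface ValueIndexer {
        Object indexValue(Object value, Function<String, String> sourceKeyWriter);
    }

    /** Leaf indexer: returns the value unchanged and has no use for the key writer. */
    static final class LeafIndexer implements ValueIndexer {
        @Override
        public Object indexValue(Object value, Function<String, String> sourceKeyWriter) {
            return value;
        }
    }

    /** Object indexer: writes its own keys and hands the same writer to its children. */
    static final class ObjectIndexer implements ValueIndexer {
        private final Map<String, ValueIndexer> children;

        ObjectIndexer(Map<String, ValueIndexer> children) {
            this.children = children;
        }

        @Override
        public Object indexValue(Object value, Function<String, String> sourceKeyWriter) {
            Map<?, ?> object = (Map<?, ?>) value;
            Map<String, Object> out = new LinkedHashMap<>();
            for (Map.Entry<?, ?> entry : object.entrySet()) {
                String name = (String) entry.getKey();
                // Unknown children fall back to a leaf, loosely mimicking dynamic columns.
                ValueIndexer child = children.getOrDefault(name, new LeafIndexer());
                out.put(sourceKeyWriter.apply(name),
                        child.indexValue(entry.getValue(), sourceKeyWriter));
            }
            return out;
        }
    }

    public static void main(String[] args) {
        Map<String, String> oids = Map.of("o", "1", "a", "2");
        Function<String, String> sourceKeyWriter = oids::get;
        ValueIndexer root = new ObjectIndexer(Map.of("o", new ObjectIndexer(Map.of())));
        System.out.println(root.indexValue(Map.of("o", Map.of("a", 5)), sourceKeyWriter));
        // prints {1={2=5}}
    }
}
```

Passing the writer per call avoids a separate injection step on every indexer instance, including ones created on the fly.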

I rebased the PR

BaurzhanSakhariev · 2023-09-05T14:54:38Z

server/src/main/java/io/crate/execution/dml/Indexer.java

            addedColumnsByIdent.put(ref.column(), ref);
+            // Also add to columnsByIdent,so that getRef supplier doesn't get stale.
+            columnsByIdent.put(ref.column(), ref);
        }


This is needed for the nested ObjectIndexer-s: after phase 1, the table can get stale.
We explicitly update only the targets, via Indexer.updateTargets.

Before, this was not important since we used the column name anyway. Now we call getRef.apply on each ObjectIndexer.indexValue, since we need to get the OID.
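
To illustrate the staleness concern in isolation: a lookup bound to a live map sees columns added later, while one bound to a snapshot does not. This is a generic sketch, not the Indexer code; `Ref`, `liveLookup` and `snapshotLookup` are hypothetical names, and the column ident is simplified to a `String`.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

public class LiveRefLookup {

    /** Hypothetical, simplified reference: column name plus assigned OID. */
    record Ref(String column, long oid) {}

    /** Lookup backed by the map itself: later puts are visible. */
    static Function<String, Ref> liveLookup(Map<String, Ref> columnsByIdent) {
        return columnsByIdent::get;
    }

    /** Lookup backed by a copy taken now: later puts are NOT visible. */
    static Function<String, Ref> snapshotLookup(Map<String, Ref> columnsByIdent) {
        Map<String, Ref> snapshot = Map.copyOf(columnsByIdent);
        return snapshot::get;
    }

    public static void main(String[] args) {
        Map<String, Ref> columnsByIdent = new HashMap<>();
        columnsByIdent.put("id", new Ref("id", 1));

        Function<String, Ref> live = liveLookup(columnsByIdent);
        Function<String, Ref> stale = snapshotLookup(columnsByIdent);

        // A column added after the lookups were created, as after a schema update.
        columnsByIdent.put("name", new Ref("name", 2));

        System.out.println(live.apply("name"));  // prints Ref[column=name, oid=2]
        System.out.println(stale.apply("name")); // prints null
    }
}
```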

BaurzhanSakhariev · 2023-09-05T15:03:38Z

server/src/main/java/io/crate/execution/dml/Indexer.java

                   Symbol[] returnValues) {
+        Iterator<Reference> refsIterator = table.iterator();
+        while (refsIterator.hasNext()) {
+            Reference reference = refsIterator.next();
+            columnsByIdent.put(reference.column(), reference);
+        }


Not using table.columns() since we need all columns, including nested ones.
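
The difference between top-level columns and all columns can be pictured with a small tree walk. This is only a sketch: `Column`, `topLevel` and `allPaths` are hypothetical, and nothing here describes how DocTableInfo actually implements its iterator or in which order it yields references.

```java
import java.util.ArrayList;
import java.util.List;

public class AllColumns {

    /** Hypothetical column node: a dotted path plus the sub-columns of an object column. */
    record Column(String path, List<Column> children) {}

    /** Only the top-level columns, without descending into object columns. */
    static List<String> topLevel(List<Column> columns) {
        List<String> out = new ArrayList<>();
        for (Column column : columns) {
            out.add(column.path());
        }
        return out;
    }

    /** All columns, including nested ones, in depth-first order. */
    static List<String> allPaths(List<Column> columns) {
        List<String> out = new ArrayList<>();
        collect(columns, out);
        return out;
    }

    private static void collect(List<Column> columns, List<String> out) {
        for (Column column : columns) {
            out.add(column.path());
            collect(column.children(), out);
        }
    }

    public static void main(String[] args) {
        List<Column> columns = List.of(
            new Column("id", List.of()),
            new Column("obj", List.of(
                new Column("obj.a", List.of()),
                new Column("obj.b", List.of()))));
        System.out.println(topLevel(columns)); // prints [id, obj]
        System.out.println(allPaths(columns)); // prints [id, obj, obj.a, obj.b]
    }
}
```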

BaurzhanSakhariev and others added 11 commits August 30, 2023 19:15

Introduce global autoincrement counter column_oid

da9cdd6

Add column OID to FieldMapper-s

99e0fff

Add oid to Reference

43415cf

Assign column OID-s on column addition/table creation

f1b695a

Add isDropped flag to mappers

f34c42a

Add flag to denote that a column is dropped.

Add isDropped to SimpleReference.

2c6092a

Add support for DROP COLUMN to parser

c6523bb

Add Analyzer support for DROP COLUMN

1115948

Add planner/execution support for DROP COLUMN

3ca647b

Implement drop object sub-column

432338c

added hasOid method to ReferenceAssert

19fb158

BaurzhanSakhariev mentioned this pull request Aug 31, 2023

2 phase indexing #14635

Merged

BaurzhanSakhariev force-pushed the b/write-oid2 branch 4 times, most recently from 5cd0e82 to ad3dbc1 Compare August 31, 2023 09:07

BaurzhanSakhariev force-pushed the b/oid-in-src2 branch from 8e8f391 to 4a8bf85 Compare September 4, 2023 16:55

BaurzhanSakhariev added 4 commits September 4, 2023 17:24

Support OID assignment for tables created by SQLExecutor.addTable

989ac2f

Added AddColumnResponse containing references with OIDs

f1116ec

Update Indexer targets after doing schema update

005ab93

Base automatically changed from b/oid-in-src2 to drop-column September 4, 2023 17:24

BaurzhanSakhariev force-pushed the b/write-oid2 branch 3 times, most recently from 2f3d562 to 0622fcf Compare September 5, 2023 12:36

seut reviewed Sep 5, 2023

BaurzhanSakhariev force-pushed the b/write-oid2 branch from 0622fcf to 15c0a06 Compare September 5, 2023 14:08

seut reviewed Sep 5, 2023

BaurzhanSakhariev force-pushed the b/write-oid2 branch 2 times, most recently from cef00d6 to cd5ee19 Compare September 5, 2023 14:49

BaurzhanSakhariev commented Sep 5, 2023

BaurzhanSakhariev force-pushed the b/write-oid2 branch 2 times, most recently from b37e849 to 6f94952 Compare September 5, 2023 15:02

BaurzhanSakhariev commented Sep 5, 2023

BaurzhanSakhariev added 2 commits September 5, 2023 17:57

Write column OID to the source

7b4f5cf

ignore fulltext tests (temporary)

bab9e60

BaurzhanSakhariev force-pushed the b/write-oid2 branch from 6f94952 to bab9e60 Compare September 5, 2023 15:57

BaurzhanSakhariev added 5 commits September 5, 2023 18:12

Use OID on reverse index/doc values

b9c18b1

Revert "Use OID on reverse index/doc values"

4899838

This reverts commit b9c18b1

Use OID on reverse index/doc values - version 2

0ca43ad

Update LuceneReferenceResolver to access data by oid

786c877

todo for LuceneReferenceResolver

f1f15c9

BaurzhanSakhariev force-pushed the drop-column branch from b94ee50 to 8c97f41 Compare September 7, 2023 09:27

BaurzhanSakhariev force-pushed the drop-column branch from a992aca to ce9e07c Compare September 14, 2023 11:04

BaurzhanSakhariev closed this Sep 14, 2023

BaurzhanSakhariev deleted the b/write-oid2 branch September 14, 2023 13:03

BaurzhanSakhariev wants to merge 22 commits into drop-column from b/write-oid2

3 participants
