Skip to content

GH-40055: [Java][Docs] Simplify use of Filter and Expression into Dataset Substrait#40056

Merged
lidavidm merged 1 commit intoapache:mainfrom
davisusanibar:JAVA-ARROW-SUBSTRAIT
Feb 15, 2024
Merged

GH-40055: [Java][Docs] Simplify use of Filter and Expression into Dataset Substrait#40056
lidavidm merged 1 commit intoapache:mainfrom
davisusanibar:JAVA-ARROW-SUBSTRAIT

Conversation

@davisusanibar
Copy link
Copy Markdown
Contributor

@davisusanibar davisusanibar commented Feb 12, 2024

Rationale for this change

Simplify creation of SQL Expression Filter and Projections into Arrow Java Dataset module using new Substrait Feature for SQL Expressions.

What changes are included in this PR?

Update Apache Arrow Java Dataset Substrait documentation

Are these changes tested?

Yes

Are there any user-facing changes?

No

@github-actions
Copy link
Copy Markdown

⚠️ GitHub issue #40055 has been automatically assigned in GitHub to PR creator.

@kou kou changed the title GH-40055: [Java][Doc] Simplify use of Filter and Expression into Dataset Substrait GH-40055: [Java][Docs] Simplify use of Filter and Expression into Dataset Substrait Feb 13, 2024
.. code-block:: text

ADD_TEN_TO_COLUMN_N_REGIONKEY CONCAT_COLUMNS_N_NAME_AND_N_COMMENT
column-1 column-2
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@davisusanibar I guess this column name changes occur from the Substrait end?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, it is.

I just created this issue for enhancements "Produce projection/filter extended expressions with predefined column names" substrait-io/substrait-java#229

@github-actions github-actions bot added awaiting committer review Awaiting committer review and removed awaiting review Awaiting review labels Feb 13, 2024
while (reader.loadNextBatch()) {
System.out.println(
reader.getVectorSchemaRoot().contentToTSVString());
System.out.println(reader.getVectorSchemaRoot().contentToTSVString());
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: does this need formatting. I am merely asking this comparing with the existing code.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll review how a linter will be configured/implemented for Java code.

Copy link
Copy Markdown
Contributor

@vibhatha vibhatha left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@davisusanibar thank you for working on this. PR LGTM!

@vibhatha vibhatha requested a review from danepitkin February 13, 2024 07:02
Copy link
Copy Markdown
Member

@danepitkin danepitkin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@lidavidm lidavidm merged commit a03d957 into apache:main Feb 15, 2024
@lidavidm lidavidm removed the awaiting committer review Awaiting committer review label Feb 15, 2024
@github-actions github-actions bot added the awaiting merge Awaiting merge label Feb 15, 2024
@conbench-apache-arrow
Copy link
Copy Markdown

After merging your PR, Conbench analyzed the 4 benchmarking runs that have been run so far on merge-commit a03d957.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details. It also includes information about 4 possible false positives for unstable benchmarks that are known to sometimes produce them.

dgreiss pushed a commit to dgreiss/arrow that referenced this pull request Feb 19, 2024
…to Dataset Substrait (apache#40056)

### Rationale for this change

Simplify creation of SQL Expression Filter and Projections into Arrow Java Dataset module using new [Substrait Feature for SQL Expressions](https://github.com/substrait-io/substrait-java/releases/tag/v0.26.0).

### What changes are included in this PR?

Update Apache Arrow Java Dataset Substrait documentation

### Are these changes tested?

Yes

### Are there any user-facing changes?

No
* Closes: apache#40055

Authored-by: david dali susanibar arce <davi.sarces@gmail.com>
Signed-off-by: David Li <li.davidm96@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Java][Dataset][Substrait][Docs] Simplify creation of SQL Expression Filter and Projections using new Substrait Feature

4 participants