Skip to content

[GLUTEN-2169][VL] fix: some spark 33 ut fixed#2563

Merged
zhouyuan merged 12 commits intoapache:mainfrom
gaoyangxiaozhu:gayangya/gluten
Aug 8, 2023
Merged

[GLUTEN-2169][VL] fix: some spark 33 ut fixed#2563
zhouyuan merged 12 commits intoapache:mainfrom
gaoyangxiaozhu:gayangya/gluten

Conversation

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor

@gaoyangxiaozhu gaoyangxiaozhu commented Aug 2, 2023

fix some spark 33 uts #2169

new supported suites

GlutenParquetDeltaByteArrayEncodingSuite
GlutenParquetDeltaLengthByteArrayEncodingSuite
GlutenParquetFieldIdIOSuite
GlutenParquetVectorizedSuite
GlutenFileMetadataStructSuite
GlutenParquetV2AggregatePushDownSuite exclude "aggregate push down - different data types"
GlutenOrcV1AggregatePushDownSuite exclude "aggregate push down - different data types"
GlutenOrcV2AggregatePushDownSuite exclude "aggregate push down - different data types"
GlutenReplaceHashWithSortAggSuite
GlutenBroadcastJoinSuite

Here the basic principal of this PR is to fallback all unsupported scenario/op to JVM to quick quick the UTs issue.

Open 3 new issues to let refactor code in future do native support insteads fallback for maybe better perf gain.

pushaggregate support in native scan - #2617
medascan column support in native scan - #2618
match column by filedIds in native scan #2619

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 2, 2023

Thanks for opening a pull request!

Could you open an issue for this pull request on Github Issues?

https://github.com/oap-project/gluten/issues

Then could you also rename commit message and pull request title in the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 2, 2023

Run Gluten Clickhouse CI

@Yohahaha
Copy link
Copy Markdown
Contributor

Yohahaha commented Aug 2, 2023

Would you add newly supported suites name in PR description?

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor Author

Would you add newly supported suites name in PR description?

updated thanks

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 2, 2023

Run Gluten Clickhouse CI

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 2, 2023

Run Gluten Clickhouse CI

@gaoyangxiaozhu gaoyangxiaozhu changed the title some spark 33 ut fixed [GLUTEN-2169][VL] fix: some spark 33 ut fixed Aug 3, 2023
@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 3, 2023

#2169

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor Author

Would you add newly supported suites name in PR description?

updated thanks

can anyone @Yohahaha @zhouyuan @FelixYBW help re-run the fail workflow job ? I don't have permission to the thing (re-run failure job)

@Yohahaha
Copy link
Copy Markdown
Contributor

Yohahaha commented Aug 3, 2023

@gaoyangxiaozhu This command can help you.

git commit --allow-empty -m "Trigger CI" && git push

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 3, 2023

Run Gluten Clickhouse CI

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 3, 2023

Run Gluten Clickhouse CI

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 5, 2023

Run Gluten Clickhouse CI

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 5, 2023

Run Gluten Clickhouse CI

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor Author

anyone help checking for what thing goes wrong in Gluten ClickHouse CI ?

@zhouyuan
Copy link
Copy Markdown
Member

zhouyuan commented Aug 7, 2023

Run Gluten Clickhouse CI

@philo-he
Copy link
Copy Markdown
Member

philo-he commented Aug 7, 2023

Hi @gaoyangxiaozhu, you can log in ClickHouse CI with a public account/password. See https://github.com/oap-project/gluten/blob/main/docs/get-started/ClickHouse.md#new-ci-system.
If you need to just re-trigger ClickHouse CI, you can comment with Run Gluten Clickhouse CI.

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor Author

Run Gluten Clickhouse CI

Run Gluten Clickhouse CI

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor Author

Run Gluten Clickhouse CI

Run Gluten Clickhouse CI
Run Gluten Clickhouse CI

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor Author

Run Gluten Clickhouse CI
image

GlutenPerfBot

This comment was marked as off-topic.

@gaoyangxiaozhu
Copy link
Copy Markdown
Contributor Author

Velox TPCH SF2000 performance report: Error: Could not connect to https://api.github.com/

anyone help for ClickHouse CI issue, i manually trigger multiple times always fail but i can't find RCA from the log

@zhouyuan
Copy link
Copy Markdown
Member

zhouyuan commented Aug 7, 2023

@zzcclp It seem the CI on clickhouse showed all tests passed, but still failed on exit, could you please help to take a look on this?

thanks, -yuan

@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 8, 2023

Run Gluten Clickhouse CI

1 similar comment
@github-actions
Copy link
Copy Markdown

github-actions bot commented Aug 8, 2023

Run Gluten Clickhouse CI

Copy link
Copy Markdown
Contributor

@zhli1142015 zhli1142015 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.


def pushedAggregate(fileFormat: String): Option[Aggregation] = {
fileFormat match {
case "parquet" => scan.asInstanceOf[ParquetScan].pushedAggregate
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @gaoyangxiaozhu, can we directly use the below code to get the pushed aggregate? It seems there is no need to get file format to do the matching. Thanks!

@transient lazy val pushedAggregate: Option[Aggregation] = {
    scan match {
      case s: ParquetScan =>
        s.pushedAggregate
      case o: OrcScan =>
        o.pushedAggregate
      case _ =>
        None
    }
  }

@zhouyuan zhouyuan merged commit 0f0127d into apache:main Aug 8, 2023
@zhouyuan
Copy link
Copy Markdown
Member

zhouyuan commented Aug 8, 2023

@gaoyangxiaozhu thanks for enabling those unit tests!

Copy link
Copy Markdown
Member

@philo-he philo-he left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks great! I will do some small refinement in a follow-up PR. Thanks for the contribution!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants