[GLUTEN-2169][VL] fix: some spark 33 ut fixed by gaoyangxiaozhu · Pull Request #2563 · apache/gluten

gaoyangxiaozhu · 2023-08-02T02:14:27Z

fix some spark 33 uts #2169

new supported suites

GlutenParquetDeltaByteArrayEncodingSuite
GlutenParquetDeltaLengthByteArrayEncodingSuite
GlutenParquetFieldIdIOSuite
GlutenParquetVectorizedSuite
GlutenFileMetadataStructSuite
GlutenParquetV2AggregatePushDownSuite exclude "aggregate push down - different data types"
GlutenOrcV1AggregatePushDownSuite exclude "aggregate push down - different data types"
GlutenOrcV2AggregatePushDownSuite exclude "aggregate push down - different data types"
GlutenReplaceHashWithSortAggSuite
GlutenBroadcastJoinSuite

Here the basic principal of this PR is to fallback all unsupported scenario/op to JVM to quick quick the UTs issue.

Open 3 new issues to let refactor code in future do native support insteads fallback for maybe better perf gain.

pushaggregate support in native scan - #2617
medascan column support in native scan - #2618
match column by filedIds in native scan #2619

github-actions · 2023-08-02T02:14:47Z

Thanks for opening a pull request!

Could you open an issue for this pull request on Github Issues?

https://github.com/oap-project/gluten/issues

Then could you also rename commit message and pull request title in the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:

Other pull requests

github-actions · 2023-08-02T02:15:00Z

Run Gluten Clickhouse CI

gluten-core/src/main/scala/io/glutenproject/execution/BatchScanExecTransformer.scala

Yohahaha · 2023-08-02T02:35:03Z

Would you add newly supported suites name in PR description?

gaoyangxiaozhu · 2023-08-02T03:14:11Z

Would you add newly supported suites name in PR description?

updated thanks

github-actions · 2023-08-02T05:34:49Z

Run Gluten Clickhouse CI

github-actions · 2023-08-02T13:18:33Z

Run Gluten Clickhouse CI

github-actions · 2023-08-03T01:42:47Z

#2169

gaoyangxiaozhu · 2023-08-03T01:50:38Z

Would you add newly supported suites name in PR description?

updated thanks

can anyone @Yohahaha @zhouyuan @FelixYBW help re-run the fail workflow job ? I don't have permission to the thing (re-run failure job)

Yohahaha · 2023-08-03T02:14:47Z

@gaoyangxiaozhu This command can help you.

git commit --allow-empty -m "Trigger CI" && git push

github-actions · 2023-08-03T03:57:53Z

Run Gluten Clickhouse CI

…luten into gayangya/gluten

github-actions · 2023-08-03T06:35:17Z

Run Gluten Clickhouse CI

github-actions · 2023-08-05T15:07:56Z

Run Gluten Clickhouse CI

github-actions · 2023-08-05T15:08:34Z

Run Gluten Clickhouse CI

gaoyangxiaozhu · 2023-08-06T10:36:38Z

anyone help checking for what thing goes wrong in Gluten ClickHouse CI ?

zhouyuan · 2023-08-07T02:09:02Z

Run Gluten Clickhouse CI

philo-he · 2023-08-07T02:13:40Z

Hi @gaoyangxiaozhu, you can log in ClickHouse CI with a public account/password. See https://github.com/oap-project/gluten/blob/main/docs/get-started/ClickHouse.md#new-ci-system.
If you need to just re-trigger ClickHouse CI, you can comment with Run Gluten Clickhouse CI.

gaoyangxiaozhu · 2023-08-07T03:36:34Z

Run Gluten Clickhouse CI

Run Gluten Clickhouse CI

gaoyangxiaozhu · 2023-08-07T06:57:02Z

Run Gluten Clickhouse CI

Run Gluten Clickhouse CI
Run Gluten Clickhouse CI

gaoyangxiaozhu · 2023-08-07T06:57:54Z

Run Gluten Clickhouse CI

gaoyangxiaozhu · 2023-08-07T09:50:12Z

Velox TPCH SF2000 performance report: Error: Could not connect to https://api.github.com/

anyone help for ClickHouse CI issue, i manually trigger multiple times always fail but i can't find RCA from the log

zhouyuan · 2023-08-07T11:55:05Z

@zzcclp It seem the CI on clickhouse showed all tests passed, but still failed on exit, could you please help to take a look on this?

thanks, -yuan

github-actions · 2023-08-08T06:33:54Z

Run Gluten Clickhouse CI

github-actions · 2023-08-08T06:34:04Z

Run Gluten Clickhouse CI

zhli1142015

LGTM.

philo-he · 2023-08-08T12:36:26Z

...spark33/src/main/scala/org/apache/spark/sql/execution/datasources/v2/BatchScanExecShim.scala


+  def pushedAggregate(fileFormat: String): Option[Aggregation] = {
+    fileFormat match {
+      case "parquet" => scan.asInstanceOf[ParquetScan].pushedAggregate


Hi @gaoyangxiaozhu, can we directly use the below code to get the pushed aggregate? It seems there is no need to get file format to do the matching. Thanks!

@transient lazy val pushedAggregate: Option[Aggregation] = { scan match { case s: ParquetScan => s.pushedAggregate case o: OrcScan => o.pushedAggregate case _ => None } }

zhouyuan · 2023-08-08T12:47:51Z

@gaoyangxiaozhu thanks for enabling those unit tests!

philo-he

Looks great! I will do some small refinement in a follow-up PR. Thanks for the contribution!

some spark 33 ut fixed

aad7a4b

Yohahaha reviewed Aug 2, 2023

View reviewed changes

gluten-core/src/main/scala/io/glutenproject/execution/BatchScanExecTransformer.scala Outdated Show resolved Hide resolved

fix code style issue

e0b1c0d

enable GlutenReplaceHashWithSortAggSuite

b38311d

gaoyangxiaozhu changed the title ~~some spark 33 ut fixed~~ [GLUTEN-2169][VL] fix: some spark 33 ut fixed Aug 3, 2023

Merge branch 'main' into gayangya/gluten

de4ceba

Yangyang Gao added 3 commits August 3, 2023 14:34

Trigger CI

130ebab

Merge branch 'gayangya/gluten' of https://github.com/gaoyangxiaozhu/g…

852ce93

…luten into gayangya/gluten

Trigger CI

d23307a

Yangyang Gao added 2 commits August 5, 2023 21:23

some spark33 ut fixed

0939eee

add filesourcescanshim to fix 32 build issue

9a7d300

Merge branch 'main' into gayangya/gluten

52dbf6e

This was referenced Aug 6, 2023

[VL][Spark 3.3+] support return metadataColumns from native scan insteads of fallback #2618

Closed

[VL][Spark 3.3+] support match columns use filedIds in native insteads of fallback #2619

Closed

This comment was marked as off-topic.

Sign in to view

Yangyang Gao and others added 2 commits August 8, 2023 14:33

try fix clickhouse ci fail issue

fabb1aa

Merge branch 'main' into gayangya/gluten

1447b1c

zhli1142015 approved these changes Aug 8, 2023

View reviewed changes

philo-he reviewed Aug 8, 2023

View reviewed changes

zhouyuan merged commit 0f0127d into apache:main Aug 8, 2023

philo-he reviewed Aug 8, 2023

View reviewed changes

This was referenced Nov 9, 2023

[Gluten-core][VL] Supports DeltaLake 2.2 Read #3376

Closed

[CORE][VL] Support RewriteTransformer Rules and DeltaLake Scan #3646

Merged

Conversation

gaoyangxiaozhu commented Aug 2, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Aug 2, 2023

Uh oh!

github-actions bot commented Aug 2, 2023

Uh oh!

Uh oh!

Yohahaha commented Aug 2, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gaoyangxiaozhu commented Aug 2, 2023

Uh oh!

github-actions bot commented Aug 2, 2023

Uh oh!

github-actions bot commented Aug 2, 2023

Uh oh!

github-actions bot commented Aug 3, 2023

Uh oh!

gaoyangxiaozhu commented Aug 3, 2023

Uh oh!

Yohahaha commented Aug 3, 2023

Uh oh!

github-actions bot commented Aug 3, 2023

Uh oh!

github-actions bot commented Aug 3, 2023

Uh oh!

github-actions bot commented Aug 5, 2023

Uh oh!

github-actions bot commented Aug 5, 2023

Uh oh!

gaoyangxiaozhu commented Aug 6, 2023

Uh oh!

zhouyuan commented Aug 7, 2023

Uh oh!

philo-he commented Aug 7, 2023

Uh oh!

gaoyangxiaozhu commented Aug 7, 2023

Uh oh!

gaoyangxiaozhu commented Aug 7, 2023

Uh oh!

gaoyangxiaozhu commented Aug 7, 2023

Uh oh!

This comment was marked as off-topic.

Uh oh!

gaoyangxiaozhu commented Aug 7, 2023

Uh oh!

zhouyuan commented Aug 7, 2023

Uh oh!

github-actions bot commented Aug 8, 2023

Uh oh!

github-actions bot commented Aug 8, 2023

Uh oh!

zhli1142015 left a comment

Choose a reason for hiding this comment

Uh oh!

philo-he Aug 8, 2023

Choose a reason for hiding this comment

Uh oh!

zhouyuan commented Aug 8, 2023

Uh oh!

philo-he left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

gaoyangxiaozhu commented Aug 2, 2023 •

edited

Loading

Yohahaha commented Aug 2, 2023 •

edited

Loading