[GLUTEN-11088][VL] Fall back CSV reader #11190
Conversation
Run Gluten ClickHouse CI on ARM
Run Gluten ClickHouse CI on x86
Run Gluten ClickHouse CI on ARM
The tests passed once, but after a rerun CSV failed by
Run Gluten ClickHouse CI on ARM
Trigger the flaky test
zhztheplayer
left a comment
It seems the CI is still failing
Maybe we need to compile Arrow; the flaky test may be caused by a platform difference.
Run Gluten Clickhouse CI on x86
    BloomFilterMightContainJointRewriteRule.apply(
      c.session,
      c.caller.isBloomFilterStatFunction()))
injector.injectPreTransform(c => ArrowScanReplaceRule.apply(c.session))
Just to confirm: is the CSV format no longer supported, or do we only need a fallback for Spark 4.0 and later versions?
CSV format is no longer supported
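For readers following this thread, here is a minimal sketch of what "fall back" could mean in practice. It assumes the fallback amounts to no longer registering a rule that swaps CSV scans for a native Arrow-based scan; the thread does not spell out the mechanism. Every type and name below (`Plan`, `FileScan`, `NativeArrowScan`, `Injector`, `arrowCsvReplace`) is a hypothetical stand-in, not one of Gluten's classes.

```scala
// Toy model only: none of these types come from Gluten or Spark.
object CsvFallbackSketch {
  sealed trait Plan
  final case class FileScan(format: String, path: String) extends Plan
  final case class NativeArrowScan(path: String) extends Plan

  // A pre-transform rule is modeled as a plain Plan => Plan function.
  type Rule = Plan => Plan

  // Hypothetical stand-in for a rule that replaces CSV scans with a native scan.
  val arrowCsvReplace: Rule = {
    case FileScan("csv", path) => NativeArrowScan(path)
    case other => other
  }

  // Hypothetical stand-in for an injector: collects rules, applies them in order.
  final class Injector {
    private var rules = Vector.empty[Rule]

    def injectPreTransform(rule: Rule): Unit = {
      rules = rules :+ rule
    }

    def run(plan: Plan): Plan = rules.foldLeft(plan)((p, r) => r(p))
  }

  def main(args: Array[String]): Unit = {
    val scan = FileScan("csv", "/tmp/data.csv")

    // With the rule registered, the CSV scan is rewritten to the native scan.
    val withRule = new Injector
    withRule.injectPreTransform(arrowCsvReplace)
    println(withRule.run(scan))

    // Without the rule, the plan is left untouched, i.e. the scan falls back.
    val withoutRule = new Injector
    println(withoutRule.run(scan))
  }
}
```

In this model the fallback needs no extra code path: if the rule is never injected, the original scan node simply survives the pre-transform phase.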
Run Gluten Clickhouse CI on x86
Could you help approve? Thanks! @zhztheplayer
philo-he
left a comment
@jinchengchenghh, if the Arrow CSV reader is not required, can we use the official Apache Arrow JAR directly instead of the JAR built locally by developers? cc @zhouyuan
As I remember, several patches are applied to Arrow 15, not only CSV-reader-related changes. For Arrow 18 (Spark 4.0), we use the official release. @philo-he
@jinchengchenghh, do we need to remove those CSV-reader-specific patches under
This PR only falls back the CSV reader; it does not remove the CSV-related Java code. When we decide to remove that code, we will also remove the patches. I'm not sure whether some customers may still be interested in it.
philo-he
left a comment
Thanks for the clarification.
@jinchengchenghh, would you please also fall back CSV for Spark 4.1?
Yes, CSV falls back for all Spark versions in this PR. @baibaichen
Oh, right. We also need to re-enable the CSV-related suites in Spark 4.1. |
Related issue: #11088