
Conversation


@AngersZhuuuu AngersZhuuuu commented Jan 25, 2022

What changes were proposed in this pull request?

The current data source insert SQL commit protocol has the following problems:
case a: when job A and job B write data into different static partitions of the same partitioned table TBL, they conflict because both use the same temp location ${table_location}/_temporary/0/.... When job A finishes, it cleans up this temp location, which deletes job B's temp data and causes job B to fail to write its data.
case b: for the current dynamic partition insert, if we kill a job while it is writing data, the staging dir and its data remain under the table location.
case c: if we use dynamic partition insert to write a new table with a huge number of partitions, we have to move the partition paths one by one. In this case we could instead rename the staging dir to the table path, which is much faster. But to do this, the staging dir must be customizable and must not live under the table location.
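Case a can be illustrated with a small sketch (plain Python; the helper name is made up, and the path layout simply mirrors the description above):

```python
# Sketch of case a: two concurrent jobs writing different static partitions
# of the same table derive their temp dir from the table location alone
# (hypothetical layout mirroring the behaviour described above), so the
# paths collide.
def temp_location(table_location: str, app_attempt_id: int = 0) -> str:
    # The app attempt id is typically 0, so it does not disambiguate jobs.
    return f"{table_location}/_temporary/{app_attempt_id}"

table = "/warehouse/db/tbl"
job_a_tmp = temp_location(table)  # job A, writing partition p=1
job_b_tmp = temp_location(table)  # job B, writing partition p=2

# Both jobs share one temp dir: when job A commits and cleans it up,
# job B's in-flight task files are deleted and job B fails.
assert job_a_tmp == job_b_tmp
print(job_a_tmp)  # /warehouse/db/tbl/_temporary/0
```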

In this approach, we plan to do two things:

  1. Make the staging dir customizable, as in Hive

    • When we kill a job doing a dynamic partition insert, its staging dir remains under the table location. If the staging dir is customizable as in Hive (e.g. defined as /tmp/spark/.stagingdir), we can avoid leaving such staging dirs and data under the table path.

    • If we define the staging dir in a tmp location rather than under the table location, then when a dynamic partition insert writes to a new table we can simply rename the staging dir to the target table location instead of moving partition dirs one by one. This is much faster.

    • If the staging dir is customizable, we can implement a new commit protocol in which non-partitioned table writes and static partition inserts also use the staging dir without increasing the number of FS operations.

  2. Add a new SQL commit protocol that supports a staging dir without increasing the number of FS operations.

The current static partition insert in v1 has these steps:

1. Task attempts first write files under the intermediate path, e.g. /path/to/outputPath/_temporary/{appAttemptId}/_temporary/{taskAttemptId}/{part_spec_path}/xxx.parquet.

2. Task commit then moves the files to /path/to/outputPath/_temporary/{appId}/_temporary/{taskId}/{part_spec_path}/xxx.parquet.

3. Job commit moves the files to /path/to/outputPath/{part_spec_path}/xxx.parquet.

The current dynamic partition insert in v1 has these steps:

1. Task attempts first write files under the intermediate path, e.g. /path/to/outputPath/.spark-staging-{jobId}/_temporary/{appAttemptId}/_temporary/{taskAttemptId}/a=1/b=1/xxx.parquet.
2. Task commit then moves the files to /path/to/outputPath/.spark-staging-{jobId}/_temporary/{appId}/_temporary/{taskId}/a=1/b=1/xxx.parquet.
3. Job commit moves the files to /path/to/outputPath/a=1/b=1/xxx.parquet.
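The three stages above can be sketched as plain path transformations (Python; the id values are illustrative placeholders, not Spark's real attempt/task id formatting):

```python
# Path stages of the v1 dynamic partition insert, mirroring the steps above.
# Ids are illustrative placeholders, not Spark's real formatting.
def stage_paths(out, job_id, app_attempt_id, task_attempt_id,
                app_id, task_id, part, name):
    staging = f"{out}/.spark-staging-{job_id}"
    # 1. task attempt writes here:
    attempt = (f"{staging}/_temporary/{app_attempt_id}"
               f"/_temporary/{task_attempt_id}/{part}/{name}")
    # 2. task commit moves the file here:
    committed = f"{staging}/_temporary/{app_id}/_temporary/{task_id}/{part}/{name}"
    # 3. job commit moves it to the final location:
    final = f"{out}/{part}/{name}"
    return attempt, committed, final

attempt, committed, final = stage_paths(
    "/path/to/outputPath", "jid", 0, "ta_0", 0, "t_0", "a=1/b=1", "xxx.parquet")
print(final)  # /path/to/outputPath/a=1/b=1/xxx.parquet
```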

With the new SQL commit protocol SQLPathHadoopMapReduceCommitProtocol:

for a non-partitioned insert:

1. Task attempts first write files under the intermediate path, e.g. {staging_dir}/_temporary/{appAttemptId}/_temporary/{taskAttemptId}/xxx.parquet.
2. Task commit then moves the files to {staging_dir}/_temporary/{appId}/_temporary/{taskId}/xxx.parquet.
3. Job commit moves the files to /path/to/outputPath/xxx.parquet.

for all static partition inserts:

1. Task attempts first write files under the intermediate path, e.g. {staging_dir}/_temporary/{appAttemptId}/_temporary/{taskAttemptId}/{part_spec_path}/xxx.parquet.
2. Task commit then moves the files to {staging_dir}/_temporary/{appId}/_temporary/{taskId}/{part_spec_path}/xxx.parquet.
3. Job commit moves the files to /path/to/outputPath/{part_spec_path}/xxx.parquet.

for dynamic partition insert, we implement it as:

1. Task attempts first write files under the intermediate path, e.g. {staging_dir}/_temporary/{appAttemptId}/_temporary/{taskAttemptId}/{part_spec_path}/xxx.parquet.
2. Task commit then moves the files to {staging_dir}/_temporary/{appId}/_temporary/{taskId}/{part_spec_path}/xxx.parquet.
3. Job commit moves the files to {staging_dir}/{part_spec_path}/xxx.parquet. Then, if we are writing to an empty table path, we directly rename {staging_dir} to {table_path}; otherwise, we move the partition dirs one by one from {staging_dir}/{part_spec_path} to {table_location}/{part_spec_path}.
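The job-commit decision in step 3 can be sketched like this (plain Python with local directories standing in for HDFS; the helper and variable names are made up, the real logic would live in SQLPathHadoopMapReduceCommitProtocol and use Hadoop FileSystem calls, and the sketch assumes the staging partitions do not already exist under the table path):

```python
# Sketch of the job-commit step for the dynamic partition insert above.
# Names are illustrative; real code uses Hadoop FileSystem operations.
import os
import shutil
import tempfile

def commit_dynamic(staging_dir: str, table_path: str) -> str:
    if not os.path.exists(table_path) or not os.listdir(table_path):
        # New or empty table path: a single rename of the whole staging dir.
        if os.path.exists(table_path):
            os.rmdir(table_path)
        shutil.move(staging_dir, table_path)
        return "renamed"
    # Existing, non-empty table: move each partition dir individually.
    for part in os.listdir(staging_dir):
        shutil.move(os.path.join(staging_dir, part),
                    os.path.join(table_path, part))
    os.rmdir(staging_dir)  # staging dir is empty now
    return "moved-per-partition"

base = tempfile.mkdtemp()

# New table path: the whole staging dir is renamed in one FS operation.
staging = os.path.join(base, ".spark-staging-job1")
os.makedirs(os.path.join(staging, "a=1"))
open(os.path.join(staging, "a=1", "part-0.parquet"), "w").close()
table = os.path.join(base, "tbl")  # does not exist yet
mode = commit_dynamic(staging, table)
print(mode)  # renamed

# Existing table: partition dirs are moved one by one.
staging2 = os.path.join(base, ".spark-staging-job2")
os.makedirs(os.path.join(staging2, "a=2"))
open(os.path.join(staging2, "a=2", "part-0.parquet"), "w").close()
table2 = os.path.join(base, "tbl2")
os.makedirs(os.path.join(table2, "b=9"))  # pre-existing partition data
mode2 = commit_dynamic(staging2, table2)
print(mode2)  # moved-per-partition
```

The single-rename branch is what makes the new-table case fast: one directory rename regardless of how many partitions were written.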

The new SQL commit protocol's benefits are:
- Supports insert into a non-partitioned table from itself
- Supports insert into a partitioned table's static partition while reading data from the target partition
- Supports concurrent inserts into different static partitions

These are all common problems when we insert data through the data source API.

Why are the changes needed?

Provides a more flexible commit protocol without impacting performance.

Does this PR introduce any user-facing change?

Users can set the SQL commit protocol to SQLPathHadoopMapReduceCommitProtocol to use a commit protocol with a staging dir.
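For example (the commit-protocol conf key `spark.sql.sources.commitProtocolClass` exists in Spark; the class's package prefix and the staging-dir conf key below are placeholders, check the PR for the exact names):

```shell
# Sketch only: <package> and the staging-dir conf key are placeholders.
spark-submit \
  --conf spark.sql.sources.commitProtocolClass=<package>.SQLPathHadoopMapReduceCommitProtocol \
  --conf <staging.dir.conf.key>=/tmp/spark/.stagingdir \
  ...
```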

How was this patch tested?

Added UT

@AngersZhuuuu AngersZhuuuu marked this pull request as draft January 25, 2022 11:30
@AngersZhuuuu AngersZhuuuu changed the title [SPARK-36571][SQL] Add new NewSQLHadoopMapReduceCommitProtocol resolve conflict when write into partition table's different partition [SPARK-36571][SQL] Add new SQLPathHadoopMapReduceCommitProtocol resolve conflict when write into partition table's different partition Jan 25, 2022
@AngersZhuuuu AngersZhuuuu marked this pull request as ready for review January 26, 2022 08:49
@AngersZhuuuu
Contributor Author

gentle ping @cloud-fan

@AngersZhuuuu
Contributor Author

@steveloughran Hi Steve, this PR's description may clear up some of your confusion in #33828


@steveloughran steveloughran left a comment


Code looks ok to me; would like to see the viewfs and validation of that -ext-1000 suffix to make sure those bits continue to work in the future, and that regressions are found in unit tests rather than support calls.


import testImplicits._

val stagingParentDir = Utils.createTempDir()

be nice if you could target this against actual filesystem uris rather than just file://; but that will take more changes in the base classes. would help with those of us trying to regression test other committers through spark sql

engineType: String,
jobId: String): Path = {
val extURI = path.toUri
if (extURI.getScheme == "viewfs") {

There's no test for this in the tests that I can see... it'd be good to have that viewfs coverage tested too.

@github-actions

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

@github-actions github-actions bot added the Stale label Sep 28, 2022
@github-actions github-actions bot closed this Sep 29, 2022