
Implement multipart upload for azureblob-sdk provider #904

Merged
gaul merged 10 commits into gaul:master from klaudworks:add-azureblobsdk-multipart-upload on Nov 20, 2025

Conversation

@klaudworks
Contributor

@klaudworks klaudworks commented Oct 18, 2025

Current Approach

The current approach forwards the input stream from the incoming request directly to the Azure SDK. I use a Flux<ByteBuffer> to work around the limitation that our input stream does not support mark/reset. I looked into azure-sdk-for-java after seeing your issue (Azure/azure-sdk-for-java#42603); this appears to be the only workaround.
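
For context, here is a minimal sketch of the idea (simplified, not the exact PR code): adapt the request's non-resettable InputStream into a Flux<ByteBuffer> that reads a fresh chunk per emission, which the async BlockBlobAsyncClient.stageBlock accepts. The helper name and the 4 MiB chunk size are illustrative only.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.ByteBuffer;
import reactor.core.publisher.Flux;

// Sketch: turn an InputStream of known length into a Flux<ByteBuffer>.
static Flux<ByteBuffer> toFlux(InputStream in, long contentLength, int chunkSize) {
    return Flux.<ByteBuffer, Long>generate(() -> 0L, (position, sink) -> {
        if (position >= contentLength) {
            sink.complete();
            return position;
        }
        int toRead = (int) Math.min(chunkSize, contentLength - position);
        byte[] chunk = new byte[toRead];  // fresh buffer per emission; the SDK may still hold earlier chunks
        try {
            int read = in.readNBytes(chunk, 0, toRead);
            if (read <= 0) {
                sink.complete();  // unexpected EOF
                return position;
            }
            sink.next(ByteBuffer.wrap(chunk, 0, read));
            return position + read;
        } catch (IOException ioe) {
            sink.error(new UncheckedIOException(ioe));
            return position;
        }
    });
}

// Usage with the async client:
// blockBlobAsyncClient
//     .stageBlock(base64BlockId, toFlux(stream, length, 4 * 1024 * 1024), length)
//     .block();
```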

Current Limitations

  1. Given that we don't really have a place to store MD5-based ETags for each part, we return them but don't actually use them to complete the multipart upload. Instead, we find all parts belonging to a multipart upload via the upload id (see the sketch after this list). For the assembled blob we simply return Azure's ETag.
  2. We can't make use of the azure-sdk-for-java retry logic on network issues; there is no way to solve this without persisting the input stream. The problem is pushed down to the S3Proxy client's AWS SDK, which will retry failed part uploads.
  3. We don't handle If-Match / If-None-Match headers for conditional writes, which S3 supports for CompleteMultipartUpload requests. EDIT: I added a commit to support conditional writes for CompleteMultipartUpload requests.
  4. AWS supports part sizes up to 5 GiB. I updated the maximum part size to 4000 MiB, Azure's block limit as documented at https://learn.microsoft.com/en-us/azure/storage/blobs/scalability-targets. The AWS SDK default part size is 8 MB, so I would rather not implement complicated chunking of parts on our side just to cover the edge case where people upload 4-5 GiB parts.
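
To make limitation 1 concrete, here is a rough sketch of how CompleteMultipartUpload can be assembled from the staged blocks, assuming a hypothetical naming scheme in which each base64 block name decodes to `<uploadId>-<partNumber>`; the helper names and the exact encoding are illustrative, not the PR's actual code.

```java
import com.azure.storage.blob.models.Block;
import com.azure.storage.blob.models.BlockListType;
import com.azure.storage.blob.specialized.BlockBlobClient;
import java.nio.charset.StandardCharsets;
import java.util.Base64;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;

// Sketch: commit all uncommitted blocks that belong to this upload id, in part order.
static String completeUpload(BlockBlobClient blob, String uploadId) {
    List<String> blockIds = blob.listBlocks(BlockListType.UNCOMMITTED)
            .getUncommittedBlocks().stream()
            .map(Block::getName)
            // keep only blocks whose decoded name carries this upload id
            .filter(name -> decode(name).startsWith(uploadId + "-"))
            // order by the part number encoded after the upload id
            .sorted(Comparator.comparingInt((String name) ->
                    Integer.parseInt(decode(name).substring(uploadId.length() + 1))))
            .collect(Collectors.toList());
    // Azure computes its own ETag for the assembled blob; that is what we return to the S3 client.
    return blob.commitBlockList(blockIds).getETag();
}

static String decode(String base64BlockName) {
    return new String(Base64.getDecoder().decode(base64BlockName), StandardCharsets.UTF_8);
}
```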

Alternative Approaches

Using a BufferedInputStream

I discarded this solution due to potentially high memory usage when uploading multiple parts in parallel.

Persisting the input stream on disk

Limitation 1: potentially doubles the time until a part upload completes; this can be compensated for by uploading more parts in parallel.

Limitation 2: users need to be told to provide sufficient /tmp storage on sufficiently fast SSDs; on an HDD they would be heavily I/O bound.

Benefit: we could calculate proper ETags for each part and store the MD5 hash, e.g. encoded in the block name. This is not possible with the current solution because the block name must already be provided when we pass the request's input stream along to the Azure SDK.
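
For illustration, a minimal sketch of that discarded alternative, assuming a hypothetical helper that spools a part to a caller-supplied temp file while computing its MD5 (names and error handling are simplified, not code from this PR):

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.security.DigestInputStream;
import java.security.MessageDigest;

// Sketch: copy the part to disk while hashing it, returning the MD5 hex digest.
static String spoolPartWithMd5(InputStream part, Path destination) throws Exception {
    MessageDigest md5 = MessageDigest.getInstance("MD5");
    try (DigestInputStream din = new DigestInputStream(part, md5)) {
        // the SDK could later re-read (and retry) from destination
        Files.copy(din, destination, StandardCopyOption.REPLACE_EXISTING);
    }
    StringBuilder hex = new StringBuilder();
    for (byte b : md5.digest()) {
        hex.append(String.format("%02x", b));
    }
    return hex.toString();  // usable as the S3-style part ETag
}
```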

s3-tests Update

gaul/s3-tests#4

EDIT: I am also running this on a test Kubernetes cluster and it seems to work just fine. Some tooling (cnpg, a Kafka connector) can run its backups through it.

Fixes

#709 #553 #552

@klaudworks
Contributor Author

FYI @gaul I'm running this in a staging k8s cluster and it seems to work just fine.

Owner

@gaul gaul left a comment

Please address comments in uploadMultipartPart but overall this looks good!

}
int chunkSize = (int) Math.min(4 * 1024 * 1024L, contentLength - position);
ByteBuffer buffer = ByteBuffer.allocate(chunkSize);
Owner

Can you allocate this outside the loop and reuse it within the loop? You will want to call ByteBuffer.reset here.

Contributor Author

This may be a bad idea because there is no guarantee that the Azure SDK strictly processes one ByteBuffer after another; we could end up overwriting a ByteBuffer that the SDK still holds a reference to.

@klaudworks
Contributor Author

@gaul addressed all your other comments. Thanks for reviewing it.

@klaudworks klaudworks requested a review from gaul November 12, 2025 07:41
@DzeCin

This comment was marked as resolved.

@klaudworks
Contributor Author

@DzeCin what makes you think so? There should be no state persisted in the s3 proxy.

@DzeCin

DzeCin commented Nov 13, 2025

@DzeCin what makes you think so? There should be no state persisted in the s3 proxy.

You are right I misread the code, sorry for the confusion.

@gaul gaul merged commit fb96699 into gaul:master Nov 20, 2025
3 checks passed
@gaul
Owner

gaul commented Nov 20, 2025

Thank you for your contribution @klaudworks! Let me tidy up a few other things but I will run a new release soon.

@klaudworks
Contributor Author

klaudworks commented Nov 20, 2025

@gaul no hurry, I am currently running a custom build in production anyways. Thank you for reviewing the changes!
