
Limit size of shardDeleteResults#133558

Merged
joshua-adams-1 merged 48 commits into elastic:main from joshua-adams-1:limit-shard-blobs-to-delete
Oct 21, 2025

Conversation

@joshua-adams-1
Contributor

@joshua-adams-1 joshua-adams-1 commented Aug 26, 2025

Modifies BlobStoreRepository.ShardBlobsToDelete.shardDeleteResults to have a variable size depending on the remaining heap space, rather than a hard-coded 2GB limit, which caused smaller nodes with less heap space to OOM.

Relates to #131822
Closes #116379

Closes ES-12540
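The idea of sizing the buffer relative to the heap can be sketched as follows. This is an illustrative sketch only: the 25% ratio, the 1MiB headroom, and the class/method names are assumptions for the example, not the PR's actual values.

```java
// Illustrative sketch (not the PR's actual code): size the delete-results
// buffer as a fraction of the JVM's max heap instead of a fixed 2GiB.
public class ShardDeleteResultsLimit {
    static final double HEAP_FRACTION = 0.25; // assumed ratio, not the real default

    static int computeLimitBytes(long maxHeapBytes) {
        long candidate = (long) (maxHeapBytes * HEAP_FRACTION);
        // Byte-array-backed streams cannot exceed Integer.MAX_VALUE bytes,
        // so clamp with a little headroom before narrowing to int.
        return Math.toIntExact(Math.min(candidate, Integer.MAX_VALUE - (1L << 20)));
    }

    public static void main(String[] args) {
        // On a 1 GiB heap this yields a 256 MiB limit under the assumed ratio.
        System.out.println(computeLimitBytes(1L << 30));
        System.out.println(computeLimitBytes(Runtime.getRuntime().maxMemory()));
    }
}
```

Because the limit is derived from the heap actually available, a small node gets a proportionally small buffer instead of attempting to grow toward 2GiB and OOMing.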

@joshua-adams-1 joshua-adams-1 marked this pull request as ready for review September 4, 2025 15:32
@joshua-adams-1 joshua-adams-1 requested a review from a team as a code owner September 4, 2025 15:32
@elasticsearchmachine elasticsearchmachine added the `needs:triage` (Requires assignment of a team area label) label Sep 4, 2025
@joshua-adams-1 joshua-adams-1 self-assigned this Sep 4, 2025
@elasticsearchmachine elasticsearchmachine added the `Team:Distributed Coordination (obsolete)` (Meta label for Distributed Coordination team. Obsolete. Please do not use.) label Sep 4, 2025
    // We only want to read this shard delete result if we were able to write the entire object.
    // Otherwise, for partial writes, an EOFException will be thrown upon reading
    if (this.truncatedShardDeleteResultsOutputStream.hasCapacity()) {
        successfullyWrittenBlobsCount += 1;
Member

This replaces resultCount, but it's the count of successfully recorded shards, not blobs.

    if (this.truncatedShardDeleteResultsOutputStream.hasCapacity()) {
        successfullyWrittenBlobsCount += 1;
    } else {
        leakedBlobsCount += 1;
Member

Likewise, this is recording the number of shards with leaked blobs rather than the number of leaked blobs. However, rather than just renaming the variable, I think we should actually count the number of leaked blobs (i.e. += blobsToDelete.size() here).


@DaveCTurner DaveCTurner left a comment


Good stuff, I left only tiny nits about the production code and a few other comments about the testing.

    ClusterSettings clusterSettings = clusterService.getClusterSettings();
    clusterSettings.initializeAndWatch(
        MAX_HEAP_SIZE_FOR_SNAPSHOT_DELETION_SETTING,
        status -> this.maxHeapSizeForSnapshotDeletion = status
Member

Maybe apply the limit here on write (making the field an int) rather than on each read?

Suggested change:

    -    status -> this.maxHeapSizeForSnapshotDeletion = status
    +    maxHeapSizeForSnapshotDeletion -> this.maxHeapSizeForSnapshotDeletion = Math.toIntExact(
    +        Math.min(maxHeapSizeForSnapshotDeletion.getBytes(), Integer.MAX_VALUE - ByteSizeUnit.MB.toBytes(1))
    +    )
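The shape of this suggestion, clamping once inside the settings-update listener so that every subsequent read is a plain int load, can be sketched as a self-contained stand-in. The class and field names below are illustrative, not Elasticsearch's actual API.

```java
import java.util.function.LongConsumer;

// Sketch of the reviewer's suggestion: clamp the configured byte size once,
// at setting-update time, so the stored field is already a safe int and
// readers need no further bounds checks. Names here are hypothetical.
class SnapshotDeletionHeapLimit {
    private volatile int maxHeapSizeForSnapshotDeletion;

    // Invoked whenever the (hypothetical) cluster setting changes; the value
    // is clamped below Integer.MAX_VALUE with 1MiB of headroom, then narrowed.
    final LongConsumer onSettingUpdate = bytes -> this.maxHeapSizeForSnapshotDeletion =
        Math.toIntExact(Math.min(bytes, Integer.MAX_VALUE - (1L << 20)));

    int current() {
        return maxHeapSizeForSnapshotDeletion;
    }
}
```

Clamping on write keeps the overflow handling in one place; the alternative of clamping on every read repeats the same arithmetic at each call site.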

Comment on lines +1747 to +1760
    boolean writeTruncated = false;
    // There is a minimum of 1 byte available for writing
    if (this.truncatedShardDeleteResultsOutputStream.hasCapacity()) {
        new ShardSnapshotMetaDeleteResult(Objects.requireNonNull(indexId.getId()), shardId, blobsToDelete).writeTo(compressed);
        // We only want to read this shard delete result if we were able to write the entire object.
        // Otherwise, for partial writes, an EOFException will be thrown upon reading
        if (this.truncatedShardDeleteResultsOutputStream.hasCapacity()) {
            resultsCount += 1;
        } else {
            writeTruncated = true;
        }
    } else {
        writeTruncated = true;
    }
Member

A matter of taste, but consider extracting this section into its own method to clarify that we only get false on the branch that succeeded; all other paths lead to true (and maybe we should invert that so that true means "success").

        private boolean writeBlobsIfCapacity(IndexId indexId, int shardId, Collection<String> blobsToDelete) throws IOException {
            // There is a minimum of 1 byte available for writing
            if (this.truncatedShardDeleteResultsOutputStream.hasCapacity()) {
                new ShardSnapshotMetaDeleteResult(Objects.requireNonNull(indexId.getId()), shardId, blobsToDelete).writeTo(compressed);
                // We only want to read this shard delete result if we were able to write the entire object.
                // Otherwise, for partial writes, an EOFException will be thrown upon reading
                if (this.truncatedShardDeleteResultsOutputStream.hasCapacity()) {
                    resultsCount += 1;
                    return false;
                }
            }

            return true;
        }
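The before-and-after capacity check in that method can be demonstrated with a self-contained stand-in. CappedOutputStream below is a hypothetical substitute for the PR's truncating stream, and here the boolean is inverted so that true means "fully written", as the review suggests.

```java
import java.io.ByteArrayOutputStream;

// Stand-in for the PR's truncating stream: accepts writes up to a cap and
// silently drops the rest, exposing hasCapacity() like the real class.
class CappedOutputStream extends ByteArrayOutputStream {
    private final int cap;

    CappedOutputStream(int cap) {
        this.cap = cap;
    }

    boolean hasCapacity() {
        return size() < cap;
    }

    @Override
    public synchronized void write(byte[] b, int off, int len) {
        int room = Math.max(0, cap - size());
        super.write(b, off, Math.min(len, room)); // drop bytes beyond the cap
    }

    @Override
    public synchronized void write(int b) {
        if (size() < cap) super.write(b);
    }

    // Mirrors the check-before-and-after pattern: a record counts as a
    // success only if capacity remains afterwards, i.e. it was fully
    // retained. Hitting the cap is conservatively treated as truncation,
    // since a partial record would throw EOFException on read.
    boolean writeRecord(byte[] record) {
        if (!hasCapacity()) return false;
        write(record, 0, record.length);
        return hasCapacity();
    }
}
```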

Comment on lines +844 to +847
assertEquals(expectedShardGenerations.build(), shardBlobsToDelete.getUpdatedShardGenerations());
shardBlobsToDelete.getBlobPaths().forEachRemaining(s -> assertTrue(expectedBlobsToDelete.remove(s)));
assertThat(expectedBlobsToDelete, empty());
assertThat(shardBlobsToDelete.sizeInBytes(), lessThanOrEqualTo(Math.max(ByteSizeUnit.KB.toIntBytes(1), 20 * blobCount)));
Member

Hmm, it's not really within the implied contract of this class to be able to iterate its contents and then try to append more items. We've already closed the underlying compressed stream by this point. I think we should defer these assertions until the end.


// === Second, now capacity is exceeded, test whether subsequent writes are accepted without throwing an error === //

for (int i = 0; i < randomIntBetween(1, 20); i++) {
Member

This is trappy: it generates a new randomIntBetween value on each iteration, so you don't get a uniform distribution of iteration counts. Better to count down from between(1, 20) to zero, or else extract a variable for the upper bound.
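The trap and its fix can be shown side by side; between() below is a stand-in for the test framework's randomIntBetween(), and the class name is illustrative.

```java
import java.util.Random;

// Demonstrates the trap: re-drawing the random bound in the loop condition
// skews the iteration count, while drawing it once gives a uniform 1..20.
class RandomBoundTrap {
    static int between(Random random, int min, int max) {
        return min + random.nextInt(max - min + 1);
    }

    // Trappy form: the condition draws a fresh bound every iteration, so the
    // loop tends to exit early and long runs become exponentially unlikely.
    static int trappyIterations(Random random) {
        int count = 0;
        for (int i = 0; i < between(random, 1, 20); i++) {
            count++;
        }
        return count;
    }

    // Fixed form: draw the bound once, as the review suggests.
    static int fixedIterations(Random random) {
        final int iterations = between(random, 1, 20);
        int count = 0;
        for (int i = 0; i < iterations; i++) {
            count++;
        }
        return count;
    }
}
```

In the fixed form each count from 1 to 20 is equally likely; in the trappy form reaching iteration k requires surviving k independent draws, so high counts are heavily underrepresented.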

// === First, write blobs until capacity is exceeded === //

// While there is at least one byte in the stream, write
while (shardBlobsToDelete.sizeInBytes() < heapMemory) {
Member

WDYT about iterating until leakedBlobCount reaches some target value here, rather than doing the two separate loops?


@DaveCTurner DaveCTurner left a comment


LGTM

final var indexId = new IndexId(randomIdentifier(), randomUUID());
final var shardId = between(1, 30);
final var shardGeneration = new ShardGeneration(randomUUID());
// Always write at least one blob, guaranteeing that the shardDeleteResults stream increases in size
Member

👍 well spotted (in fact the stream grows anyway because we write the index ID, but that doesn't necessarily increase leakedBlobCount)

@joshua-adams-1 joshua-adams-1 merged commit 236c9fe into elastic:main Oct 21, 2025
34 checks passed
@joshua-adams-1 joshua-adams-1 deleted the limit-shard-blobs-to-delete branch October 21, 2025 14:33
chrisparrinello pushed a commit to chrisparrinello/elasticsearch that referenced this pull request Oct 24, 2025
Modifies `BlobStoreRepository.ShardBlobsToDelete.shardDeleteResults` to have
a variable size depending on the remaining heap space rather than a
hard-coded 2GB size which caused smaller nodes with less heap
space to OOMe.

Relates to elastic#131822

Closes ES-12540
fzowl pushed a commit to voyage-ai/elasticsearch that referenced this pull request Nov 3, 2025
Modifies `BlobStoreRepository.ShardBlobsToDelete.shardDeleteResults` to have
a variable size depending on the remaining heap space rather than a
hard-coded 2GB size which caused smaller nodes with less heap
space to OOMe.

Relates to elastic#131822

Closes ES-12540
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Jan 8, 2026
In elastic#133558 we imposed a limit on the heap used to keep track of
shard-level blobs to clean up after the commit of a snapshot deletion.
This commit makes use of the same mechanism to track `IndexMetadata`
blobs for future deletion.

Closes elastic#140018
DaveCTurner added a commit that referenced this pull request Jan 12, 2026
In #133558 we imposed a limit on the heap used to keep track of
shard-level blobs to clean up after the commit of a snapshot deletion.
This commit makes use of the same mechanism to track `IndexMetadata`
blobs for future deletion.

Closes #140018
jimczi pushed a commit to jimczi/elasticsearch that referenced this pull request Jan 12, 2026
In elastic#133558 we imposed a limit on the heap used to keep track of
shard-level blobs to clean up after the commit of a snapshot deletion.
This commit makes use of the same mechanism to track `IndexMetadata`
blobs for future deletion.

Closes elastic#140018
@repantis repantis added the `:Distributed/Distributed` (A catch all label for anything in the Distributed Area. Please avoid if you can.) label and removed the `:Distributed Coordination/Distributed` label Jan 28, 2026

Labels

`:Distributed/Distributed` (A catch all label for anything in the Distributed Area. Please avoid if you can.), `>non-issue`, `Team:Distributed Coordination (obsolete)` (Meta label for Distributed Coordination team. Obsolete. Please do not use.), `v9.3.0`


Development

Successfully merging this pull request may close these issues.

Snapshot delete tasks do not complete if blobs-to-delete list exceeds 2GiB

4 participants