Limit memory usage of exchange sender #10387
Conversation
Pull Request Overview
This PR limits memory usage of ExchangeSender by adding a max_buffered_bytes parameter to control buffering across all writer types. The root cause of high memory usage was unlimited buffering based only on row counts, which could consume excessive memory with large rows and high thread counts.
Key changes:
- Adds `max_buffered_bytes` parameter to all exchange writer constructors and functions
- Implements `needFlush()` method that checks both byte and row limits for flushing decisions
- Introduces `getMaxBufferedBytesInResponseWriter()` utility function to calculate appropriate buffer limits
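A minimal sketch of what a combined byte-and-row flush check like the one summarized above could look like. The field and method names (`rows_in_blocks`, `bytes_in_blocks`, `batch_send_min_limit`, `needFlush`) follow the PR summary but are illustrative; the actual members of DAGResponseWriter may differ.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical sketch of the unified buffering check: the writer flushes
// when either the buffered row count or the buffered byte count reaches
// its limit, so large rows can no longer accumulate unbounded memory.
struct ResponseWriterBuffer
{
    std::uint64_t rows_in_blocks = 0;
    std::uint64_t bytes_in_blocks = 0;
    std::uint64_t batch_send_min_limit = 0; // row-based limit (pre-existing)
    std::uint64_t max_buffered_bytes = 0;   // byte-based limit added by this PR

    bool needFlush() const
    {
        return rows_in_blocks >= batch_send_min_limit
            || bytes_in_blocks >= max_buffered_bytes;
    }
};
```

With only the row limit, a small number of very wide rows could stay buffered; the byte check catches that case early.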
Reviewed Changes
Copilot reviewed 18 out of 18 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| PhysicalExchangeSender.cpp | Adds concurrency calculation and max_buffered_bytes parameter to writer creation |
| newMPPExchangeWriter.h/.cpp | Adds max_buffered_bytes parameter to function signatures and passes it through |
| HashPartitionWriter.h/.cpp | Refactors to use base class buffering fields and byte-based flushing |
| FineGrainedShuffleWriter.h/.cpp | Similar refactoring to use unified buffering approach |
| BroadcastOrPassThroughWriter.h/.cpp | Updates to use base class buffering and byte limits |
| StreamingDAGResponseWriter.h/.cpp | Refactors buffering logic to use base class implementation |
| DAGResponseWriter.h | Adds base buffering fields and needFlush() method |
| DAGUtils.h/.cpp | Adds utility function to calculate max buffered bytes |
| DAGDriver.cpp | Updates streaming writer creation with new parameter |
| Test files | Updates all test cases to include new max_buffered_bytes parameter |
```cpp
if (scattered[col_id][chunk_id]->valuesHaveFixedSize())
{
    // Reserve space for each chunk to avoid frequent memory allocation.
    scattered[col_id][chunk_id]->reserve(1024);
}
```
The hardcoded value 1024 should be defined as a named constant to improve maintainability and make the purpose clearer.
/test pull-unit-test
[APPROVALNOTIFIER] This PR is APPROVED. This pull request has been approved by: gengliqi, solotzg. The full list of commands accepted by this bot can be found here. The pull request process is described here.
/test pull-unit-test

Force-pushed from 3e85f09 to 6d89003
/hold

/test pull-integration-next-gen
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: xufei <xufeixw@mail.ustc.edu.cn>
Force-pushed from 220960d to 5ca77f3

/hold cancel
/test pull-unit-test
Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io>
In response to a cherrypick label: new pull request created to branch
What problem does this PR solve?
Issue Number: close #10337
Problem Summary:
The root cause is that in the current implementation, the number of buffered blocks in ExchangeSender can be up to max_threads * max_threads * fine_grain_stream_count, so if max_threads is large and each row is big, the buffered blocks can consume a very large amount of memory.
This PR limits the memory usage of ExchangeSender to

max(MAX_BATCH_SEND_MIN_LIMIT_MEM_SIZE * max_threads, max_buffered_bytes_in_executor)

What is changed and how it works?
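The limit formula above could be computed roughly as follows. This is a sketch: the function name mirrors the `getMaxBufferedBytesInResponseWriter()` utility mentioned in the review summary, but its real signature, and the actual value of MAX_BATCH_SEND_MIN_LIMIT_MEM_SIZE, live in the TiFlash source.

```cpp
#include <algorithm>
#include <cstdint>

// Sketch of the buffer limit described in the PR:
//   max(MAX_BATCH_SEND_MIN_LIMIT_MEM_SIZE * max_threads, max_buffered_bytes_in_executor)
// The per-thread floor keeps each sender thread a useful batch size even when
// the executor-level byte budget is small; otherwise the executor budget caps
// the total buffered bytes.
std::uint64_t getMaxBufferedBytesInResponseWriter(
    std::uint64_t min_limit_mem_size_per_thread, // MAX_BATCH_SEND_MIN_LIMIT_MEM_SIZE
    std::uint64_t max_threads,
    std::uint64_t max_buffered_bytes_in_executor)
{
    return std::max(min_limit_mem_size_per_thread * max_threads,
                    max_buffered_bytes_in_executor);
}
```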
Check List
Tests
Tested in a 2-TiFlash cluster, with max_threads = 32 and the sort memory limit set to 5G.

Where `test` has 40000000 rows and each row has ~1200 bytes.

Before this pr
peak memory of each TiFlash is about 30G
After this pr
peak memory of each TiFlash is about 10G
Side effects
Documentation
Release note