Skip to content

Conversation

@xy720
Copy link
Member

@xy720 xy720 commented Aug 10, 2021

Proposed changes

#6417

This commit is going to reduce thread number of SyncJob .
1、Submit send task to thread pool to send data.
2、Submit eof task to thread pool to block and wake up client to commit transactions.
3、Use SerialExecutorService to ensure correct order of sent data in every channel.

Besides,some bugs have been fixed in this commit
1、Failed to resume syncJob.
2、Failed to do sync data when set multiple tables in a syncJob.
3、In a cluster with multiple Fe, master may hang up after creating syncJob.

Types of changes

What types of changes does your code introduce to Doris?
Put an x in the boxes that apply

  • Optimization. Including functional usability improvements and performance improvements.

Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your code.

Further comments

If this is a relatively large or complex change, kick off the discussion at dev@doris.apache.org by explaining why you chose the solution you did and what alternatives you considered, etc...

@xy720 xy720 changed the title [Enhance] Reduce thread number of SyncJob to save resources [Enhance] [Binlog] Reduce thread number of SyncJob to save resources Aug 10, 2021

private static final ExecutorService executor = new StripedTaskExecutor();

public SyncTaskPool() {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Useless

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed

/**
* max num of thread to handle sync task in sync task thread-pool.
*/
@ConfField public static int max_sync_task_threads_num = 1024;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why need so many threads here?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Now is 10.

Copy link
Contributor

@morningman morningman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add UT for SyncTaskPool

// column names of source table
@SerializedName(value = "colNames")
private final List<String> colNames;
@SerializedName(value = "id")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

use same name as variables

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

SerialExecutor serialEx = slots[index];
serialEx.execute(command);
} else {
taskPool.execute(command);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need to accept task without index in this pool?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It‘s for compatibility with general thread pools

private static final Logger LOG = LogManager.getLogger(SyncTask.class);

protected long signature;
protected int index;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add comment to explain the index

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@xy720 xy720 force-pushed the reduce-thread-num-in-sync-job branch from fcfb28e to faa6625 Compare September 9, 2021 08:34
Copy link
Contributor

@morningman morningman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions
Copy link
Contributor

PR approved by at least one committer and no changes requested.

@github-actions github-actions bot added approved Indicates a PR has been approved by one committer. reviewed labels Sep 12, 2021
@github-actions
Copy link
Contributor

PR approved by anyone and no changes requested.

Copy link
Contributor

@morningman morningman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@morningman morningman merged commit 95cdb7c into apache:master Sep 17, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. area/mysql-binlog kind/improvement reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Proposal] Support synchronizing MySQL binlog in real time

2 participants