Mining: Prevent slowdown in CreateNewBlock on large mempools #9959
Conversation
Update OP with a correction and a more relevant statistic.
luke-jr left a comment
utACK, certainly an improvement either way. (this behaviour used to exist before the CNB refactoring, maybe the old code is worth looking at)
```cpp
addPackageTxs();
int nPackagesSelected = 0;
int nDescendantsUpdated = 0;
```
Might make more sense to put these on the class, rather than pass by-reference?
I'm working on a refactor of the BlockAssembler members separately (as part of a patch to exclude recent transactions from new blocks, when the fee difference is negligible), so I'd prefer to defer the decision of whether to include this in the class until I'm ready to PR that patch.
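To make the trade-off concrete, here is a hypothetical sketch (not the actual patch; the real `BlockAssembler` has many more members and a different interface) contrasting counters passed by reference with counters stored on the class:

```cpp
// Hypothetical sketch of the two alternatives discussed above.

// Option A: counters are locals passed by reference (what this PR does).
class BlockAssemblerA {
public:
    void CreateNewBlock() {
        int nPackagesSelected = 0;
        int nDescendantsUpdated = 0;
        addPackageTxs(nPackagesSelected, nDescendantsUpdated);
        // Counters are available here for logging/benchmarking.
    }
private:
    void addPackageTxs(int& nPackagesSelected, int& nDescendantsUpdated) {
        ++nPackagesSelected;     // selection loop elided
        ++nDescendantsUpdated;
    }
};

// Option B: counters are members of the assembler (the suggestion above).
class BlockAssemblerB {
public:
    void CreateNewBlock() {
        nPackagesSelected = 0;   // must remember to reset per block
        nDescendantsUpdated = 0;
        addPackageTxs();
    }
private:
    void addPackageTxs() {
        ++nPackagesSelected;     // selection loop elided
        ++nDescendantsUpdated;
    }
    int nPackagesSelected = 0;
    int nDescendantsUpdated = 0;
};
```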
```cpp
// Limit the number of attempts to add transactions to the block when it is
// close to full.
const int64_t MAX_CONSECUTIVE_FAILURES = 1000;
```
The old code used 50 for this. Does 1000 work well enough in practice?
Yes; I also tried setting this to 100 and the performance was not really distinguishable.
```cpp
++nConsecutiveFailed;

if (nConsecutiveFailed > MAX_CONSECUTIVE_FAILURES && nBlockWeight >
        nBlockMaxWeight - 4000) {
```
Perhaps the additional condition(s) should be before incrementing, or we could end up just aborting immediately as we approach the max block weight even if we could easily fit a few more txs in.
I'm not following; every time we add a new tx to the block we reset the counter... Can you clarify?
utACK 03a4076, didn't bother to benchmark; should be obvious wins, even if minor.
Is this worth backporting (my vote would be weak yes)?
@TheBlueMatt I don't feel strongly about whether we backport this change, to be honest (though it's simple enough that I don't think there's any downside). But it would be nice to merge this into master if there are no further comments, so I can continue work on additional mining changes (to support excluding recently received transactions if the fee difference from doing so is below some threshold).
JeremyRubin left a comment
utACK. I've reviewed the changes and they seem reasonable. @sdaftuar I also left some ideas for a few complexity optimizations you may not have yet considered for the expensive loop, but those are for future work :).
src/miner.cpp (outdated)
```cpp
int BlockAssembler::UpdatePackagesForAdded(const CTxMemPool::setEntries& alreadyAdded,
                                           indexed_modified_transaction_set &mapModifiedTx)
{
    int nDescendants = 0;
```
Can you rename this to `nDescendantsUpdated` to make clear that it should not be initialized to `alreadyAdded.size()`?
```cpp
CTxMemPool::setEntries descendants;
mempool.CalculateDescendants(it, descendants);
// Insert all descendants (not yet in block) into the modified set
BOOST_FOREACH(CTxMemPool::txiter desc, descendants) {
```
This is a really interesting section. I got a bit nerd-sniped, so apologies for the long writeup.
Doing the following may be more efficient (I can code this up, but I don't want to step on your toes if you're operating here).
```cpp
std::vector<txiter> diff(descendants.size());
auto end = std::set_difference(descendants.begin(), descendants.end(),
                               alreadyAdded.begin(), alreadyAdded.end(),
                               diff.begin()); // Linear time!
nDescendants += end - diff.begin();
for (auto it = diff.begin(); it != end; ++it) {
    // ....
}
```

It requires an extra copy compared to the naive version, but they're iterators so who cares (we can also reuse the vector for the entire call at the top of UpdatePackagesForAdded)...

Let N = alreadyAdded.size()
Let M = descendants.size()

This does O(M+N) work (I think most implementations actually do O(max(M, N)), but the standard specifies at most 2*(M+N)-1 comparisons), while what's currently happening would appear to be O(M*log(N)).

There is also a pre-loop check one could do if it's likely they don't intersect:

```cpp
// O(1)
if (descendants.back() < alreadyAdded.front() || descendants.front() > alreadyAdded.back())
    // skip set_difference; descendants - alreadyAdded = descendants
```

And a pre-loop narrowing one could do to make it O(min(M, N)):

```cpp
// O(log(M) + log(N))
auto range1 = descendants.equal_range(alreadyAdded.begin(), alreadyAdded.end());
auto range2 = alreadyAdded.equal_range(descendants.begin(), descendants.end());
std::vector<txiter> diff(range1.second - range1.first);
auto end = std::set_difference(range1.first, range1.second,
                               range2.first, range2.second, diff.begin()); // Linear time!
nDescendants += end - diff.begin();
for (auto it = diff.begin(); it != end; ++it) {
    // ....
}
```
Thanks, will try some of these out in future optimization efforts.
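For readers following along, here is a minimal, self-contained sketch of the linear-merge idea above, with plain ints standing in for `CTxMemPool::txiter` (illustrative only, not the miner code):

```cpp
// Self-contained demonstration of the set_difference approach suggested above,
// compared against the per-element lookup it would replace.
#include <algorithm>
#include <cassert>
#include <set>
#include <vector>

int main()
{
    std::set<int> descendants{1, 2, 3, 5, 8, 13};
    std::set<int> alreadyAdded{2, 3, 7};

    // Naive approach: one O(log N) lookup per descendant => O(M log N) total.
    std::vector<int> naive;
    for (int d : descendants) {
        if (!alreadyAdded.count(d)) naive.push_back(d);
    }

    // Merge-based approach: both sets iterate in sorted order, so a single
    // linear pass suffices => O(M + N).
    std::vector<int> diff(descendants.size());
    auto end = std::set_difference(descendants.begin(), descendants.end(),
                                   alreadyAdded.begin(), alreadyAdded.end(),
                                   diff.begin());
    diff.resize(end - diff.begin());

    assert(naive == diff);  // both yield {1, 5, 8, 13}
    return 0;
}
```

One caveat for the real code, if I recall the mempool types correctly: `CTxMemPool::setEntries` is ordered by a custom iterator comparator, so that same comparator would have to be passed to `std::set_difference` for the merge to be valid.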
```cpp
}

// This transaction will make it in; reset the failed counter.
nConsecutiveFailed = 0;
```
Could you add more of a comment on why resetting is correct behavior here? It seems to me at first glance that if we fail to add something earlier, we should continue to tally those as failures.

Perhaps something like: assuming that below a certain probability threshold `T` of adding something to the block we want to give up (i.e., `T = P(success at M | failure at M-1, ..., 0)` where `M = 1000`), we expect that for all `1000 > D > 0`, `P(success at N+D+1 | failure at N+D, ..., success at N, failure at N-1, ...) > P(success at N+D | failure at N, failure at N-1, ...)`?

Maybe not the most accurate, but it would be helpful for future reviewers trying to understand what the intention is.
This is incremented whenever we fail (not just for failures after we're above a certain block weight), so it needs to be reset when adding a new tx.
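To make the increment/reset interplay concrete, here is a simplified, self-contained sketch of the loop shape being discussed; candidate "packages" are reduced to bare weights, so this is not the actual `addPackageTxs()` code:

```cpp
// Simplified sketch of the early-exit heuristic from this PR: give up only
// after a long unbroken run of failures while the block is nearly full.
#include <cstdint>
#include <vector>

static const int64_t MAX_CONSECUTIVE_FAILURES = 1000;

// Each candidate is just a weight here; "failing" means it no longer fits.
uint64_t assembleSketch(const std::vector<uint64_t>& candidates,
                        uint64_t nBlockMaxWeight)
{
    uint64_t nBlockWeight = 4000;   // illustrative reserved weight
    int64_t nConsecutiveFailed = 0;

    for (uint64_t w : candidates) {
        if (nBlockWeight + w > nBlockMaxWeight) {
            ++nConsecutiveFailed;
            // Exit only when we have failed many times in a row AND the block
            // is already within 4000 weight of full; otherwise keep scanning.
            if (nConsecutiveFailed > MAX_CONSECUTIVE_FAILURES &&
                nBlockWeight > nBlockMaxWeight - 4000) {
                break;
            }
            continue;
        }
        // This candidate makes it in; every success resets the failure streak,
        // so only an unbroken run of failures can trigger the early exit.
        nConsecutiveFailed = 0;
        nBlockWeight += w;
    }
    return nBlockWeight;
}
```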
Addressed @JeremyRubin's nits.
re-utACK 4d1eb10
Force-pushed from 4d1eb10 to 011124a
re-utACK 011124a
gmaxwell left a comment
utACK
Merged: Mining: Prevent slowdown in CreateNewBlock on large mempools
011124a Update benchmarking with package statistics (Suhas Daftuar)
42cd8c8 Add benchmarking for CreateNewBlock (Suhas Daftuar)
eed816a Mining: return early when block is almost full (Suhas Daftuar)
Tree-SHA512: c0d8f71e4e0441acf3f4ca12f8705e413b59b323659346a447145653def71710537fb4c6d80cad8e36d68b0aabf19c92e9eab7135a8897b053ed58720856cdda
Github-Pull: bitcoin#9959 Rebased-From: eed816a
Github-Pull: bitcoin#9959 Rebased-From: 42cd8c8
Github-Pull: bitcoin#9959 Rebased-From: 011124a
Removing 'needs backport' label as a backport exists (#10127).
I've been investigating performance regressions in CreateNewBlock. `addPackageTxs()` is supposed to "fail fast" for transactions that don't fit in a block. However, while we skip the most expensive ancestor/descendant calculations for failing transactions (generally), it turns out the other work we're doing (mostly map lookups?) can still be pretty slow.

After trying various approaches to speed up those operations (replacing maps with unordered_maps, changing the sort order on the ancestor score function to avoid needless re-sorting, and even getting rid of the maps altogether in favor of storing set information directly in the mempool entry), the one optimization that dominates all of these is to just return earlier when the block is nearly full.

So in this PR: when we're within 4000 weight of the block being full, if we consider and fail to add 1000 transactions in a row, then give up. I've benchmarked this as reducing the average run of CNB from ~~84ms to 63ms~~ 74ms to 61ms, with negligible difference in fees. Most of CNB time is currently taken by the call to TestBlockValidity, which is unaffected by this PR; so focusing just on package selection, the improvement from this change is a reduction in average `addPackageTxs()` time from 19ms to 7ms, over the time period I analyzed (first week of December, 2016).

I also added some commits that provide benchmarking of CNB when running with -debug=bench, which I thought might be generally useful/interesting.
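As a side note on the benchmarking commits: below is a standalone illustration of wall-clock timing around a call with std::chrono. The actual patch hooks into Bitcoin Core's own -debug=bench timing and logging rather than using this code; the stub function here is purely hypothetical.

```cpp
// Standalone illustration of timing a call, similar in spirit to the
// -debug=bench instrumentation added by this PR (which uses Bitcoin Core's
// own timing/logging helpers, not this code).
#include <chrono>
#include <cstdio>

void addPackageTxsStub()
{
    // Stand-in for the real package-selection loop.
}

int main()
{
    const auto start = std::chrono::steady_clock::now();
    addPackageTxsStub();
    const auto stop = std::chrono::steady_clock::now();
    const double ms =
        std::chrono::duration<double, std::milli>(stop - start).count();
    std::printf("addPackageTxs() stub took %.3f ms\n", ms);
    return 0;
}
```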