Avoid duplicate getheaders requests #6821
Conversation
If new blocks are announced while a node is still downloading headers, it starts requesting the same headers multiple times. See bitcoin#6755. With this patch, the client tries to guess whether it is still in the initial headers sync and avoids starting the duplicate requests if it is.
Thanks for trying to address this issue; I agree it is suboptimal that we could be in a situation where we're requesting the same headers from the same peer multiple times. However, I believe that sending a getheaders when a new block is announced is currently the only protection a node has against having chosen a bad peer for its initial headers sync. That is, if the peer a node chooses for initial headers sync never responds to the getheaders message, or never sends headers to get it caught up enough to start sending getheaders messages to other peers, the node will never sync up with the network. So my initial reaction is that this is perhaps not the best approach. I'd be inclined to solve this by trying to keep better per-node state (like the time of the last getheaders message, to avoid having multiple getheaders messages in flight to the same peer; and if a headers message comes back that doesn't include some announced block and isn't full, we could send another getheaders message, etc.). Definitely some edge cases here that need thought...
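As a rough sketch, the per-peer state suggested above might look like the following. All names here (CNodeHeaderState, MaybeSendGetHeaders, the cooldown value) are hypothetical and do not come from this PR or the codebase:

```cpp
// Hypothetical sketch of per-peer getheaders state, in the spirit of the
// suggestion above; none of these identifiers exist in the actual code.
#include <chrono>

struct CNodeHeaderState {
    // When we last sent getheaders to this peer; default-constructed if never.
    std::chrono::steady_clock::time_point m_last_getheaders_time{};
};

// Only send a new getheaders if none has been in flight "recently", so a
// block announcement cannot spawn a second parallel request chain.
bool MaybeSendGetHeaders(CNodeHeaderState& state,
                         std::chrono::seconds cooldown = std::chrono::seconds{120})
{
    const auto now = std::chrono::steady_clock::now();
    if (state.m_last_getheaders_time != std::chrono::steady_clock::time_point{} &&
        now - state.m_last_getheaders_time < cooldown) {
        return false; // a request to this peer is presumably still in flight
    }
    state.m_last_getheaders_time = now;
    return true; // the caller pushes the actual getheaders message
}
```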
I thought about doing something like this per-node as well; but in that case, couldn't the node still start one redundant "getheaders request chain" per peer when it receives inv messages? That still seems like a lot of waste. I understand your point about selecting a bad peer, though. My impression, however, is that this is a related but otherwise independent issue, no? In that case, wouldn't it make more sense to introduce a "timeout" after which the node chooses a different peer for requesting headers? This would solve the issue at the root (and is somewhat similar to the current protection with inv messages, except that it would be more predictable and less hackish).
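A rough, self-contained illustration of that timeout idea follows; every name in it is invented for this sketch, and the peer-selection logic is left to a caller-supplied function:

```cpp
// Hypothetical sketch of the timeout idea: if the chosen headers-sync peer
// has not completed headers sync within some deadline, pick another peer.
#include <chrono>
#include <functional>

struct HeadersSyncState {
    int m_sync_peer_id{-1};  // peer currently asked for headers; -1 if none
    std::chrono::steady_clock::time_point m_sync_started{};
};

// If the current sync peer has stalled past the deadline, ask the
// caller-supplied selector for a different peer and restart the clock.
void CheckHeadersSyncTimeout(HeadersSyncState& sync,
                             const std::function<int()>& pick_new_peer,
                             std::chrono::minutes deadline = std::chrono::minutes{10})
{
    const auto now = std::chrono::steady_clock::now();
    if (sync.m_sync_peer_id != -1 && now - sync.m_sync_started > deadline) {
        sync.m_sync_peer_id = pick_new_peer(); // restart headers sync elsewhere
        sync.m_sync_started = now;
    }
}
```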
I don't think there's agreement on this, and development seems to be stuck. @sdaftuar @domob1812 how to proceed here? |
I think that this issue should be resolved - while it usually is not a big problem for Bitcoin, it is simply wrong (and could lead to instability) to send duplicate getheaders requests. I think (obviously) that my patch improves the situation, but if there is a better solution, I'm all in favour of it as well. |
No objection to solving the underlying issue, but I don't think we should adopt the approach taken in this PR. This is a relatively minor problem (triggered only when your sync peer announces a new block during initial headers synchronization), and I don't think it makes sense to patch up a minor problem in a way that could create new problems. In addition to opening up a node to never syncing with the network if a bad initial sync peer is chosen, as I mentioned above, I think this PR has other issues as well, such as this: Imagine you have a [...]. The existing behavior would just send the [...]. I think any attempt to solve the issue here needs to be done more carefully, in a way that doesn't introduce new bugs (perhaps by trying an approach along the lines of what I wrote before about keeping more per-peer state).
Closing this PR, then. I agree it would be nice to have this fixed, but it seems too minor an issue to justify a quick fix that potentially breaks syncing.
The current logic for syncing headers may lead to lots of duplicate getheaders requests being sent: if a new block arrives while the node is in headers sync, it will send getheaders in response to the block announcement. When the headers arrive, the message will be of maximum size, so a follow-up request will be sent, all of that in addition to the existing headers sync. This creates a second "chain" of getheaders requests; if more blocks arrive, it may even lead to arbitrarily many parallel chains of redundant requests. This patch changes the behaviour to only request more headers after a maximum-sized message when it contained at least one unknown header. This avoids sustaining parallel chains of redundant requests. Note that this patch avoids the issues raised in the discussion of bitcoin#6821: there is no risk of the node being permanently blocked, since at the latest a newly arriving block will trigger a new getheaders request and restart syncing.
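The core of that changed continuation rule can be sketched as follows. MAX_HEADERS_RESULTS is the real 2,000-header protocol limit; the function name and its parameters are made up for illustration and are not the literal patch:

```cpp
// Sketch of the changed follow-up rule; not the literal diff.
static const unsigned int MAX_HEADERS_RESULTS = 2000; // protocol cap per headers message

// nReceived: headers in the message we just processed.
// nNewHeaders: how many of them were not yet in our block index.
bool ShouldRequestMoreHeaders(unsigned int nReceived, unsigned int nNewHeaders)
{
    // Old behaviour: any full-sized message triggered another getheaders,
    // so a redundant request chain kept itself alive indefinitely.
    // New behaviour: only follow up when the full message contained at least
    // one unknown header.
    return nReceived == MAX_HEADERS_RESULTS && nNewHeaders > 0;
}
```

The point of the condition is that a redundant chain only re-downloads headers the node already has, so its messages contain no unknown headers and the chain dies out after one round, while the genuine sync chain keeps satisfying the condition and makes progress.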
If new blocks are announced while a node is still downloading headers, it starts requesting the same headers multiple times. See #6755.
With this patch, the client tries to guess whether it is still in the initial headers sync and avoids starting duplicate requests if it is. I explained in code comments why this is safe. Here's a summary:
In general, I tried to be "conservative" when deciding to prevent a request. The node only considers itself to be in the initial headers sync when it last received a headers message of full size (2,000 headers) that built on top of its currently known best header and contained at least 12 previously unknown headers. If an attacker wants to fake this condition for a node that is actually up to date (to prevent it from getting a newly announced block), he or she needs to construct 12 headers with valid PoW; this seems like a lot of effort for a not-so-critical effect.
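As a sketch, that "still in initial headers sync" guess might look roughly like the following. The identifiers are invented for this example; only the 2,000-header message limit and the 12-header threshold come from the description above:

```cpp
// Illustrative sketch of the heuristic described above; identifiers are
// made up for this example and are not the PR's actual names.
static const unsigned int MAX_HEADERS_RESULTS = 2000; // a "full" headers message
static const unsigned int MIN_UNKNOWN_HEADERS = 12;   // PoW cost to fake the state

struct HeadersSyncGuess {
    unsigned int nLastMsgSize{0};    // headers in the last headers message
    unsigned int nLastMsgUnknown{0}; // how many of them were previously unknown
    bool fBuiltOnBestHeader{false};  // did it extend our current best header?
};

// Conservative guess: only assume "still in initial headers sync" when the
// last headers message was full, extended our best known header, and
// contained at least 12 previously unknown headers.
bool LikelyInInitialHeadersSync(const HeadersSyncGuess& g)
{
    return g.nLastMsgSize == MAX_HEADERS_RESULTS &&
           g.fBuiltOnBestHeader &&
           g.nLastMsgUnknown >= MIN_UNKNOWN_HEADERS;
}
```

The third condition is what makes faking the state expensive: to keep an up-to-date node believing it is still in initial sync, an attacker must mine at least 12 headers with valid proof-of-work.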