This repository was archived by the owner on Aug 2, 2021. It is now read-only.

rewrite pull syncer #1451

@acud

Description


the pull syncer has a few edge-case bugs that are very hard to trace and debug.

we sometimes do not get chunks at the expected nodes, and the current syncer infrastructure makes it very difficult to support these debugging efforts.

work outline:

  • submit an initial spec to iterate on top of
  • write the protocol boilerplate for initialising a new protocol over the devp2p network

sync:

  • retrieve stream cursors upon node connection
  • drop cursors when a peer moves out of depth
  • establish streams inside the nearest neighbourhood (NN) according to kademlia depth
  • make sure that stream cancellations happen on depth change
  • debounce mechanism
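
The depth-gated stream establishment and the debounce item above can be sketched roughly as follows. This is a minimal illustration under assumed semantics, not the actual stream package API; `shouldSync` and `debouncer` are hypothetical names:

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// shouldSync reports whether a peer at proximity order po falls within
// the current kademlia depth, i.e. inside the nearest neighbourhood
// where sync streams should be established. (Hypothetical helper.)
func shouldSync(po, depth int) bool {
	return po >= depth
}

// debouncer coalesces bursts of depth-change notifications so that
// stream setup/teardown runs once per quiet period rather than once
// per event.
type debouncer struct {
	mu    sync.Mutex
	timer *time.Timer
	delay time.Duration
}

// trigger schedules f after the debounce delay, cancelling any
// previously scheduled run.
func (d *debouncer) trigger(f func()) {
	d.mu.Lock()
	defer d.mu.Unlock()
	if d.timer != nil {
		d.timer.Stop()
	}
	d.timer = time.AfterFunc(d.delay, f)
}

func main() {
	fmt.Println(shouldSync(3, 2)) // peer inside depth
	fmt.Println(shouldSync(1, 2)) // peer outside depth

	d := &debouncer{delay: 10 * time.Millisecond}
	done := make(chan struct{})
	for i := 0; i < 5; i++ {
		d.trigger(func() { close(done) }) // only the last call fires
	}
	<-done
	fmt.Println("debounced")
}
```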

sync-localstore-intervals:

  • make sure closed intervals are always delivered from localstore pull
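
One way to read the closed-intervals requirement: only fully delivered, closed ranges of localstore bin IDs are ever recorded, so the next request always starts at the first real gap. A rough sketch under that assumption (the `intervals` type here is hypothetical, not the actual intervals store):

```go
package main

import (
	"fmt"
	"sort"
)

// interval is a closed range of bin IDs that has been fully delivered.
type interval struct{ from, to uint64 }

// intervals tracks delivered closed ranges for one stream.
type intervals struct{ ranges []interval }

// add records a fully delivered closed interval, merging it with any
// overlapping or adjacent neighbours.
func (s *intervals) add(from, to uint64) {
	merged := interval{from, to}
	var out []interval
	for _, r := range s.ranges {
		if r.to+1 >= merged.from && merged.to+1 >= r.from {
			// overlapping or adjacent: fold into the merged range
			if r.from < merged.from {
				merged.from = r.from
			}
			if r.to > merged.to {
				merged.to = r.to
			}
		} else {
			out = append(out, r)
		}
	}
	out = append(out, merged)
	sort.Slice(out, func(i, j int) bool { return out[i].from < out[j].from })
	s.ranges = out
}

// next returns the start of the first gap at or after from.
func (s *intervals) next(from uint64) uint64 {
	for _, r := range s.ranges {
		if from >= r.from && from <= r.to {
			from = r.to + 1
		}
	}
	return from
}

func main() {
	var s intervals
	s.add(1, 5)
	s.add(6, 10) // adjacent: merges into [1,10]
	s.add(20, 25)
	fmt.Println(s.next(1))  // first gap after the merged interval
	fmt.Println(s.next(20)) // first gap after [20,25]
}
```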

get stream < cursor (history priority):

  • test case for continuous intervals (no gaps)
  • test case for missing intervals (enclosing interval should still persist)

get stream > cursor (live priority):

  • test case for continuous intervals (no gaps)
  • test case for missing intervals (enclosing interval should still persist). difficult to test, since intervals are fetched faster than chunks can be erased to create gaps; on hold
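
The `< cursor` / `> cursor` split above comes down to this: at connection time the server reports a cursor (the highest bin ID it holds), and a requested range is split into a bounded historical stream up to the cursor plus an unbounded live stream above it. A hedged sketch, with `request` and `splitAtCursor` as assumed names:

```go
package main

import "fmt"

// request describes one stream request. to is ignored for live
// (unbounded) requests.
type request struct {
	from, to uint64
	live     bool
}

// splitAtCursor turns a desired range starting at from into at most
// two requests: bounded history up to the peer's cursor, and an
// unbounded live stream above it.
func splitAtCursor(from, cursor uint64) []request {
	var reqs []request
	if from <= cursor {
		// historical portion, fetched with priority
		reqs = append(reqs, request{from: from, to: cursor})
	}
	liveFrom := cursor + 1
	if from > cursor {
		liveFrom = from
	}
	reqs = append(reqs, request{from: liveFrom, live: true})
	return reqs
}

func main() {
	for _, r := range splitAtCursor(1, 100) {
		fmt.Printf("%+v\n", r)
	}
	for _, r := range splitAtCursor(200, 100) {
		fmt.Printf("%+v\n", r)
	}
}
```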

consistency:

  • test case for no duplicate chunks delivered
  • check no overlap between historical and live streaming
  • make sync bins within depth feature toggle configurable and write test cases that validate syncing with it on and off

cluster/snapshot:

  • test that chunks are sent correctly in a star topology (adapt second test from syncer_test.go from existing stream package)
  • test that 3 nodes with full connectivity sync between them and that each node's localstore ends up as the union of all three nodes' localstores
  • test that chunks are synced correctly in a larger full topology w/ discovery. this test vector needs more description
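
The invariant behind the 3-node full-connectivity test can be stated in a few lines: after syncing settles, every node's store equals the union of all initial stores. A sketch of that assertion (helper names are hypothetical):

```go
package main

import "fmt"

// union collects the distinct chunk addresses across all nodes'
// initial localstores.
func union(stores ...[]string) map[string]bool {
	u := make(map[string]bool)
	for _, s := range stores {
		for _, addr := range s {
			u[addr] = true
		}
	}
	return u
}

// converged checks one node's final store (assumed duplicate-free)
// against the expected union.
func converged(store []string, want map[string]bool) bool {
	if len(store) != len(want) {
		return false
	}
	for _, addr := range store {
		if !want[addr] {
			return false
		}
	}
	return true
}

func main() {
	want := union([]string{"a", "b"}, []string{"b", "c"}, []string{"d"})
	fmt.Println(len(want))                                   // distinct addresses
	fmt.Println(converged([]string{"a", "b", "c", "d"}, want)) // fully synced node
	fmt.Println(converged([]string{"a", "b"}, want))           // node still missing chunks
}
```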

resilience:

  • guarantee that there's always a live historical fetch with an unbounded stream
  • check that existing historical and live historical stream fetchers terminate when depth changes and node moves out of depth

optimisations/benchmarking:

  • is it faster to deliver 3000 chunks concurrently in 3000 different messages between two nodes, or is it more effective to send one message with 3000 chunks? this should be easily measurable
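
A crude local harness for the batching question: push the same chunks through a channel (standing in for the wire) either as many single-chunk messages or as one batched message, and time both. Real numbers need the devp2p transport; this only shows the shape of the measurement, and all names are assumptions:

```go
package main

import (
	"fmt"
	"time"
)

// msg stands in for a wire message carrying one or more chunks.
type msg struct{ chunks [][]byte }

// deliver pushes the given batches through an unbuffered channel and
// counts the chunks received, returning the elapsed time.
func deliver(batches [][][]byte) (received int, elapsed time.Duration) {
	ch := make(chan msg)
	start := time.Now()
	go func() {
		for _, b := range batches {
			ch <- msg{chunks: b}
		}
		close(ch)
	}()
	for m := range ch {
		received += len(m.chunks)
	}
	return received, time.Since(start)
}

func main() {
	chunk := make([]byte, 4096)

	// strategy A: 3000 messages with one chunk each
	var single [][][]byte
	for i := 0; i < 3000; i++ {
		single = append(single, [][]byte{chunk})
	}
	// strategy B: one message with 3000 chunks
	big := make([][]byte, 3000)
	for i := range big {
		big[i] = chunk
	}

	nA, tA := deliver(single)
	nB, tB := deliver([][][]byte{big})
	fmt.Println(nA, tA, nB, tB)
}
```

Both strategies must deliver the same chunk count; only the timing differs, which is the quantity to compare on the real transport.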

tooling:

  • add an assertion to smoke tests so we know when syncing is done

logging:
