Stream blocks during import by arnetheduck · Pull Request #2937 · status-im/nimbus-eth1

arnetheduck · 2024-12-13T10:15:00Z

When running the import, currently blocks are loaded in batches into a seq then passed to the importer as such.

In reality, blocks are still processed one by one, so the batching does not offer any performance advantage. It does however require that the client wastes memory, up to several GB, on the block sequence while they're waiting to be processed.

This PR introduces a persister that accepts these potentially large blocks one by one and at the same time removes a number of redundant / unnecessary copies, assignments and resets that were slowing down the import process in general.

When running the import, currently blocks are loaded in batches into a `seq` then passed to the importer as such. In reality, blocks are still processed one by one, so the batching does not offer any performance advantage. It does however require that the client wastes memory, up to several GB, on the block sequence while they're waiting to be processed. This PR introduces a persister that accepts these potentially large blocks one by one and at the same time removes a number of redundant / unnecessary copies, assignments and resets that were slowing down the import process in general.

tersec · 2024-12-16T11:27:27Z

+  # the cost of failure is low.
+  # TODO Figure out the right balance for header fields - in particular, if
+  #      we receive instruction from the CL while syncing that a block is
+  #      CL-valid, do we skip validation while "far from head"? probably yes.


The main thing which needs to be checked is likely the basic link back to the parent blocks each time matching the block hash, i.e. the exact set of conditions that the engine API requires even an unsynced EL to perform. Beyond that isn't critical to security, and can be optimized to taste.

yeah, I guess a rule the EL can follow is that if the block is earlier than finalized (according to the cl), we do just parent-hash checking - if it's more recent than finalized, we do full checking with state root and everything - that should be pretty safe given that all the validators approved it already.

Anyway, this comment is just copy-pasted from the earlier code - it's slightly out of date because forkedchain no longer uses this code from what I remember

tersec reviewed Dec 16, 2024

View reviewed changes

Comment thread nimbus/nimbus_import.nim

tersec reviewed Dec 16, 2024

View reviewed changes

tersec approved these changes Dec 18, 2024

View reviewed changes

arnetheduck added 2 commits December 18, 2024 10:35

Merge remote-tracking branch 'origin/master' into stream-blocks

4215ddd

lint

de4ce49

arnetheduck merged commit 7bbb0f4 into master Dec 18, 2024

arnetheduck deleted the stream-blocks branch December 18, 2024 12:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stream blocks during import#2937

Stream blocks during import#2937
arnetheduck merged 3 commits intomasterfrom
stream-blocks

arnetheduck commented Dec 13, 2024

Uh oh!

Uh oh!

tersec Dec 16, 2024

Uh oh!

arnetheduck Dec 16, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

arnetheduck commented Dec 13, 2024

Uh oh!

Uh oh!

tersec Dec 16, 2024

Choose a reason for hiding this comment

Uh oh!

arnetheduck Dec 16, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants