Remove RR Sync: Testing

Validate the assumption that ELP2P works as expected when EL Sync is used to fill in the unsafe gap.

Run a real-world / sysgo testing scenario that syncs using the op-node.

### List of Tracked Tests

- [x] Test `sync.CLSync` behavior with unsafe payload queue: https://github.com/ethereum-optimism/optimism/pull/17675: `TestSyncAfterInitialELSync` @pcw109550 
- [ ] Test L2EL behavior while EL Syncing, using direct engine / eth API calls to the EL: **Insight: We only need to rely on FCU, not newPayload, because EL Sync is only triggered by FCU.**
    - [x] Test op-geth: https://github.com/ethereum-optimism/optimism/pull/17752: `TestL2ELP2PCanonicalChainAdvancedByFCU` @pcw109550 
    - [x] Test op-reth (not easily mergeable to dev because we need the reth binary): https://github.com/ethereum-optimism/optimism/pull/17802: `TestL2ELP2PCanonicalChainAdvancedByFCU`, with tweaks @pcw109550 
    - [x] ~Merge op-reth tests, or run at [op-rs/kona](https://github.com/op-rs/kona)~
- [x] Test L2EL behavior when FCUed with invalid hash: Expected return: `SYNCING` https://github.com/ethereum-optimism/optimism/pull/18001
    - `TestELP2PFCUUnavailableHash`
- [x] Test that, during FCU, the safe head cannot be advanced when the unsafe head hash cannot be validated. https://github.com/ethereum-optimism/optimism/pull/18001
    - `TestSafeDoesNotAdvanceWhenUnsafeIsSyncing_NoELP2P`
- [x] Test multiple scenarios where the payload is INVALID (the `newPayload` return value is INVALID) https://github.com/ethereum-optimism/optimism/pull/18001
    - `TestInvalidPayloadThroughCLP2P` (only works with geth; reth has a different `newPayload` implementation)
- [x] Demonstrate that the op-node does not rewind when an INVALID payload is detected while EL Syncing (e.g. VALID -> SYNCING -> ... -> SYNCING -> INVALID). The test checks that rewinding is not yet implemented in op-node. https://github.com/ethereum-optimism/optimism/pull/18001
    - `TestCLUnsafeNotRewoundOnInvalidDuringELSync` 
- [x] Use op-node to check that further sync (with initial sync on/off) completes and the unsafe head reaches the sequencer tip. This must be validated using a real chain, possibly using the sync tester.
   - [x] Verify that if an unsafe chain gap emerges, due to network issues, a node running in CLSync or ELSync mode will fill the gap and continue to advance its unsafe chain, without help from the L1 derivation pipeline. https://github.com/ethereum-optimism/optimism/pull/17751: `TestUnsafeChainStalling_{CLSync|ELSync}` @nonsense 
   - [x] Same as the case above, but with the op-node actually stopped and restarted rather than just a network issue. Verifies that if an unsafe chain gap emerges, a node running in CLSync or ELSync mode will fill the gap after booting up: https://github.com/ethereum-optimism/optimism/pull/17751: `TestUnsafeChainStalling_{CLSync|ELSync}_RestartOpNode_Long` @nonsense 
   - [x] Show that the unsafe chain stalls if RR sync is disabled, which is the case on current develop (i.e. we can't just switch RR sync off on develop as is, without losing functionality) https://github.com/ethereum-optimism/optimism/pull/17751: `TestUnsafeChainStalling_DisabledReqRespSync` @nonsense 
   - [x] Validate the above behavior using a real chain @nonsense 
- [x] ELP2P down, but chain still advancing since the unsafe payloads build on top of the unsafe head: will be implemented on top of the tests after we fill in the unsafe gap. This scenario occurs when the verifier eventually reaches the unsafe head tip @pcw109550 
    - https://github.com/ethereum-optimism/optimism/pull/17895
        - `TestReachUnsafeTipByAppendingUnsafePayload`
- [x] Reorg cases, where a safe head reorg happens. The FCU result will be INVALID. Test Reset behavior. This may be mocked using the sync tester, since it is harder to test. When the safe head reorgs, the FCU call will (eventually) return INVALID, because that unsafe payload will not build on top of the safe head. Use sync-tester or test-sequencer. Ref: https://github.com/ethereum-optimism/optimism/issues/17627#issuecomment-3369432912 @pcw109550 
    - https://github.com/ethereum-optimism/optimism/pull/17893
        - `TestUnsafeGapFillAfterSafeReorg`  
        - `TestUnsafeGapFillAfterUnsafeReorg_RestartL2CL`  
        - `TestUnsafeGapFillAfterUnsafeReorg_RestartCLP2P`  
- [ ] Cover op-reth syncing; all of the above scenarios should pass using reth @pcw109550 at https://github.com/ethereum-optimism/optimism/pull/17751, branch `nonsense/deprecate-req-res-use-elsync`, commit [79efd35341c2e55685a1c078e3a00860e7a7b12d](https://github.com/ethereum-optimism/optimism/pull/17751/commits/79efd35341c2e55685a1c078e3a00860e7a7b12d). `TestL2ELP2PCanonicalChainAdvancedByFCU` is not tested because it does not use op-node.
    - [x] `TestUnsafeGapFillAfterSafeReorg` 
    - [x] `TestUnsafeGapFillAfterUnsafeReorg_RestartL2CL`
    - [x] `TestUnsafeGapFillAfterUnsafeReorg_RestartCLP2P`
    - [x] `TestReachUnsafeTipByAppendingUnsafePayload` 
    - [x] `TestSyncAfterInitialELSync`
    - [x] `TestUnsafeChainStalling_CLSync_Short`
    - [x] `TestUnsafeChainStalling_CLSync_Long`
    - [x] `TestUnsafeChainStalling_CLSync_RestartOpNode_Long`
    - [x] `TestUnsafeChainStalling_ELSync_Short`
    - [x] `TestUnsafeChainStalling_ELSync_Long`
    - [x] `TestUnsafeChainStalling_ELSync_RestartOpNode_Long`
    - [x] `TestUnsafeChainStalling_DisabledReqRespSync`

reth version:
```
reth-optimism-cli Version: 1.8.2
Commit SHA: fe10c0785241a2ab92ee80c1e68629835d822770
```

### Discussions

We have multiple combinations: 
- syncmode: `sync.CLSync` / `sync.ELSync`
- RR Sync enabled / disabled
- EL implementation: geth / reth

Note that we assume EL P2P connectivity is stable, meaning the syncing EL is connected to an EL that is fully synced.

Validate the EL-side payload caching behavior, and check that once the gaps are filled, the latest payload can be appended to the unsafe chain and become canonical after the initial EL sync run.

**We should benchmark and examine how long it takes to fill in the unsafe gap using a real chain**. We may directly query the EL using `eth_getBlockByNumber("latest")` to check that the EL actually reached the unsafe head tip, rather than relying on the `optimism_syncStatus` result from the op-node.

More concretely, the testing scenario will be:
1. Prepare an op-node and an EL (with stable ELP2P) that are fully synced, reaching the unsafe tip.
2. Shut the op-node down.
3. Wait for 5 minutes so that the EL (with stable ELP2P) does not advance, intentionally creating the unsafe gap.
4. Start the op-node with
    - RR Sync disabled (via flag)
    - Patched to do the EL Sync to fill in the gap
    - Connected via CLP2P and receiving unsafe payloads from the sequencer
5. Measure the time until the EL (with stable ELP2P) reaches the tip.






Remove RR Sync: Testing #17694

List of Tracked Tests

Discussions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Remove RR Sync: Testing #17694

Description

List of Tracked Tests

Discussions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions