
Conversation

@hanabi1224 hanabi1224 commented Oct 17, 2025

Summary of changes

This PR introduces a tool to take a DB snapshot for validating a tipset. The snapshot can be used for benchmarks (which may come in a subsequent PR) to justify critical dependency updates (e.g. wasmtime in the FVM).

Changes introduced in this pull request:

  • add forest-tool state compute
  • add forest-tool state replay-compute

Example usage

➜  state_compute forest-tool state compute --chain mainnet --epoch 5412980 --export-db-to mainnet_5412980.forest.car.zst
2025-10-17T11:53:23.299386Z  INFO forest::daemon::db_util: Loaded 1 CARs
2025-10-17T11:53:23.303018Z  INFO forest::genesis: Initialized genesis: bafy2bzacecnamqgqmifpluoeldx7zzglxcljo6oja4vrmtj7432rphldpdmm2
epoch: 5412980, state_root: bafy2bzaceayrf3zvlnwxoe2gp5inorvczqgh6vcngiv2gdhocixpvoebbaj5m, receipt_root: bafy2bzacebc3cz43bb76ehtpfuv4brc4echaaxmn2ecih5cx2o5zptym6zzq4, db_snapshot_size: 16.2 MiB
➜  state_compute forest-tool state compute --chain calibnet --epoch 3111900 --export-db-to calibnet_3111900.forest.car.zst
2025-10-17T11:53:33.962248Z  INFO forest::daemon::db_util: Loaded 1 CARs
2025-10-17T11:53:33.966142Z  INFO forest::genesis: Initialized genesis: bafy2bzacecyaggy24wol5ruvs6qm73gjibs2l2iyhcqmvi7r7a4ph7zx3yqd4
epoch: 3111900, state_root: bafy2bzacearxaxwxpn4szgyb3ekhr5ou6kgclhjwexdbx4grdrli45wi2ksuw, receipt_root: bafy2bzacebx5mqrqqgh7s2unqnscqfcili6mumpybdfdkbgimo2zof3mkh6qe, db_snapshot_size: 3 MiB
➜  state_compute forest-tool state replay-compute --chain mainnet -n 5 mainnet_5412980.forest.car.zst
2025-10-17T11:54:31.540237Z  INFO forest::genesis: Initialized genesis: bafy2bzacecnamqgqmifpluoeldx7zzglxcljo6oja4vrmtj7432rphldpdmm2
epoch: 5412980, state_root: bafy2bzaceayrf3zvlnwxoe2gp5inorvczqgh6vcngiv2gdhocixpvoebbaj5m, receipt_root: bafy2bzacebc3cz43bb76ehtpfuv4brc4echaaxmn2ecih5cx2o5zptym6zzq4, took 5s 485ms 678us 801ns.
epoch: 5412980, state_root: bafy2bzaceayrf3zvlnwxoe2gp5inorvczqgh6vcngiv2gdhocixpvoebbaj5m, receipt_root: bafy2bzacebc3cz43bb76ehtpfuv4brc4echaaxmn2ecih5cx2o5zptym6zzq4, took 453ms 433us 769ns.
epoch: 5412980, state_root: bafy2bzaceayrf3zvlnwxoe2gp5inorvczqgh6vcngiv2gdhocixpvoebbaj5m, receipt_root: bafy2bzacebc3cz43bb76ehtpfuv4brc4echaaxmn2ecih5cx2o5zptym6zzq4, took 451ms 79us 754ns.
epoch: 5412980, state_root: bafy2bzaceayrf3zvlnwxoe2gp5inorvczqgh6vcngiv2gdhocixpvoebbaj5m, receipt_root: bafy2bzacebc3cz43bb76ehtpfuv4brc4echaaxmn2ecih5cx2o5zptym6zzq4, took 450ms 547us 244ns.
epoch: 5412980, state_root: bafy2bzaceayrf3zvlnwxoe2gp5inorvczqgh6vcngiv2gdhocixpvoebbaj5m, receipt_root: bafy2bzacebc3cz43bb76ehtpfuv4brc4echaaxmn2ecih5cx2o5zptym6zzq4, took 455ms 183us 853ns.
➜  state_compute forest-tool state replay-compute --chain calibnet -n 5 calibnet_3111900.forest.car.zst
2025-10-17T11:55:11.527308Z  INFO forest::genesis: Initialized genesis: bafy2bzacecyaggy24wol5ruvs6qm73gjibs2l2iyhcqmvi7r7a4ph7zx3yqd4
epoch: 3111900, state_root: bafy2bzacearxaxwxpn4szgyb3ekhr5ou6kgclhjwexdbx4grdrli45wi2ksuw, receipt_root: bafy2bzacebx5mqrqqgh7s2unqnscqfcili6mumpybdfdkbgimo2zof3mkh6qe, took 3s 694ms 652us 947ns.
epoch: 3111900, state_root: bafy2bzacearxaxwxpn4szgyb3ekhr5ou6kgclhjwexdbx4grdrli45wi2ksuw, receipt_root: bafy2bzacebx5mqrqqgh7s2unqnscqfcili6mumpybdfdkbgimo2zof3mkh6qe, took 48ms 269us 352ns.
epoch: 3111900, state_root: bafy2bzacearxaxwxpn4szgyb3ekhr5ou6kgclhjwexdbx4grdrli45wi2ksuw, receipt_root: bafy2bzacebx5mqrqqgh7s2unqnscqfcili6mumpybdfdkbgimo2zof3mkh6qe, took 50ms 52us 706ns.
epoch: 3111900, state_root: bafy2bzacearxaxwxpn4szgyb3ekhr5ou6kgclhjwexdbx4grdrli45wi2ksuw, receipt_root: bafy2bzacebx5mqrqqgh7s2unqnscqfcili6mumpybdfdkbgimo2zof3mkh6qe, took 47ms 367us 543ns.
epoch: 3111900, state_root: bafy2bzacearxaxwxpn4szgyb3ekhr5ou6kgclhjwexdbx4grdrli45wi2ksuw, receipt_root: bafy2bzacebx5mqrqqgh7s2unqnscqfcili6mumpybdfdkbgimo2zof3mkh6qe, took 49ms 510us 978ns.

Reference issue to close (if applicable)

Closes

Other information and links

Change checklist

  • I have performed a self-review of my own code,
  • I have made corresponding changes to the documentation. All new code adheres to the team's documentation standards,
  • I have added tests that prove my fix is effective or that my feature works (if possible),
  • I have made sure the CHANGELOG is up-to-date. All user-facing changes should be reflected in this document.

Summary by CodeRabbit

  • New Features

    • Added forest-tool state compute to generate DB snapshots for tipset validation, with optional export and diagnostics.
    • Added forest-tool state replay-compute to replay tipset validation from a snapshot, supporting repeated runs and timing output.
  • Documentation

    • CLI reference updated with new state-related commands and reordering of state/backup/completion sections.
  • Changelog

    • Added entries for the new state compute and replay-compute commands.


coderabbitai bot commented Oct 17, 2025

Walkthrough

Adds forest-tool state subcommands (compute, replay-compute), implements a new state_compute_cmd module with compute/replay logic, wires State into the CLI runner, adjusts several module/function visibilities, extends CAR path conversion to accept PathBuf, and updates docs/changelog.

Changes

  • Changelog & CLI docs (CHANGELOG.md, docs/docs/users/reference/cli.sh): Added unreleased changelog entries; expanded the CLI reference with state, state compute, and state replay-compute sections; added state-migration sections and reordered the completion section.
  • New state compute subcommand (src/tool/subcommands/state_compute_cmd.rs): New module adding a StateCommand enum with Compute and ReplayCompute variants and their run() implementations. ComputeCommand computes tipset state from a DB and can export a CAR snapshot; ReplayComputeCommand loads a CAR snapshot into an in-memory DB and repeats the state computation for benchmarking.
  • CLI wiring (src/tool/main.rs, src/tool/subcommands/mod.rs): Added a State variant to the Subcommand enum, introduced the state_compute_cmd module, and dispatched Subcommand::State asynchronously in the CLI runner.
  • API/test snapshot visibility changes (src/tool/subcommands/api_cmd.rs, src/tool/subcommands/api_cmd/api_compare_tests.rs, src/tool/subcommands/api_cmd/generate_test_snapshot.rs): Adjusted visibilities: the generate_test_snapshot module to pub(super), the TestDump struct to pub, and the load_db function to pub; removed a genesis_header.clone() by moving ownership.
  • CAR PathBuf conversion (src/db/car/any.rs): Added a TryFrom<&PathBuf> impl for AnyCar<EitherMmapOrRandomAccessFile> that delegates to the existing TryFrom<&Path>, and updated imports to include PathBuf.

Sequence Diagram(s)

sequenceDiagram
    actor User
    participant CLI as forest-tool CLI
    participant StateCmd as StateCommand::Compute
    participant DB as Read-only DB (ManyCar/ParityDb)
    participant ChainStore as ChainStore
    participant SM as StateManager
    participant CAR as CAR Export

    User->>CLI: forest-tool state compute --epoch N --chain <network> [--db <path>] [--export_db_to <path>]
    CLI->>StateCmd: run()
    StateCmd->>DB: load read-only snapshot (load_db)
    StateCmd->>ChainStore: build ChainStore (with genesis header)
    StateCmd->>ChainStore: resolve tipset at epoch N
    StateCmd->>SM: create StateManager
    StateCmd->>SM: compute tipset state
    SM-->>StateCmd: state_root, receipt_root
    StateCmd->>CAR: optionally export CAR snapshot
    StateCmd->>User: print epoch, state_root, receipt_root, snapshot size
sequenceDiagram
    actor User
    participant CLI as forest-tool CLI
    participant ReplayCmd as StateCommand::ReplayCompute
    participant AnyCar as AnyCar (load snapshot)
    participant MemDB as In-memory DB (ManyCar<MemoryDB>)
    participant ChainStore as ChainStore
    participant SM as StateManager

    User->>CLI: forest-tool state replay-compute --snapshot <path> --chain <network> -n <count>
    CLI->>ReplayCmd: run()
    ReplayCmd->>AnyCar: load CAR snapshot
    AnyCar-->>ReplayCmd: heaviest tipset & epoch
    ReplayCmd->>MemDB: build in-memory DB from snapshot
    ReplayCmd->>ChainStore: build ChainStore (with genesis header)
    loop n times
        ReplayCmd->>SM: compute tipset state
        SM-->>ReplayCmd: state_root, receipt_root, duration
        ReplayCmd->>User: print epoch, state_root, receipt_root, duration
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Suggested reviewers

  • elmattic
  • akaladarshi
  • LesnyRumcajs

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

  • Docstring Coverage ⚠️ Warning: docstring coverage is 0.00%, below the required threshold of 80.00%. Run @coderabbitai generate docstrings to improve coverage.

✅ Passed checks (2 passed)

  • Description Check ✅ Passed: check skipped because CodeRabbit's high-level summary is enabled.
  • Title Check ✅ Passed: the PR title "feat: forest-tool state compute" is clearly related to the changeset and accurately describes a major component of the changes. However, the PR introduces two complementary features, forest-tool state compute and forest-tool state replay-compute, and the title captures only one of them, so it reflects part rather than the complete scope of the changeset.

@hanabi1224 hanabi1224 marked this pull request as ready for review October 17, 2025 12:25
@hanabi1224 hanabi1224 requested a review from a team as a code owner October 17, 2025 12:25
@hanabi1224 hanabi1224 requested review from akaladarshi and elmattic and removed request for a team October 17, 2025 12:25

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 2

🧹 Nitpick comments (4)
src/db/car/any.rs (1)

141-146: LGTM with an optional idiom note.

The implementation correctly delegates to the existing TryFrom<&Path> via as_path(). This provides convenient conversion for callers holding &PathBuf.

Note: Rust API guidelines typically favor accepting &Path over &PathBuf since PathBuf derefs to Path, making &Path more flexible. However, as a trait implementation (rather than a function parameter), this may simplify specific call sites. If this isn't used frequently, consider whether callers could use .as_ref() or .as_path() with the existing TryFrom<&Path> instead.

src/tool/subcommands/state_compute_cmd.rs (3)

81-81: Avoid unwraps on fallible constructors; add context

Replace unwrap() with ? plus anyhow::Context for clearer errors.

+use anyhow::Context as _;

 ...
-            )
-            .unwrap(),
+            )
+            .context("creating ChainStore")?,
 ...
-        let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());
+        let state_manager = Arc::new(
+            StateManager::new(chain_store.clone()).context("creating StateManager")?
+        );
 ...
-            )
-            .unwrap(),
+            )
+            .context("creating ChainStore (replay)")?,
 ...
-        let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());
+        let state_manager = Arc::new(
+            StateManager::new(chain_store.clone()).context("creating StateManager (replay)")?
+        );

Also applies to: 90-90, 145-145


100-107: Stream snapshot to file when exporting to avoid double buffering

Currently you always build a full in-memory Vec<u8> then optionally write it to disk. When --export-db-to is set, write directly to the file; fall back to in-memory only when not exporting.

-        let mut db_snapshot = vec![];
-        db.export_forest_car(&mut db_snapshot).await?;
-        println!(
-            "epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, db_snapshot_size: {}",
-            human_bytes::human_bytes(db_snapshot.len() as f64)
-        );
-        if let Some(export_db_to) = export_db_to {
-            std::fs::write(export_db_to, db_snapshot)?;
-        }
+        if let Some(export_db_to) = export_db_to {
+            let mut file = std::fs::File::create(&export_db_to)?;
+            db.export_forest_car(&mut file).await?;
+            let size = file.metadata()?.len();
+            println!(
+                "epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, db_snapshot_size: {}",
+                human_bytes::human_bytes(size as u128)
+            );
+        } else {
+            let mut db_snapshot = Vec::new();
+            db.export_forest_car(&mut db_snapshot).await?;
+            println!(
+                "epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, db_snapshot_size: {}",
+                human_bytes::human_bytes(db_snapshot.len() as u128)
+            );
+        }

153-164: Minor: consistent byte/time formatting across commands

You’re using humantime and human_bytes—good. Consider adding the same byte-size line to replay runs (e.g., snapshot size once before the loop) for parity with compute. Optional.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 15ea901 and e207be1.

📒 Files selected for processing (9)
  • CHANGELOG.md (1 hunks)
  • docs/docs/users/reference/cli.sh (1 hunks)
  • src/db/car/any.rs (2 hunks)
  • src/tool/main.rs (1 hunks)
  • src/tool/subcommands/api_cmd.rs (1 hunks)
  • src/tool/subcommands/api_cmd/api_compare_tests.rs (1 hunks)
  • src/tool/subcommands/api_cmd/generate_test_snapshot.rs (2 hunks)
  • src/tool/subcommands/mod.rs (2 hunks)
  • src/tool/subcommands/state_compute_cmd.rs (1 hunks)
🧰 Additional context used
🧬 Code graph analysis (3)
src/tool/subcommands/state_compute_cmd.rs (6)
src/cli_shared/mod.rs (2)
  • chain_path (20-22)
  • read_config (24-41)
src/db/mod.rs (1)
  • db_root (320-322)
src/genesis/mod.rs (1)
  • read_genesis_header (21-44)
src/tool/subcommands/api_cmd/generate_test_snapshot.rs (3)
  • load_db (73-79)
  • new (190-195)
  • setting_keys (250-252)
src/db/car/any.rs (4)
  • new (34-53)
  • try_from (129-131)
  • try_from (136-138)
  • try_from (143-145)
src/networks/mod.rs (1)
  • from_chain (404-414)
src/db/car/any.rs (2)
src/db/car/forest.rs (2)
  • std (109-109)
  • try_from (241-243)
src/db/car/many.rs (2)
  • try_from (157-159)
  • try_from (164-166)
src/tool/subcommands/api_cmd/generate_test_snapshot.rs (1)
src/tool/subcommands/shed_cmd/migration.rs (1)
  • load_db (101-107)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (9)
  • GitHub Check: Build forest binaries on Linux AMD64
  • GitHub Check: tests-release
  • GitHub Check: tests
  • GitHub Check: Check
  • GitHub Check: cargo-publish-dry-run
  • GitHub Check: Build MacOS
  • GitHub Check: Build Ubuntu
  • GitHub Check: Deploy to Cloudflare Pages
  • GitHub Check: All lint checks
🔇 Additional comments (2)
src/db/car/any.rs (1)

21-21: LGTM!

The import update correctly adds PathBuf to support the new TryFrom implementation below.

src/tool/subcommands/state_compute_cmd.rs (1)

101-104: Let me verify the actual state by checking the codebase more carefully:

Based on the verification, I found the human_bytes function in version 0.2.1 accepts both u32 directly and u64 as f64, suggesting it uses generic type handling. However, I cannot definitively verify the exact API signature for version 0.4 from web sources. The inconsistency in the codebase reveals that:

  • archive_cmd.rs:336 passes index_size_bytes without an f64 cast
  • state_compute_cmd.rs:103 casts with as f64
  • snapshot.rs:357 also casts with as f64

If the code currently compiles with these different patterns, the function likely accepts multiple numeric types. The review comment's rationale about "precision loss" for large snapshots is reasonable (f64 loses precision at very large byte counts), but the suggestion to use u128 specifically requires verification of the 0.4 API.

Use integer type for human_bytes to avoid precision loss

The current code casts to f64, which loses precision for large byte counts. However, before applying the suggested change to u128, verify that version 0.4 of human_bytes accepts u128. The codebase already uses this function inconsistently: archive_cmd.rs:336 passes an integer directly without casting, while state_compute_cmd.rs:103 and snapshot.rs:357 both use f64 casts. Align all usages after confirming the supported input types for human_bytes 0.4.

mod net_cmd;
mod shed_cmd;
mod snapshot_cmd;
mod state_compute_cmd;

⚠️ Potential issue | 🔴 Critical

Fix “private type in public interface” for Subcommand::State

Subcommand is pub, but it exposes state_compute_cmd::StateCommand from a private module, which breaks visibility (E0446). Re-export the type and reference the re-export, or make the module public. Minimal fix:

 mod state_compute_cmd;
+pub use self::state_compute_cmd::StateCommand;

 ...
-    #[command(subcommand)]
-    State(state_compute_cmd::StateCommand),
+    #[command(subcommand)]
+    State(StateCommand),

Also applies to: 85-87

🤖 Prompt for AI Agents
In src/tool/subcommands/mod.rs around lines 15 and also affecting lines 85-87,
Subcommand::State exposes StateCommand from a private module causing a "private
type in public interface" error; fix by either making the module public (change
the module declaration to pub mod state_compute_cmd) or re-exporting the type
(add pub use state_compute_cmd::StateCommand) and update Subcommand::State to
reference the re-exported StateCommand (e.g., use
crate::tool::subcommands::StateCommand) so the public enum only exposes public
types.

Comment on lines 89 to 91
SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;
let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());


@coderabbitai coderabbitai bot Oct 17, 2025


⚠️ Potential issue | 🔴 Critical

Wrong store used when writing HEAD; likely compile error and unintended DB mutation

SettingsStoreExt::write_obj(&db.tracker, ...) targets the read-ops tracker, not the settings store. This won’t compile and, if pointed to the real DB, could also mutate a live DB’s HEAD. Use the DB wrapper itself, or drop the HEAD write entirely (not needed when passing ts to compute_tipset_state).

-        SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;
+        // Optional: this write is not strictly necessary for compute; remove if you want no side-effects.
+        SettingsStoreExt::write_obj(&*db, crate::db::setting_keys::HEAD_KEY, &ts.key())?;

If you prefer method syntax:

-        SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;
+        db.write_obj(crate::db::setting_keys::HEAD_KEY, &ts.key())?;
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;
let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());
// Optional: this write is not strictly necessary for compute; remove if you want no side-effects.
SettingsStoreExt::write_obj(&*db, crate::db::setting_keys::HEAD_KEY, &ts.key())?;
let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());
🤖 Prompt for AI Agents
In src/tool/subcommands/state_compute_cmd.rs around lines 89 to 91, the call
uses SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY,
ts.key()) which targets the tracker store (causing a compile error and risking
mutating the wrong DB); change the call to use the DB wrapper (e.g.
SettingsStoreExt::write_obj(&db, crate::db::setting_keys::HEAD_KEY, ts.key()))
or simply remove the HEAD write entirely since ts is passed into
compute_tipset_state; ensure you use the correct settings store reference or
drop the write to avoid unintended DB mutation.


@hanabi1224 hanabi1224 Oct 17, 2025


This is intended: setting the head is required for exporting the snapshot from the tracker.


🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Find the ReadOpsTrackingStore definition and understand its structure
ast-grep --pattern 'struct ReadOpsTrackingStore'

Length of output: 0


🏁 Script executed:

#!/bin/bash
# Search for how the snapshot/export mechanism works
rg -nP -C5 'export.*snapshot|snapshot.*export' --type=rust

Length of output: 19887


🏁 Script executed:

#!/bin/bash
# Look for the tracker field usage in the codebase
rg -nP '\btracker\b' --type=rust -C3 -g '!target/**'

Length of output: 17067


🏁 Script executed:

#!/bin/bash
# Find where export_snapshot_to_forest_car is implemented/called
rg -nP 'export_snapshot_to_forest_car|export.*forest.*car' --type=rust -C5

Length of output: 7646


@hanabi1224, thank you for the clarification! You're absolutely right. I now understand the architecture:

The ReadOpsTrackingStore uses its tracker (a MemoryDB) to accumulate all data accessed during the state computation. When export_forest_car is called, it exports from this tracker, creating a minimal snapshot containing only the blocks that were actually read.

Writing HEAD_KEY to db.tracker is indeed necessary because MemoryDB::export_forest_car reads the HEAD_KEY from the store it's exporting. Without this write, the export would fail.

I apologize for the incorrect initial assessment. The code is working as designed.


✏️ Learnings added
Learnt from: hanabi1224
PR: ChainSafe/forest#6167
File: src/tool/subcommands/state_compute_cmd.rs:89-91
Timestamp: 2025-10-17T14:24:47.006Z
Learning: In `src/tool/subcommands/state_compute_cmd.rs`, when using `ReadOpsTrackingStore` to generate minimal snapshots, `HEAD_KEY` should be written to `db.tracker` (not `db` itself) before calling `export_forest_car()`, because the export reads from the tracker MemoryDB which accumulates only the accessed data during computation.


chain_config,
genesis_header,
)
.unwrap(),

why unwrap? Is it infallible?


Fixed.

)
.unwrap(),
);
let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());

why unwrap? Is it infallible?


Fixed.

)?;
let epoch = ts.epoch();
SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;
let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());

why unwrap? Is it infallible?


Fixed.

chain_config,
genesis_header,
)
.unwrap(),

why unwrap? Is it infallible?


Fixed.

Co-authored-by: Hubert <hubert@chainsafe.io>

@LesnyRumcajs LesnyRumcajs left a comment


I understand those are short-lived commands, but panics are really a bad user experience. Let's propagate errors with helpful messages, by the book.


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (4)
src/tool/subcommands/state_compute_cmd.rs (4)

61-66: Add error context when resolving DB path/config.

Wrap fallible ops with anyhow::Context for actionable CLI errors.

Apply within the selected lines:

-            let (_, config) = read_config(None, Some(chain.clone()))?;
-            db_root(&chain_path(&config))?
+            let (_, config) = read_config(None, Some(chain.clone()))
+                .context("reading forest config")?;
+            db_root(&chain_path(&config))
+                .context("resolving database root path")?

Outside the selected lines, add the import near the other uses:

+use anyhow::Context;

72-82: Avoid unwraps; propagate with context.

Prefer bubbling errors with context over panicking in a CLI tool.

-        let chain_store = Arc::new(
-            ChainStore::new(
+        let chain_store = Arc::new(
+            ChainStore::new(
                 db.clone(),
                 db.clone(),
                 db.clone(),
                 db.clone(),
                 chain_config,
                 genesis_header,
-            )
-            .unwrap(),
+            )
+            .context("initializing ChainStore")?,
         );
-        let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());
+        let state_manager = Arc::new(
+            StateManager::new(chain_store.clone()).context("initializing StateManager")?,
+        );
-        let chain_store = Arc::new(
-            ChainStore::new(
+        let chain_store = Arc::new(
+            ChainStore::new(
                 db.clone(),
                 db.clone(),
                 db.clone(),
                 db.clone(),
                 chain_config,
                 genesis_header,
-            )
-            .unwrap(),
+            )
+            .context("initializing ChainStore")?,
         );
-        let state_manager = Arc::new(StateManager::new(chain_store.clone()).unwrap());
+        let state_manager = Arc::new(
+            StateManager::new(chain_store.clone()).context("initializing StateManager")?,
+        );

Also applies to: 90-90, 146-146, 148-148


89-89: Keep writing HEAD to the tracker; add a brief comment to document intent.

This is correct for exporting minimal snapshots from ReadOpsTrackingStore. Add a note to prevent future “wrong store” regressions.

-        SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;
+        // Write HEAD to the tracker MemoryDB so export_forest_car() uses the accessed subset.
+        // This intentionally does NOT mutate the underlying on-disk DB.
+        SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;

Based on learnings


149-166: Print iteration index to ease benchmarking output parsing.

Small UX win when running -n > 1.

-        for _ in 0..n.get() {
+        for i in 1..=n.get() {
             let start = Instant::now();
@@
-            println!(
-                "epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, took {}.",
-                humantime::format_duration(start.elapsed())
-            );
+            println!(
+                "run {}/{}: epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, took {}.",
+                i,
+                n.get(),
+                humantime::format_duration(start.elapsed())
+            );
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 544c005 and fff6e7b.

📒 Files selected for processing (1)
  • src/tool/subcommands/state_compute_cmd.rs (1 hunks)
🧰 Additional context used
🧠 Learnings (2)
📓 Common learnings
Learnt from: hanabi1224
PR: ChainSafe/forest#6167
File: src/tool/subcommands/state_compute_cmd.rs:89-91
Timestamp: 2025-10-17T14:24:47.046Z
Learning: In `src/tool/subcommands/state_compute_cmd.rs`, when using `ReadOpsTrackingStore` to generate minimal snapshots, `HEAD_KEY` should be written to `db.tracker` (not `db` itself) before calling `export_forest_car()`, because the export reads from the tracker MemoryDB which accumulates only the accessed data during computation.
📚 Learning: 2025-10-17T14:24:47.046Z
Learnt from: hanabi1224
PR: ChainSafe/forest#6167
File: src/tool/subcommands/state_compute_cmd.rs:89-91
Timestamp: 2025-10-17T14:24:47.046Z
Learning: In `src/tool/subcommands/state_compute_cmd.rs`, when using `ReadOpsTrackingStore` to generate minimal snapshots, `HEAD_KEY` should be written to `db.tracker` (not `db` itself) before calling `export_forest_car()`, because the export reads from the tracker MemoryDB which accumulates only the accessed data during computation.

Applied to files:

  • src/tool/subcommands/state_compute_cmd.rs
🧬 Code graph analysis (1)
src/tool/subcommands/state_compute_cmd.rs (5)
src/cli_shared/mod.rs (2)
  • chain_path (20-22)
  • read_config (24-41)
src/db/mod.rs (1)
  • db_root (320-322)
src/genesis/mod.rs (1)
  • read_genesis_header (21-44)
src/tool/subcommands/api_cmd/generate_test_snapshot.rs (2)
  • load_db (73-79)
  • new (190-195)
src/db/car/any.rs (4)
  • new (34-53)
  • try_from (129-131)
  • try_from (136-138)
  • try_from (143-145)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (9)
  • GitHub Check: cargo-publish-dry-run
  • GitHub Check: Build MacOS
  • GitHub Check: Build Ubuntu
  • GitHub Check: All lint checks
  • GitHub Check: tests
  • GitHub Check: tests-release
  • GitHub Check: Check
  • GitHub Check: Build forest binaries on Linux AMD64
  • GitHub Check: Deploy to Cloudflare Pages
🔇 Additional comments (3)
src/tool/subcommands/state_compute_cmd.rs (3)

83-88: LGTM: robust tipset resolution across null rounds.

Using ResolveNullTipset::TakeOlder with heaviest tipset is appropriate here.


122-123: Infallible unwrap is fine here.

NonZeroUsize::new(1).unwrap() is safe by construction; no change needed.


99-107: Review comment is incorrect and will not compile.

The suggested diff uses std::fs::File and std::io::BufWriter, which implement the synchronous std::io::Write trait. However, export_forest_car requires W: tokio::io::AsyncWrite + Unpin—these are incompatible types. The code will fail to compile with the proposed changes.

The current implementation already works correctly: Vec<u8> properly implements tokio::io::AsyncWrite, so there is no trait compatibility issue. Additionally, the code is already correctly writing HEAD_KEY to db.tracker (line 89), as intended for ReadOpsTrackingStore snapshots.

If streaming is actually desired for performance reasons, use tokio::fs::File and tokio::io::BufWriter instead—but this refactoring is not necessary for correctness.

Likely an incorrect or invalid review comment.

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

♻️ Duplicate comments (1)
src/tool/subcommands/state_compute_cmd.rs (1)

119-120: Default for n is fine

NonZeroUsize::new(1).unwrap() is infallible here; keeping it is acceptable.

🧹 Nitpick comments (4)
src/tool/subcommands/state_compute_cmd.rs (4)

86-86: Writing HEAD to tracker is correct for minimal CAR export; add a clarifying comment and avoid an extra alloc

This HEAD write must target the tracker MemoryDB so export_forest_car roots the snapshot correctly. Add a short comment to prevent regressions, and pass a reference to avoid an extra allocation.

Based on learnings

-        SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, ts.key())?;
+        // Export reads from the tracker MemoryDB; seed HEAD so the CAR roots correctly.
+        SettingsStoreExt::write_obj(&db.tracker, crate::db::setting_keys::HEAD_KEY, &ts.key())?;

96-104: Avoid buffering entire snapshot in memory when writing to a file

Exporting to Vec allocates the full CAR in memory even when --export-db-to is provided. Stream directly to the file and derive size from metadata; fall back to Vec only when no path is given.

-        let mut db_snapshot = vec![];
-        db.export_forest_car(&mut db_snapshot).await?;
-        println!(
-            "epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, db_snapshot_size: {}",
-            human_bytes::human_bytes(db_snapshot.len() as f64)
-        );
-        if let Some(export_db_to) = export_db_to {
-            std::fs::write(export_db_to, db_snapshot)?;
-        }
+        if let Some(path) = export_db_to {
+            use std::io::Write;
+            let file = std::fs::File::create(&path)?;
+            let mut writer = std::io::BufWriter::new(file);
+            db.export_forest_car(&mut writer).await?;
+            writer.flush()?;
+            let size = std::fs::metadata(&path)?.len();
+            println!(
+                "epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, db_snapshot_size: {}",
+                human_bytes::human_bytes(size as f64)
+            );
+        } else {
+            let mut buf = Vec::new();
+            db.export_forest_car(&mut buf).await?;
+            println!(
+                "epoch: {epoch}, state_root: {state_root}, receipt_root: {receipt_root}, db_snapshot_size: {}",
+                human_bytes::human_bytes(buf.len() as f64)
+            );
+        }

45-51: CLI UX: add ValueHint for path args

Helps shell completions and tooling.

-    #[arg(long)]
+    #[arg(long, value_hint = clap::ValueHint::DirPath)]
     db: Option<PathBuf>,
-    #[arg(long)]
+    #[arg(long, value_hint = clap::ValueHint::FilePath)]
     export_db_to: Option<PathBuf>,

112-121: Snapshot path: add ValueHint

Small UX win for positional arg.

 pub struct ReplayComputeCommand {
     /// Path to the database snapshot `CAR` file generated by `forest-tool state compute`
-    snapshot: PathBuf,
+    #[arg(value_hint = clap::ValueHint::FilePath)]
+    snapshot: PathBuf,
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between fff6e7b and d0c08c6.

📒 Files selected for processing (1)
  • src/tool/subcommands/state_compute_cmd.rs (1 hunks)
🧰 Additional context used
🧠 Learnings (2)
📓 Common learnings
Learnt from: hanabi1224
PR: ChainSafe/forest#6167
File: src/tool/subcommands/state_compute_cmd.rs:89-91
Timestamp: 2025-10-17T14:24:47.046Z
Learning: In `src/tool/subcommands/state_compute_cmd.rs`, when using `ReadOpsTrackingStore` to generate minimal snapshots, `HEAD_KEY` should be written to `db.tracker` (not `db` itself) before calling `export_forest_car()`, because the export reads from the tracker MemoryDB which accumulates only the accessed data during computation.
📚 Learning: 2025-10-17T14:24:47.046Z
Learnt from: hanabi1224
PR: ChainSafe/forest#6167
File: src/tool/subcommands/state_compute_cmd.rs:89-91
Timestamp: 2025-10-17T14:24:47.046Z
Learning: In `src/tool/subcommands/state_compute_cmd.rs`, when using `ReadOpsTrackingStore` to generate minimal snapshots, `HEAD_KEY` should be written to `db.tracker` (not `db` itself) before calling `export_forest_car()`, because the export reads from the tracker MemoryDB which accumulates only the accessed data during computation.

Applied to files:

  • src/tool/subcommands/state_compute_cmd.rs
🧬 Code graph analysis (1)
src/tool/subcommands/state_compute_cmd.rs (6)
src/cli_shared/mod.rs (2)
  • chain_path (20-22)
  • read_config (24-41)
src/db/mod.rs (1)
  • db_root (320-322)
src/genesis/mod.rs (1)
  • read_genesis_header (21-44)
src/tool/subcommands/api_cmd/generate_test_snapshot.rs (2)
  • load_db (73-79)
  • new (190-195)
src/db/car/any.rs (4)
  • new (34-53)
  • try_from (129-131)
  • try_from (136-138)
  • try_from (143-145)
src/networks/mod.rs (1)
  • from_chain (404-414)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (8)
  • GitHub Check: tests
  • GitHub Check: tests-release
  • GitHub Check: Build forest binaries on Linux AMD64
  • GitHub Check: All lint checks
  • GitHub Check: cargo-publish-dry-run
  • GitHub Check: Build MacOS
  • GitHub Check: Build Ubuntu
  • GitHub Check: Deploy to Cloudflare Pages
🔇 Additional comments (3)
src/tool/subcommands/state_compute_cmd.rs (3)

61-66: DB root resolution looks correct

Falls back to config-driven chain path when --db is omitted. Clean and predictable.


126-133: PathBuf TryFrom and genesis init LGTM

Using AnyCar::try_from(&snapshot) and loading genesis via chain config over the in‑memory store is sound.


102-104: The review comment is incorrect—export_forest_car() already produces zstd-compressed .forest.car.zst format

The code at lines 102-104 is correct. The export_forest_car() method (src/db/memory.rs lines 28-56) explicitly:

  1. Calls forest::Encoder::compress_stream_default() to compress blocks into zstd frames
  2. Calls forest::Encoder::write() to write the compressed frames in .forest.car.zst format

The data written to disk is already compressed; no additional compression is needed. The PR summary correctly describes the output as .forest.car.zst.

Likely an incorrect or invalid review comment.

@hanabi1224 hanabi1224 enabled auto-merge October 24, 2025 10:47
@hanabi1224 hanabi1224 added this pull request to the merge queue Oct 28, 2025
Merged via the queue into main with commit 3619c44 Oct 28, 2025
51 checks passed
@hanabi1224 hanabi1224 deleted the hm/tool-state-compute branch October 28, 2025 12:00