bench: add MS-MARCO v2 full benchmark and profiling scripts by tjgreen42 · Pull Request #271 · timescale/pg_textsearch

tjgreen42 · 2026-03-09T18:57:04Z

Summary

Add run_full_benchmark.sh end-to-end orchestrator for reproducible pg_textsearch vs ParadeDB benchmarks on 138M passages
Add power test scripts (setup_power_test.sql, power_tapir.sql, power_systemx.sql) for concurrent throughput measurement via pgbench
Add profiling tools (profile_build.sh, profile_queries.sql) for perf/flamegraph and per-query latency analysis

Details

run_full_benchmark.sh supports step-based execution:

./run_full_benchmark.sh <step>
  env            - capture machine specs, PG config, extensions
  build-tapir    - build pg_textsearch BM25 index
  build-systemx  - build ParadeDB BM25 index
  query-tapir    - single-client latency benchmarks
  query-systemx  - single-client latency benchmarks
  power-tapir    - concurrent throughput (pgbench)
  power-systemx  - concurrent throughput (pgbench)
  summary        - side-by-side comparison table
  all            - run everything in sequence

Test plan

Verify run_full_benchmark.sh env captures system info
Run power-tapir step end-to-end on a loaded corpus
Run profile_queries.sql and verify latency breakdown output

Add end-to-end benchmark orchestrator (run_full_benchmark.sh) that runs build, single-client latency, and concurrent throughput (power test) for both pg_textsearch and ParadeDB on the 138M passage corpus. Scripts added: - run_full_benchmark.sh: orchestrator with step-based execution - setup_power_test.sql: creates pgbench query table with dense IDs - power_tapir.sql: pgbench script for pg_textsearch throughput - power_systemx.sql: pgbench script for ParadeDB throughput - profile_build.sh: perf/flamegraph profiling for index builds - profile_queries.sql: per-query latency profiling by token bucket

Cherry-picked from release-1.0.0 branch.

…DB (#297) ## Summary - Add weighted query pool setup and pgbench scripts for concurrent throughput benchmarking with MS-MARCO v1 lexeme distribution - Refresh comparison.html with latest benchmark numbers (commit f31d1af) - Cherry-pick MS-MARCO v2 full benchmark and profiling scripts from release branch (#271) - Rename all `systemx` references to `paradedb` across benchmarks, CI workflow, and docs (from release branch) ## Test plan - [ ] Run MS-MARCO v2 benchmark locally to verify numbers are comparable to published results - [ ] Verify pgbench concurrent throughput scripts work end-to-end

tjgreen42 added 2 commits March 9, 2026 18:56

Merge branch 'main' into bench/msmarco-v2-benchmark-scripts

2a7446d

tjgreen42 merged commit 272c023 into main Mar 11, 2026
1 check passed

tjgreen42 deleted the bench/msmarco-v2-benchmark-scripts branch March 11, 2026 18:52

tjgreen42 added a commit that referenced this pull request Mar 27, 2026

bench: add MS-MARCO v2 full benchmark and profiling scripts (#271)

a32fd7b

Cherry-picked from release-1.0.0 branch.

tjgreen42 mentioned this pull request Mar 27, 2026

bench: add pgbench concurrent throughput and rename SystemX to ParadeDB #297

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bench: add MS-MARCO v2 full benchmark and profiling scripts#271

bench: add MS-MARCO v2 full benchmark and profiling scripts#271
tjgreen42 merged 2 commits intomainfrom
bench/msmarco-v2-benchmark-scripts

tjgreen42 commented Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

tjgreen42 commented Mar 9, 2026

Summary

Details

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant