Run benchmark queries repeatedly for stable measurements #74

Merged
tjgreen42 merged 1 commit into main from benchmark-repeat-queries
Dec 16, 2025
@tjgreen42
Collaborator

Summary

  • Each benchmark query now runs 10 times, reporting median/min/max times
  • Reduces variance from cache warming and system noise
  • Updated extract_metrics.sh to parse median from new output format
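As a rough illustration of the median/min/max reporting described above, here is a minimal shell sketch. The helper name `summarize_times` is hypothetical and not part of this PR, which computes these values inside a PL/pgSQL function. The sketch assumes one timing in milliseconds per input line, and it averages the two middle values when the run count is even (as it is with 10 runs); the PR's own median rule may differ.

```shell
# Hypothetical helper (not from the PR): summarize a list of timings.
# Input: one timing in milliseconds per line on stdin.
# Output: one line shaped like the format quoted in this PR.
summarize_times() {
  LC_ALL=C sort -n | LC_ALL=C awk '
    { t[NR] = $1 }                      # input is sorted, so t[1] is min, t[NR] is max
    END {
      if (NR == 0) exit 1               # no timings: report failure
      # Assumption: for an even count, average the two middle values.
      if (NR % 2) median = t[(NR + 1) / 2]
      else        median = (t[NR / 2] + t[NR / 2 + 1]) / 2
      printf "Execution Time: %.3f ms (min=%.3f, max=%.3f)\n", median, t[1], t[NR]
    }'
}

# Usage: one slow outlier (30.4) moves the max but not the median.
printf '%s\n' 12.1 11.9 30.4 12.0 12.3 | summarize_times
```

The median is used as the headline number because a single cold-cache or noisy run shifts the max (or min) while leaving the middle of the distribution largely unchanged.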

Testing

Manually verified that the query function works locally. Benchmark results will be validated on the next nightly run.

Each benchmark query is now executed 10 times with median, min, and max
times reported. This reduces variance from cache warming and system noise.

Changes:
- Add benchmark_query() PL/pgSQL function to both Cranfield and MS MARCO
  query files that runs queries in a loop and calculates median
- Update extract_metrics.sh to parse new output format (extracts median
  from "Execution Time: X.XXX ms (min=..., max=...)" lines)
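The parsing step in the last item could look roughly like the sketch below. This is not the code in extract_metrics.sh: the function name `extract_median` and the exact `sed` pattern are assumptions, based only on the "Execution Time: X.XXX ms (min=..., max=...)" format quoted above.

```shell
# Hypothetical sketch (not the actual extract_metrics.sh code).
# Reads benchmark output on stdin and prints the median, in ms, from every
# line of the form "Execution Time: X.XXX ms (min=..., max=...)".
extract_median() {
  # -n plus the trailing p prints only lines where the substitution matched,
  # so unrelated output is dropped; the "(min=" anchor also skips plain
  # "Execution Time: X ms" lines that carry no min/max summary.
  sed -n 's/.*Execution Time: \([0-9][0-9.]*\) ms (min=.*/\1/p'
}

# Usage:
echo 'Execution Time: 12.345 ms (min=11.900, max=30.400)' | extract_median
```

Anchoring on the `(min=` suffix is a deliberate choice in this sketch: it keeps the parser from picking up single-run timing lines if both formats ever appear in the same log.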
@tjgreen42 tjgreen42 merged commit 866fd1a into main Dec 16, 2025
1 check passed
@tjgreen42 tjgreen42 deleted the benchmark-repeat-queries branch December 16, 2025 06:01