Skip to content

Extract and publish metrics per-dataset when running all benchmarks#75

Merged
tjgreen42 merged 1 commit intomainfrom
fix-benchmark-per-dataset
Dec 16, 2025
Merged

Extract and publish metrics per-dataset when running all benchmarks#75
tjgreen42 merged 1 commit intomainfrom
fix-benchmark-per-dataset

Conversation

@tjgreen42
Copy link
Copy Markdown
Collaborator

Summary

  • When running dataset=all, metrics are now extracted separately for each dataset
  • Each dataset (Cranfield, MS MARCO, Wikipedia) gets its own benchmark chart
  • Previously, only the first dataset's results were captured

Changes

  • extract_metrics.sh now accepts optional section parameter to extract from log sections
  • Workflow runs extract_metrics.sh once per dataset when running "all"
  • Separate benchmark-action publish steps for each dataset

Testing

Trigger a benchmark run with dataset=all to verify all three datasets appear separately in the results.

When running with dataset=all, metrics are now extracted and published
separately for each dataset (Cranfield, MS MARCO, Wikipedia) rather than
just the first dataset's results.

Changes:
- extract_metrics.sh now accepts optional section parameter to extract
  metrics from a specific section of the log file
- Workflow extracts metrics for each dataset separately when running all
- Each dataset's results are published to separate benchmark charts
@tjgreen42 tjgreen42 merged commit c80221e into main Dec 16, 2025
1 check passed
@tjgreen42 tjgreen42 deleted the fix-benchmark-per-dataset branch December 16, 2025 17:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant