Allow inputting a base hash in Regression workflow by lnkuiper · Pull Request #15082 · duckdb/duckdb

lnkuiper · 2024-12-02T09:14:25Z

Since #14973, our nightly no longer runs regressions against itself (always succeeding) but against the last successful nightly regression, now sometimes failing. This has caught a CSV regression.

If we don't fix this regression, subsequent workflow runs will fail indefinitely. Sometimes, however, we want to accept regressions, for example, so the CSV reader can parse more timestamp types (at the cost of taking more time - at least, I think that's what's happening here). In such cases, we need to re-run the regression workflow against itself so that it succeeds.

This PR adds an input parameter to run the regression test against a specific DuckDB version. This would also allow us to run the current main against v.1.13.

carlopi

Thanks!!
(needs to be undrafted)

Note that currently this is triggered only on duckdb/duckdb main, so CI is not really relevant once the YAML parses. (I think it would be cool to refactor this, at some later time, to be more flexible)

After this is merged, Regression should be triggered for example like:

gh workflow run Regression --repo duckdb/duckdb --raw-field base_hash=e141994cf832e2362952d432e9d38357d48e5919

( this is on tonight nightly, but any value should do there)

That should make so that current main is tested vs latest nightly. That should succeed and make so that on tonight run the latest successful one (that is current main) is used as a reference.

carlopi · 2024-12-02T12:04:56Z

Alternative would be checking against main of 24 hour ago. I think that is also OK, but I see more chances of failure go unnoticed (we don't have yet a system in place to track failure reliably, and I think this can just be moved further).

Pro of checking last successful:

requires active action (either fixing the regression or resetting baseline), so regression are harder to ignore

Pro of using 24 hours ago:

maintenance free

Now that I wrote it down "maintenance free" seems hard to pass on.
@Mytherin?

Either way I would keep the logic for adding a custom baseline, since it can be handy to be able to trigger say "main vs 1.1.3" and see what happened.

lnkuiper · 2024-12-02T12:10:38Z

@carlopi Should be "Maintenance free with a higher chance of letting regressions slip through".

Not saying it's a bad idea, just saying that it's not a win/win - it's a trade-off.

Mytherin · 2024-12-02T12:13:07Z

Since we're running regression tests in between released versions and in the weekly regression runner anyway, I think we should be able to catch all regressions regardless between major releases, so I would say adding an extra layer where we can see this is likely not worth it

Top-N: Perform global boundary checking before doing sort-key conversion (duckdb/duckdb#15087) Allow inputting a base hash in Regression workflow (duckdb/duckdb#15082) Avoid building for Python 3.7 on Windows (duckdb/duckdb#15085)

Top-N: Perform global boundary checking before doing sort-key conversion (duckdb/duckdb#15087) Allow inputting a base hash in Regression workflow (duckdb/duckdb#15082) Avoid building for Python 3.7 on Windows (duckdb/duckdb#15085) Co-authored-by: krlmlr <krlmlr@users.noreply.github.com>

allow hash to be input

5343462

lnkuiper requested a review from carlopi December 2, 2024 09:14

properly specify input type

af8d9d5

duckdb-draftbot marked this pull request as draft December 2, 2024 09:18

carlopi approved these changes Dec 2, 2024

View reviewed changes

lnkuiper marked this pull request as ready for review December 2, 2024 09:53

carlopi added the Ready To Merge label Dec 2, 2024

carlopi added Ready For Review and removed Ready To Merge labels Dec 2, 2024

carlopi requested a review from Mytherin December 2, 2024 12:06

grab latest - not latest successful

c11b820

duckdb-draftbot marked this pull request as draft December 2, 2024 12:17

lnkuiper marked this pull request as ready for review December 2, 2024 12:17

lnkuiper added Ready To Merge and removed Ready For Review labels Dec 2, 2024

Mytherin merged commit 632335b into duckdb:main Dec 2, 2024

github-actions bot mentioned this pull request Dec 28, 2024

vendor: Update vendored sources to duckdb/duckdb@6064047fb295f723d55dd3f29aa4422852bd9383 duckdb/duckdb-r#780

Merged

lnkuiper deleted the regression_input branch April 14, 2025 09:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow inputting a base hash in Regression workflow#15082

Allow inputting a base hash in Regression workflow#15082
Mytherin merged 3 commits intoduckdb:mainfrom
lnkuiper:regression_input

lnkuiper commented Dec 2, 2024

Uh oh!

carlopi left a comment •

edited

Loading

Uh oh!

carlopi commented Dec 2, 2024

Uh oh!

lnkuiper commented Dec 2, 2024

Uh oh!

Mytherin commented Dec 2, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

lnkuiper commented Dec 2, 2024

Uh oh!

carlopi left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

carlopi commented Dec 2, 2024

Uh oh!

lnkuiper commented Dec 2, 2024

Uh oh!

Mytherin commented Dec 2, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

carlopi left a comment •

edited

Loading