Skip to content

Introduce separate track for SQL#223

Merged
Luegg merged 6 commits intoelastic:masterfrom
Luegg:enhance/extraSqlTrack
Dec 6, 2021
Merged

Introduce separate track for SQL#223
Luegg merged 6 commits intoelastic:masterfrom
Luegg:enhance/extraSqlTrack

Conversation

@Luegg
Copy link
Copy Markdown
Contributor

@Luegg Luegg commented Dec 3, 2021

Extracts the SQL challenge into its own track to make it easier to have separate dashboards for the SQL tasks and to reduce coupling between NOAA and SQL benchmarks.

This PR also addresses some other issues that have been identified in the meantime:

  • Force-merges the index into a single segment to reduce variability between runs
  • Adds the wait-until-merges-finish task to ensure merging finished before running the queries
  • Introduces a query_percentage track parameter as a convenience to get faster results when doing experimentations (to be taken with a huge grain of salt of course)
  • Ensures that page_timeout and request_timeout are configured correctly such that the track can be run on revisions with or without SQL: fix use of requestTimeout and pageTimeout query parameters elasticsearch#79360.

Apart from these changes, the queries themselves have not been touched.

@Luegg Luegg requested review from bpintea and dliappis December 3, 2021 16:53
Copy link
Copy Markdown
Contributor

@dliappis dliappis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you. This is much better and will allow for a separate dashboard.
In general this looks good already and my tests passed.

I left a few small comments.

sql/track.json Outdated
"corpora": [
{
"name": "noaa",
"base-url": "https://rally-tracks.elastic.co/noaa",
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Given that we redefine the mappings in this track, I wonder if we are risking problems, in the unlikely case that the noaa corpora gets updated.

Would it make sense to store the same corpora (it's 9GB uncompressed) in a folder/object of its own (and accept the duplication) but thus ensure they are independent?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good point, I agree that it's probably worth it to duplicate the compressed data (~1GB) to avoid these issues. I've created a copy in /noaa-sql and updated the path accordingly.

@dliappis dliappis self-requested a review December 6, 2021 12:56
Copy link
Copy Markdown
Contributor

@dliappis dliappis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

Let's not merge this yet, to give a chance for @bpintea to review.

Regarding backporting, what branches do you want to cover?

Copy link
Copy Markdown

@bpintea bpintea left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice, LGTM!

@Luegg
Copy link
Copy Markdown
Contributor Author

Luegg commented Dec 6, 2021

Regarding backporting, what branches do you want to cover?

My plan is to backport to the 7.12 and 7.14 branches since they already include the old SQL challenge. I also verified that all the queries run with 7.12.1.

@dliappis
Copy link
Copy Markdown
Contributor

dliappis commented Dec 6, 2021

My plan is to backport to the 7.12 and 7.14 branches since they already include the old SQL challenge. I also verified that all the queries run with 7.12.1.

Ok we are ready to merge then.

@dliappis dliappis added the backport pending Awaiting backport to stable release branch label Dec 6, 2021
@Luegg Luegg merged commit e4e4426 into elastic:master Dec 6, 2021
@Luegg Luegg deleted the enhance/extraSqlTrack branch December 6, 2021 16:06
Luegg pushed a commit to Luegg/rally-tracks that referenced this pull request Dec 6, 2021
Extracts the SQL challenge into its own track to make it easier to have separate dashboards for the SQL tasks and to reduce coupling between NOAA and SQL benchmarks.

This PR also addresses some other issues that have been identified in the meantime:

- Force-merges the index into a single segment to reduce variability between runs
- Adds the `wait-until-merges-finish` task to ensure merging finished before running the queries
- Introduces a `query_percentage` track parameter as a convenience to get faster results when doing experimentations (to be taken with a huge grain of salt of course)
- Ensures that `page_timeout` and `request_timeout` are configured correctly such that the track can be run on revisions with or without elastic/elasticsearch#79360.

Apart from these changes, the queries themselves have not been touched.
Luegg pushed a commit to Luegg/rally-tracks that referenced this pull request Dec 6, 2021
Extracts the SQL challenge into its own track to make it easier to have separate dashboards for the SQL tasks and to reduce coupling between NOAA and SQL benchmarks.

This PR also addresses some other issues that have been identified in the meantime:

- Force-merges the index into a single segment to reduce variability between runs
- Adds the `wait-until-merges-finish` task to ensure merging finished before running the queries
- Introduces a `query_percentage` track parameter as a convenience to get faster results when doing experimentations (to be taken with a huge grain of salt of course)
- Ensures that `page_timeout` and `request_timeout` are configured correctly such that the track can be run on revisions with or without elastic/elasticsearch#79360.

Apart from these changes, the queries themselves have not been touched.
Luegg pushed a commit that referenced this pull request Dec 6, 2021
Extracts the SQL challenge into its own track to make it easier to have separate dashboards for the SQL tasks and to reduce coupling between NOAA and SQL benchmarks.

This PR also addresses some other issues that have been identified in the meantime:

- Force-merges the index into a single segment to reduce variability between runs
- Adds the `wait-until-merges-finish` task to ensure merging finished before running the queries
- Introduces a `query_percentage` track parameter as a convenience to get faster results when doing experimentations (to be taken with a huge grain of salt of course)
- Ensures that `page_timeout` and `request_timeout` are configured correctly such that the track can be run on revisions with or without elastic/elasticsearch#79360.

Apart from these changes, the queries themselves have not been touched.
Luegg pushed a commit that referenced this pull request Dec 6, 2021
Extracts the SQL challenge into its own track to make it easier to have separate dashboards for the SQL tasks and to reduce coupling between NOAA and SQL benchmarks.

This PR also addresses some other issues that have been identified in the meantime:

- Force-merges the index into a single segment to reduce variability between runs
- Adds the `wait-until-merges-finish` task to ensure merging finished before running the queries
- Introduces a `query_percentage` track parameter as a convenience to get faster results when doing experimentations (to be taken with a huge grain of salt of course)
- Ensures that `page_timeout` and `request_timeout` are configured correctly such that the track can be run on revisions with or without elastic/elasticsearch#79360.

Apart from these changes, the queries themselves have not been touched.
@Luegg Luegg removed the backport pending Awaiting backport to stable release branch label Dec 6, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants