chore: re-enable e2e tests for web-sveltekit by bahrmichael · Pull Request #63910 · sourcegraph/sourcegraph-public-snapshot

bahrmichael · 2024-07-18T12:09:06Z

The goal of this PR is to increase the stability of web-sveltekit e2e-tests so that we don't have to rely on manual runs anymore. They have previously been disabled due to a high number of failures: https://github.com/sourcegraph/sourcegraph/pull/63874

To improve the stability of web-sveltekit e2e-tests, I used sg ci bazel test //client/web-sveltekit:e2e_test --runs_per_test=N with N=5,10,15 to see which tests break under different levels of pressure on the machine. The logs looked like it was mostly timeouts, that got worse when increasing N. That means we can check where tests will break due to timeouts, but we don't really need to raise timeouts so far that it would work with N=20.

With N=5, 10 we get a good understanding if our timeouts are high enough.

You can see two CI runs here after applying higher timeouts and skipping a consistently failing test:

From logs of some other run that I don't have the link to anymore, we can see that some tests take up to 50s so a timeout of 60s (instead of the default 30s) for CI should be a good new ceiling.

Slow test file: [chromium] › src/routes/[...repo=reporev]/(validrev)/(code)/-/blob/[...path]/page.spec.ts (48.9s)
--
  | Slow test file: [chromium] › src/routes/[...repo=reporev]/(validrev)/(code)/page.spec.ts (45.0s)
  | Slow test file: [chromium] › src/routes/search/page.spec.ts (40.6s)
  | Slow test file: [chromium] › src/routes/[...repo=reporev]/(validrev)/(code)/-/tree/[...path]/page.spec.ts (31.7s)
  | Slow test file: [chromium] › src/routes/layout.spec.ts (31.5s)

Test plan

CI

Changelog

bahrmichael · 2024-07-18T13:12:35Z

+    timeout: process.env.BAZEL ? 60_000 : 30_000,
+    expect: {
+        timeout: process.env.BAZEL ? 20_000 : 5_000,
+    },


With these changes, the timeout per test goes from 5s to 30s on local machines, the timeout per assertion stays at 5s; and on Bazel we allow each test to run for 60s with a 20s timeout per assertion. Things may run a bit slower there than on our dev machines.

Do you know if it is possible to collect profiles for the tests to see what is causing such a big difference between CI vs locally? Based on what I know about the recent Intel and Apple chips, a 2x or 3x speedup for single-core performance seems like quite a lot, so it seems weird that we need such large timeouts in CI compared to local dev.

With Playwright we can get traces, where we can compare how long each step takes on local vs. CI. I'm not sure about more low-level profiling, without doing more dev-infra stuff on the CI agents.

cc @sourcegraph/developer-infrastructure

Hey, thanks for the ping. bazel-do jobs didn't previously upload the recorded profile, which is now fixed in #64148 .

I went ahead and fired up a job for you with your test target, you can explore the profile here: https://buildkite.com/sourcegraph/sourcegraph/builds/284928#019103c3-5f71-4aac-89ce-065a5b331d09

You'll need to to feed that to the chrome trace explorer or better, use https://ui.perfetto.dev/ if you want a fancier UI.

It's going to be pretty hairy, so if after a first look, you don't spot much, reach us out on #discuss-dev-infra to schedule a call to explore it together.

bahrmichael · 2024-07-18T13:13:00Z

-    async function openSidebar(page: Page): Promise<void> {
-        return page.getByLabel('Open sidebar').click()
-    }
-


I inlined this function so that the test output lets us find the line where it failed more easily.

bahrmichael · 2024-07-18T13:13:27Z

    })

-    test('error handling non-existing directory -> root', async ({ page, sg }) => {
+    test.skip('error handling non-existing directory -> root', async ({ page, sg }) => {


I skipped this test because it failed reliably when closing the sidebar. There's something broken that can't be fixed with increased timeouts.

camdencheek

Thanks!

keegancsmith · 2024-07-18T16:47:35Z

+    timeout: process.env.BAZEL ? 60_000 : 30_000,
+    expect: {
+        timeout: process.env.BAZEL ? 20_000 : 5_000,
+    },


cc @sourcegraph/developer-infrastructure

Test have been stable since https://github.com/sourcegraph/sourcegraph/pull/63910. See https://buildkite.com/organizations/sourcegraph/analytics/suites/sourcegraph-bazel/tests/e143a9fc-8857-83f0-8cfb-03e1c6f48f7b?branch=main ## Test plan CI ## Changelog

Follow-up to [this comment](https://github.com/sourcegraph/sourcegraph/pull/63910#discussion_r1683190713) where the need was raised for having profiles for further inspection of problematic targets when run in isolation. Basically, every bazel-do will now collect the profile, and it'll be uploaded as a job artifact. ## Test plan See https://buildkite.com/sourcegraph/sourcegraph/builds/284913 for a test run.  ## Changelog

bahrmichael added 6 commits July 18, 2024 10:50

chore: increase test timeout for "file popover"

aa327be

chore: raise timeout

a6488c9

!bazel test //client/web-sveltekit:e2e_test --runs_per_test=10

bd7fd50

chore: raise timeouts further

8a9b255

chore: inline openSideBar

6777d60

chore: skip consistently failing test

467cd7e

cla-bot Bot added the cla-signed label Jul 18, 2024

bahrmichael added 4 commits July 18, 2024 14:13

chore: reduce lenience

c5d3377

chore: skip test again

5bd28d8

Merge branch 'main' into bahrmichae/e2e-svelte-fixes

461e7a1

chore: linting

b77defb

bahrmichael commented Jul 18, 2024

View reviewed changes

bahrmichael marked this pull request as ready for review July 18, 2024 13:13

bahrmichael changed the title ~~chore: reduce flakiness of web-sveltekit e2e-tests~~ chore: reduce flakiness of web-sveltekit e2e-tests by increasing timeouts Jul 18, 2024

bahrmichael requested a review from a team July 18, 2024 13:13

chore: remove manual flag

010707c

bahrmichael changed the title ~~chore: reduce flakiness of web-sveltekit e2e-tests by increasing timeouts~~ chore: re-enable e2e tests for web-sveltekit Jul 18, 2024

camdencheek approved these changes Jul 18, 2024

View reviewed changes

keegancsmith approved these changes Jul 18, 2024

View reviewed changes

bahrmichael merged commit d5292ca into main Jul 18, 2024

bahrmichael deleted the bahrmichae/e2e-svelte-fixes branch July 18, 2024 17:31

bahrmichael added a commit that referenced this pull request Jul 22, 2024

chore: apply fixes from #63942, #63910 and #63879

b3cb1df

bahrmichael mentioned this pull request Jul 29, 2024

chore: remove flaky flag from svelte e2e_test #64123

Merged

jhchabran mentioned this pull request Jul 30, 2024

chore(ci): pass --profile to bazel-do jobs #64148

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: re-enable e2e tests for web-sveltekit#63910

chore: re-enable e2e tests for web-sveltekit#63910
bahrmichael merged 11 commits into
mainfrom
bahrmichae/e2e-svelte-fixes

bahrmichael commented Jul 18, 2024 •

edited

Loading

Uh oh!

bahrmichael Jul 18, 2024

Uh oh!

varungandhi-src Jul 18, 2024

Uh oh!

bahrmichael Jul 18, 2024

Uh oh!

keegancsmith Jul 18, 2024

Uh oh!

jhchabran Jul 30, 2024

Uh oh!

bahrmichael Jul 18, 2024

Uh oh!

bahrmichael Jul 18, 2024

Uh oh!

camdencheek left a comment

Uh oh!

keegancsmith Jul 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

bahrmichael commented Jul 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test plan

Changelog

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

camdencheek left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

bahrmichael commented Jul 18, 2024 •

edited

Loading