Skip to content

P.S.A. Validation throughput #325114

@denelon

Description

@denelon

We've been experiencing throughput issues with our validation infrastructure.

We do have work in progress to directly help as well as longer term work to address the root cause. The initial work is moving to a GitHub App and implementing a queueing mechanism so we can throttle our traffic to reduce the number of failures based on timeouts.

Longer term, we're working with our infrastructure partner on increasing our overall capacity as well as the compute resources we run validation against.

The Azure DevOps Analytics view can provide some detail on the current success/failure rates and typical durations for how long validation takes to complete.

We do have automated retry enabled, but we're still seeing higher failure rates than normal. The most common result is Internal-Error-Dynamic-Scan The test for Dynamic Scanning in the Installation Validation failed. which is often indicative of a timeout. We're processing manual validations where appropriate, but those take additional time to complete and may result in icons not being captured for those packages.

We're also tracking some other known classes of failures:

The current ETA for both items to be completed and deployed is February. Both of the work items are large in scope. We're going to include several "quality of experience" upgrades including exposing the validation failure reasons in the PR body.

Sub-issues

Metadata

Metadata

Assignees

Type

Projects

Status

In Progress

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions