Skip to content

[CI] XPackRestIT test {p0=ml/3rd_party_deployment/Test start and stop multiple deployments} failing #128899

@elasticsearchmachine

Description

@elasticsearchmachine

Build Scans:

Reproduction Line:

./gradlew ":x-pack:plugin:yamlRestTest" --tests "org.elasticsearch.xpack.test.rest.XPackRestIT.test {p0=ml/3rd_party_deployment/Test start and stop multiple deployments}" -Dtests.seed=217B982CD6BD610C -Dtests.locale=gu-Gujr-IN -Dtests.timezone=America/Resolute -Druntime.java=23

Applicable branches:
8.17

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

java.lang.AssertionError: Failure at [ml/3rd_party_deployment:612]: expected [2xx] status code but api [ml.infer_trained_model] returned [408 Request Timeout] [---
error:
  root_cause:
  - type: "status_exception"
    reason: "timeout [10s] waiting for inference result"
    stack_trace: "org.elasticsearch.ElasticsearchStatusException: timeout [10s] waiting\
      \ for inference result\n\tat org.elasticsearch.ml@8.17.10-SNAPSHOT/org.elasticsearch.xpack.ml.inference.deployment.AbstractPyTorchAction.onTimeout(AbstractPyTorchAction.java:68)\n\
      \tat org.elasticsearch.server@8.17.10-SNAPSHOT/org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:956)\n\
      \tat java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144)\n\
      \tat java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642)\n\
      \tat java.base/java.lang.Thread.run(Thread.java:1575)\n"
  type: "status_e
[truncated]

Issue Reasons:

  • [8.17] 2 failures in test test {p0=ml/3rd_party_deployment/Test start and stop multiple deployments} (0.6% fail rate in 352 executions)
  • [8.17] 2 failures in pipeline elasticsearch-periodic-platform-support (16.7% fail rate in 12 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

Metadata

Metadata

Assignees

Labels

:mlMachine learning>test-failureTriaged test failures from CITeam:MLMeta label for the ML teamlow-riskAn open issue or test failure that is a low risk to future releases

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions