Skip to content

DatafeedJobsIT.testRealtime_multipleStopCalls failure on CI because task doesn't exist #45518

@AthenaEryma

Description

@AthenaEryma

Hit this on CI in an SLM PR that doesn't touch anything related to ML or tasks. Per build stats, this test has failed 7 times in the last 60 days, but only this time was this specific failure.

Build scan
Public Jenkins build

Reproduce line (does not reproduce locally):

./gradlew :x-pack:plugin:ml:qa:native-multi-node-tests:integTestRunner --tests "org.elasticsearch.xpack.ml.integration.DatafeedJobsIT.testRealtime_multipleStopCalls" -Dtests.seed=6D580ED560756B6A -Dtests.security.manager=true -Dtests.locale=es -Dtests.timezone=America/Indiana/Tell_City -Dcompiler.java=12 -Druntime.java=11

Stack trace:

com.carrotsearch.randomizedtesting.UncaughtExceptionError
: 
Captured an uncaught exception in thread: Thread[id=239, name=Thread-9, state=RUNNABLE, group=TGRP-DatafeedJobsIT]
Caused by: 
org.elasticsearch.ResourceNotFoundException
: 
the task with id datafeed-realtime-job-multiple-stop-datafeed and allocation id 16 doesn't exist
Close stacktrace
at __randomizedtesting.SeedInfo.seed([6D580ED560756B6A]:0)
at org.elasticsearch.persistent.PersistentTasksClusterService$4.execute(PersistentTasksClusterService.java:234)
at org.elasticsearch.cluster.ClusterStateUpdateTask.execute(ClusterStateUpdateTask.java:47)
at org.elasticsearch.cluster.service.MasterService.executeTasks(MasterService.java:697)
at org.elasticsearch.cluster.service.MasterService.calculateTaskOutputs(MasterService.java:319)
at org.elasticsearch.cluster.service.MasterService.runTasks(MasterService.java:214)
at org.elasticsearch.cluster.service.MasterService$Batcher.run(MasterService.java:151)
at org.elasticsearch.cluster.service.TaskBatcher.runIfNotProcessed(TaskBatcher.java:150)
at org.elasticsearch.cluster.service.TaskBatcher$BatchedTask.run(TaskBatcher.java:188)
at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:699)
at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.runAndClean(PrioritizedEsThreadPoolExecutor.java:252)
at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedEsThreadPoolExecutor.java:215)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
at java.lang.Thread.run(Thread.java:834)

Metadata

Metadata

Labels

:mlMachine learning>test-failureTriaged test failures from CI

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions