cleanup: transition from stopping to exited#15090
cleanup: transition from stopping to exited#15090openshift-merge-robot merged 1 commit intocontainers:mainfrom
stopping to exited#15090Conversation
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: vrothberg The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
Thank you! |
|
This looks... Questionable. Shouldn't the |
The analysis or the solution?
Currently, it doesn't as it's checking for ... if c.ensureState(define.ContainerStateCreated, define.ContainerStateRunning, define.ContainerStateStopped, define.ContainerStatePaused) {I considered that as well but am totally fine changing. |
|
The solution - it seems like we're papering over the fact that |
Sounds good to me. I'll update the PR. |
Allow the cleanup process (and others) to transition the container from `stopping` to `exited`. This fixes a race condition detected in containers#14859 where the cleanup process kicks in _before_ the stopping process can read the exit file. Prior to this fix, the cleanup process left the container in the `stopping` state and removed the conmon files, such that the stopping process also left the container in this state as it could not read the exit files. Hence, `podman wait` timed out (see the 23 seconds execution time of the test [1]) due to the unexpected/invalid state and the test failed. Further turn the warning during stop to a debug message since it's a natural race due to the daemonless/concurrent architecture and nothing to worry about. [NO NEW TESTS NEEDED] since we can only monitor if containers#14859 continues flaking or not. [1] https://storage.googleapis.com/cirrus-ci-6707778565701632-fcae48/artifacts/containers/podman/6210434704343040/html/sys-remote-fedora-36-rootless-host.log.html#t--00205 Fixes: containers#14859 Signed-off-by: Valentin Rothberg <vrothberg@redhat.com>
|
LGTM |
|
@containers/podman-maintainers PTAL |
|
/lgtm |
|
/hold cancel |
Allow the cleanup process (and others) to transition the container from
stoppingtoexited. This fixes a race condition detected in #14859where the cleanup process kicks in before the stopping process can
read the exit file. Prior to this fix, the cleanup process left the
container in the
stoppingstate and removed the conmon files, suchthat the stopping process also left the container in this state as it
could not read the exit files. Hence,
podman waittimed out (see the23 seconds execution time of the test [1]) due to the unexpected/invalid
state and the test failed.
Further turn the warning during stop to a debug message since it's a
natural race due to the daemonless/concurrent architecture and nothing
to worry about.
[NO NEW TESTS NEEDED] since we can only monitor if #14859 continues
flaking or not.
[1] https://storage.googleapis.com/cirrus-ci-6707778565701632-fcae48/artifacts/containers/podman/6210434704343040/html/sys-remote-fedora-36-rootless-host.log.html#t--00205
Fixes: #14859
Signed-off-by: Valentin Rothberg vrothberg@redhat.com
Does this PR introduce a user-facing change?
No release note as it's fixing an unreleased bug.
@mheon @edsantiago PTAL