cephadm: Manually remove containers #35018
jenkins test make check
This fixes:

```
Error: error creating container storage: the container name "ceph-<fsid>-mon.b" is already in use by "<container-id>". You have to remove that container to be able to reuse that name.: that name is already in use
```

Relates to https://tracker.ceph.com/issues/44990

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
179ff31 to b3fbc03
install_path = find_program('install')
f.write('{install_path} -d -m0770 -o {uid} -g {gid} /var/run/ceph/{fsid}\n'.format(install_path=install_path, fsid=fsid, uid=uid, gid=gid))

# Sometimes, adding `--rm` to a run_cmd doesn't work. Let's remove the container manually
Do we know why this sometimes doesn't work? It doesn't feel like we've identified the root cause here ..
Which is correct, yes. It looks like podman doesn't honor `--rm` if the container creation itself failed. This is a workaround for the broken `--rm` handling.
However, this doesn't explain why we're getting a `ceph-mon: no such file or directory`.
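A minimal sketch of the workaround described above, not the code from this PR: before starting the container, explicitly remove any leftover container with the same name, and ignore the failure when no such container exists. The function name `build_run_script`, its parameters, and the `|| true` idiom are assumptions chosen for illustration. Only `podman rm -f`, `podman run --rm` and `--name` are real CLI options.

```python
import shlex
from typing import List


def build_run_script(container_name: str, run_args: List[str],
                     engine: str = "podman") -> str:
    """Return a shell script that removes a stale container, then runs a new one.

    Hypothetical helper: it only builds the script text, it does not run it.
    """
    rm_cmd = [engine, "rm", "-f", container_name]
    run_cmd = [engine, "run", "--rm", "--name", container_name] + run_args
    lines = [
        "set -e",
        # The stale container usually does not exist, so a failing `rm`
        # must not abort the script.
        " ".join(shlex.quote(a) for a in rm_cmd) + " 2> /dev/null || true",
        # `--rm` stays in place; the explicit `rm` above covers the case
        # where an earlier failed creation left the name behind.
        " ".join(shlex.quote(a) for a in run_cmd),
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_run_script("ceph-<fsid>-mon.b", ["<image>", "ceph-mon"]))
```

Keeping `--rm` and adding the explicit removal means the script still cleans up after a normal exit, while a name left over from a failed creation no longer blocks the next start.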
@mgfritch @sebastian-philipp we also suffered from some issues in older releases of podman and fuse-overlay, but they were addressed in newer releases. see https://tracker.ceph.com/issues/44242, ceph/ceph-container@c44202c and ceph/ceph-container@6285bae. i think our CI (ceph-build and ceph-container) addressed them by installing the latest build of podman and its dependencies.
i am wondering if we should do the same in our testing harness as well?
hm. you're sure this is related?
related, sage closed #33367 a while ago without merging it.
i am not sure. i just want to rule out any issues related to podman and its dependencies. if i were sure, i would put the analysis here. but i am not familiar with podman or docker at the moment, so this is what i can do.
my idea is to just merge this one and see if it helps.
http://pulpito.ceph.com/swagner-2020-05-15_11:43:55-rados-wip-swagner-testing-2020-05-15-1102-distro-basic-smithi/ failed to verify this: no occurrence of

next opportunity: http://pulpito.ceph.com:80/swagner-2020-05-18_13:07:20-rados-wip-swagner2-testing-2020-05-18-1100-distro-basic-smithi/

sigh. no clue if this actually works or not.