
qa/tests: try using cephadm#35622

Closed
yuriw wants to merge 3 commits into ceph:master from yuriw:wip-yuriw-rados-master

@yuriw
Contributor

@yuriw yuriw commented Jun 17, 2020

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>

Checklist

  • References tracker ticket
  • Updates documentation if necessary
  • Includes tests for new functionality or reproducer for bug

Show available Jenkins commands
  • jenkins retest this please
  • jenkins test classic perf
  • jenkins test crimson perf
  • jenkins test signed
  • jenkins test make check
  • jenkins test make check arm64
  • jenkins test submodules
  • jenkins test dashboard
  • jenkins test dashboard backend
  • jenkins test docs
  • jenkins render docs
  • jenkins test ceph-volume all
  • jenkins test ceph-volume tox

@yuriw yuriw requested a review from jdurgin June 17, 2020 17:22
@yuriw
Contributor Author

yuriw commented Jun 17, 2020

@sebastian-philipp
Contributor

2020-06-17T17:50:38.789 INFO:teuthology.orchestra.run.smithi117.stderr:Error ENOENT: ('name %s already in use', 'b')
debug 2020-06-17T17:50:38.782+0000 7f8276554700 -1 log_channel(cephadm) log [ERR] : _Promise failed
Traceback (most recent call last):
  File "/usr/share/ceph/mgr/orchestrator/_interface.py", line 275, in _finalize
    next_result = self._on_complete(self._value)
  File "/usr/share/ceph/mgr/cephadm/module.py", line 131, in <lambda>
    return CephadmCompletion(on_complete=lambda _: f(*args, **kwargs))
  File "/usr/share/ceph/mgr/cephadm/module.py", line 2004, in add_mon
    return self._add_daemon('mon', spec, self.mon_service.create)
  File "/usr/share/ceph/mgr/cephadm/module.py", line 1957, in _add_daemon
    create_func, config_func)
  File "/usr/share/ceph/mgr/cephadm/module.py", line 1973, in _create_daemons
    forcename=name)
  File "/usr/share/ceph/mgr/cephadm/module.py", line 570, in get_unique_name
    raise orchestrator.OrchestratorValidationError('name %s already in use', forcename)
orchestrator._interface.OrchestratorValidationError: ('name %s already in use', 'b')
debug 2020-06-17T17:50:38.782+0000 7f8276554700 -1 mgr.server reply reply (2) No such file or directory ('name %s already in use', 'b')
2020-06-17T17:50:39.397 DEBUG:teuthology.orchestra.run:got remote process result: 2
2020-06-17T17:50:39.398 ERROR:teuthology.contextutil:Saw exception from nested tasks
Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/contextutil.py", line 32, in nested
    vars.append(enter())
  File "/usr/lib/python3.6/contextlib.py", line 81, in __enter__
    return next(self.gen)
  File "/home/teuthworker/src/github.com_yuriw_ceph_wip-yuriw-rados-master/qa/tasks/cephadm.py", line 502, in ceph_mons
    remote.shortname + ':' + ctx.ceph[cluster_name].mons[mon] + '=' + id_,
  File "/home/teuthworker/src/github.com_yuriw_ceph_wip-yuriw-rados-master/qa/tasks/cephadm.py", line 47, in _shell
    **kwargs
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/remote.py", line 206, in run
    r = self._runner(client=self.ssh, name=self.shortname, **kwargs)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/run.py", line 475, in run
    r.wait()
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/run.py", line 162, in wait
    self._raise_for_status()
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/run.py", line 184, in _raise_for_status
    node=self.hostname, label=self.label
teuthology.exceptions.CommandFailedError: Command failed on smithi117 with status 2: 'sudo /home/ubuntu/cephtest/cephadm --image quay.ceph.io/ceph-ci/ceph:9df7e03345a31351fa0be5602855e9fdcda5e993 shell -c /etc/ceph/ceph.conf -k /etc/ceph/ceph.client.admin.keyrin

One idea would be to use the qa/suites/rados/cephadm/smoke/fixed-2.yaml cluster.
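
A side note on the log above: the message `('name %s already in use', 'b')` is consistent with the exception being built from a format string and its argument passed as two separate positional arguments, as the `raise` line in the traceback shows. A minimal sketch of the effect in plain Python follows; `ValidationError` is a hypothetical stand-in, not the orchestrator class itself.

```python
class ValidationError(Exception):
    """Hypothetical stand-in for an orchestrator validation error."""


def unformatted(name):
    # The argument is passed separately, so '%s' is never interpolated;
    # str(exc) then renders the whole args tuple.
    return ValidationError('name %s already in use', name)


def formatted(name):
    # Interpolating first yields a single, readable message.
    return ValidationError('name %s already in use' % name)


print(str(unformatted('b')))  # ('name %s already in use', 'b')
print(str(formatted('b')))    # name b already in use
```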

@sebastian-philipp
Contributor

@yuriw any update on this?

@yuriw
Contributor Author

yuriw commented Jun 24, 2020

@sebastian-philipp not yet; @JoshD was reviewing

Contributor

@tchaikov tchaikov left a comment

could you please remove the merge commit from this PR?

@yuriw yuriw force-pushed the wip-yuriw-rados-master branch from f2462d0 to e02cf17 Compare June 29, 2020 16:01
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
@yuriw yuriw force-pushed the wip-yuriw-rados-master branch from e02cf17 to e1dd7ae Compare July 1, 2020 21:50
This sets up the prerequisites for many tests.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
@yuriw yuriw requested a review from a team as a code owner July 1, 2020 21:53
@yuriw
Contributor Author

yuriw commented Jul 1, 2020

Bootstrapping with roleless mode would otherwise potentially use a
different host than first_mon/first_mgr, and when the daemons are
later deployed on all hosts, skipping first_mon/first_mgr, we'd be
trying to create the same mon/mgr twice, e.g. mon.smithi028 and
failing because it already existed.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
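
The failure mode described in the commit message above can be sketched as follows. Everything here (`deploy_mons`, its parameters, the host list) is hypothetical and only models the logic; it is not the code in `qa/tasks/cephadm.py`.

```python
def deploy_mons(hosts, bootstrap_mon, first_mon):
    """Model which mon names get created (hypothetical helper).

    Bootstrap creates `bootstrap_mon`; the later pass creates one mon per
    host, skipping only `first_mon`. Raises if a name is requested twice.
    """
    created = [bootstrap_mon]
    for host in hosts:
        if host == first_mon:
            continue  # assumed to exist already from bootstrap
        if host in created:
            raise ValueError('name %s already in use' % host)
        created.append(host)
    return created


hosts = ['smithi028', 'smithi117']

# Bootstrap pinned to first_mon: every mon is created exactly once.
print(deploy_mons(hosts, bootstrap_mon='smithi117', first_mon='smithi117'))

# Bootstrap on a different host than first_mon: duplicate creation.
try:
    deploy_mons(hosts, bootstrap_mon='smithi028', first_mon='smithi117')
except ValueError as exc:
    print(exc)  # name smithi028 already in use
```

Pinning the bootstrap mon to `first_mon` is what makes the "skip `first_mon`" step in the later pass correct in this model.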
@jdurgin jdurgin requested a review from sebastian-philipp July 2, 2020 01:39
@yuriw
Contributor Author

yuriw commented Jul 2, 2020

'/etc/ceph/{}.client.admin.keyring'.format(cluster_name),
'--output-pub-ssh-key', '{}/{}.pub'.format(testdir, cluster_name),
'--mon-id', first_mon,
'--mgr-id', first_mgr,
Contributor

2020-07-11T05:43:03.562 INFO:tasks.cephadm:Teardown complete
2020-07-11T05:43:03.562 ERROR:teuthology.run_tasks:Saw exception from tasks.
Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-kefu-testing-2020-07-11-1109/qa/tasks/cephadm.py", line 368, in ceph_bootstrap
    '--mgr-id', first_mgr,
UnboundLocalError: local variable 'first_mgr' referenced before assignment

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/run_tasks.py", line 93, in run_tasks
    manager.__enter__()
  File "/usr/lib/python3.6/contextlib.py", line 81, in __enter__
    return next(self.gen)
  File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-kefu-testing-2020-07-11-1109/qa/tasks/cephadm.py", line 1217, in task
    lambda: cephfs_setup(ctx=ctx, config=config),
  File "/usr/lib/python3.6/contextlib.py", line 81, in __enter__
    return next(self.gen)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/contextutil.py", line 55, in nested
    raise exc[1]
  File "/usr/lib/python3.6/contextlib.py", line 99, in __exit__
    self.gen.throw(type, value, traceback)
  File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-kefu-testing-2020-07-11-1109/qa/tasks/cephadm.py", line 1095, in initialize_config
    yield
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/contextutil.py", line 31, in nested
    vars.append(enter())
  File "/usr/lib/python3.6/contextlib.py", line 81, in __enter__
    return next(self.gen)
  File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-kefu-testing-2020-07-11-1109/qa/tasks/cephadm.py", line 471, in ceph_bootstrap
    ctx.daemons.get_daemon(type_, id_, cluster).stop()
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/daemon/cephadmunit.py", line 149, in stop
    self.remote.sh(self.stop_cmd)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/remote.py", line 248, in sh
    proc = self.run(**kwargs)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/remote.py", line 204, in run
    r = self._runner(client=self.ssh, name=self.shortname, **kwargs)

https://pulpito.ceph.com/kchai-2020-07-11_05:07:55-rados-wip-kefu-testing-2020-07-11-1109-distro-basic-smithi/
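
An `UnboundLocalError` like the one in the first traceback above typically arises when a local name is assigned on only some code paths but read on all of them. A minimal reproduction, assuming a hypothetical `build_args` helper and role list that are not taken from `qa/tasks/cephadm.py`:

```python
def build_args(roles):
    """Return id arguments; buggy when no 'mgr.*' role is present."""
    for role in roles:
        if role.startswith('mon.'):
            first_mon = role[len('mon.'):]
        elif role.startswith('mgr.'):
            first_mgr = role[len('mgr.'):]
    # first_mgr is unbound here if the loop never saw a 'mgr.*' role.
    return ['--mon-id', first_mon, '--mgr-id', first_mgr]


print(build_args(['mon.a', 'mgr.x']))  # ['--mon-id', 'a', '--mgr-id', 'x']

try:
    build_args(['mon.a', 'osd.0'])
except UnboundLocalError as exc:
    print(type(exc).__name__)  # UnboundLocalError
```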

Contributor

hm. --mon-id and --mgr-id only make sense in roleless mode. We're leaving the roleless mode when we require the --mon-id param.

I'd say either we have to use the non-roleless mode or we have to fix the tests.
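
One possible way to keep both modes working is to pass the id flags only when the job actually defines them. This is a sketch under that assumption; `bootstrap_id_args` is a hypothetical helper and not part of this PR.

```python
from typing import List, Optional


def bootstrap_id_args(first_mon: Optional[str],
                      first_mgr: Optional[str]) -> List[str]:
    """Build the optional id flags, omitting any that are undefined."""
    args: List[str] = []
    if first_mon is not None:
        args += ['--mon-id', first_mon]
    if first_mgr is not None:
        args += ['--mgr-id', first_mgr]
    return args


print(bootstrap_id_args('a', 'x'))    # ['--mon-id', 'a', '--mgr-id', 'x']
print(bootstrap_id_args(None, None))  # []
```

Initialising both ids to `None` before any role parsing would also avoid reading an unassigned local.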

Contributor

ok this change doesn't really make sense. can you revert this mon-ip thingy, please?

@stale

stale bot commented Sep 12, 2020

This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days.
If you are a maintainer or core committer, please follow-up on this pull request to identify what steps should be taken by the author to move this proposed change forward.
If you are the author of this pull request, thank you for your proposed contribution. If you believe this change is still appropriate, please ensure that any feedback has been addressed and ask for a code review.

@stale stale bot added the stale label Sep 12, 2020
@sebastian-philipp
Contributor

unstale.

@stale stale bot removed the stale label Sep 12, 2020
@stale

stale bot commented Nov 15, 2020

This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days.
If you are a maintainer or core committer, please follow-up on this pull request to identify what steps should be taken by the author to move this proposed change forward.
If you are the author of this pull request, thank you for your proposed contribution. If you believe this change is still appropriate, please ensure that any feedback has been addressed and ask for a code review.

@stale stale bot added the stale label Nov 15, 2020
@yuriw yuriw added the DNM label Dec 14, 2020
@stale stale bot removed the stale label Dec 14, 2020
@yuriw yuriw closed this Dec 14, 2020