python-common: fix ServiceSpec validation#35493

Merged
sebastian-philipp merged 3 commits into ceph:master from mgfritch:orch-service-spec-validate
Jun 17, 2020

Conversation


@mgfritch mgfritch commented Jun 9, 2020

Fixes various issues around validation of a ServiceSpec during orch apply.

Signed-off-by: Michael Fritch mfritch@suse.com

Checklist

  • References tracker ticket
  • Updates documentation if necessary
  • Includes tests for new functionality or reproducer for bug


mgfritch added 3 commits June 8, 2020 18:09

  • the ServiceSpec needs to be validated during `orch apply`, but not during `orch daemon add`
    Signed-off-by: Michael Fritch <mfritch@suse.com>
  • Signed-off-by: Michael Fritch <mfritch@suse.com>
  • the service_id needs to be validated during `orch apply`, but not during `orch daemon add`
    Signed-off-by: Michael Fritch <mfritch@suse.com>
@mgfritch mgfritch requested a review from a team as a code owner June 9, 2020 00:18
Comment on lines +474 to +476
if self.service_type in ['mds', 'rgw', 'nfs', 'iscsi'] and not self.service_id:
raise ServiceSpecValidationError('Cannot add Service: id required')
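
As a standalone sketch of the rule quoted above (illustrative names, not the actual ceph.deployment.service_spec module), the check behaves like this:

```python
# Standalone sketch mirroring the validation rule above; the class and
# function here are illustrative, not the real ceph module.
class ServiceSpecValidationError(Exception):
    pass

def validate(service_type, service_id):
    # mds/rgw/nfs/iscsi specs are meaningless without a service_id
    # (e.g. the filesystem, realm/zone, or export name), so `orch apply`
    # should reject them early.
    if service_type in ['mds', 'rgw', 'nfs', 'iscsi'] and not service_id:
        raise ServiceSpecValidationError('Cannot add Service: id required')

validate('mon', None)       # ok: mon needs no id
validate('rgw', 'myrealm')  # ok: id supplied
try:
    validate('rgw', None)
except ServiceSpecValidationError as e:
    print(e)  # -> Cannot add Service: id required
```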

Contributor:

Is there a chance that there are existing specs in the store that violate this validation?

Contributor:

nice, this is what I meant to do as well.

Is there a chance that there are existing specs in the store that violate this validation?

There probably are. Is the validate method being run on already existing specs, though? If not, this would only raise for new or exported (and re-imported) specs.

Contributor:

nice, this is what I meant to do as well.

Is there a chance that there are existing specs in the store that violate this validation?

There probably are. Is the validate method being run on already existing specs, though?

right: __init__ doesn't call validate():

def __init__(self,
             service_type: str,
             service_id: Optional[str] = None,
             placement: Optional[PlacementSpec] = None,
             count: Optional[int] = None,
             unmanaged: bool = False,
             ):
    self.placement = PlacementSpec() if placement is None else placement  # type: PlacementSpec
    assert service_type in ServiceSpec.KNOWN_SERVICE_TYPES, service_type
    self.service_type = service_type
    self.service_id = service_id
    self.unmanaged = unmanaged

and https://github.com/ceph/ceph/blob/master/src/pybind/mgr/cephadm/inventory.py doesn't call .validate() either.

should we add it? If yes, how do we handle specs that fail the validation?

If not, this would only raise for new or exported(and re-imported) specs.
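
One hypothetical way to wire this in (a sketch under assumed names, not the actual ceph code): validate eagerly only for newly applied specs, and tolerate specs already in the store when loading, so `orch ls` keeps working after an upgrade:

```python
# Hypothetical sketch: run validate() only on the `orch apply` path
# (strict=True), and tolerate already-stored specs when (re)loading.
# The `strict` flag is an assumption for illustration, not the real API.
from typing import Optional

class ServiceSpecValidationError(Exception):
    pass

class ServiceSpec:
    def __init__(self, service_type: str, service_id: Optional[str] = None):
        self.service_type = service_type
        self.service_id = service_id

    def validate(self):
        if self.service_type in ['mds', 'rgw', 'nfs', 'iscsi'] \
                and not self.service_id:
            raise ServiceSpecValidationError('Cannot add Service: id required')

    @classmethod
    def from_json(cls, data: dict, strict: bool = False) -> 'ServiceSpec':
        spec = cls(data['service_type'], data.get('service_id'))
        if strict:
            # new specs fail fast on apply; stored specs load untouched
            spec.validate()
        return spec

# a pre-existing stored spec without an id still loads without raising
old = ServiceSpec.from_json({'service_type': 'rgw'})
```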

Contributor:

should we add it? If yes, how do we handle specs that fail the validation?

we should probably only do that if we have a clear path for communicating these kinds of issues: some way of indicating that an already applied spec is no longer valid.

otoh, we probably shouldn't invalidate existing specs to begin with. maybe adding an internal tracking id for already existing specs isn't the worst move.

Contributor:

which is not as bad as in load(). I think #35456 plus some HEALTH_WARN integration would be enough here.

Contributor (author):

Is there a chance that there are existing specs in the store that violate this validation?

If so, they would likely have broken the `orch ls` etc. commands...

which is not as bad as in load(). I think #35456 plus some HEALTH_WARN integration would be enough here.

This actually makes a lot of sense in the general case where the cluster state could change and we might want to (re)validate before another placement...

Contributor:

so I think we're good, as long as we provide a way for users to remove that service after they did an upgrade.


sebastian-philipp commented Jun 10, 2020

QA run failed due to #35474 (comment). Not 100% sure whether this failure is unrelated to this PR.

@sebastian-philipp sebastian-philipp added wip-swagner-testing My Teuthology tests and removed wip-swagner-testing My Teuthology tests labels Jun 10, 2020
@sebastian-philipp

@callithea is this ceph dashboard backend API tests failure related to this PR?

@sebastian-philipp

jenkins test dashboard backend

@sebastian-philipp sebastian-philipp merged commit 57734cf into ceph:master Jun 17, 2020
@mgfritch mgfritch deleted the orch-service-spec-validate branch June 18, 2020 16:15