Only generate model uuid when logging model by WeichenXu123 · Pull Request #5167 · mlflow/mlflow

WeichenXu123 · 2021-12-15T06:11:37Z

Signed-off-by: Weichen Xu weichen.xu@databricks.com

What changes are proposed in this pull request?

Only generate model uuid when logging model.

Before:
When loading model from a old version mlflow model, a new model uuid will be generated. Loading multiple times will generate multiple different model uuid.

After:
When loading model from a old version mlflow model, the model uuid attribute will be None.

How is this patch tested?

Unit tests.

Does this PR change the documentation?

No. You can skip the rest of this section.
Yes. Make sure the changed pages / sections render correctly by following the steps below.

Check the status of the ci/circleci: build_doc check. If it's successful, proceed to the
next step, otherwise fix it.
Click Details on the right to open the job page of CircleCI.
Click the Artifacts tab.
Click docs/build/html/index.html.
Find the changed pages / sections and make sure they render correctly.

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

(Details in 1-2 sentences. You can just refer to another PR with a description if this PR is part of a larger change.)

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

harupy · 2021-12-15T06:22:57Z


    reloaded_model_config = Model.load(os.path.join(model_path, "MLmodel"))
    assert model_config.__dict__ == reloaded_model_config.__dict__
+    assert model_config.model_uuid is not None and _is_valid_uuid(model_config)


Suggested change

assert model_config.model_uuid is not None and _is_valid_uuid(model_config)

assert model_config.model_uuid is not None

assert _is_valid_uuid(model_config.model_uuid)

harupy · 2021-12-15T06:24:38Z

Can you add a test that model_uuid is None when we load Model from MLmodel file without model_uuid?

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

WeichenXu123 · 2021-12-15T07:51:43Z

+            model_uuid = uuid.uuid4().hex
+            mlflow_model = cls(artifact_path=artifact_path, run_id=run_id, model_uuid=model_uuid)


We should not set uuid here. Otherwise there're many other place we need also set uuid.
e.g.

mlflow/mlflow/pyfunc/__init__.py

Line 1072 in f9b5745

mlflow_model = Model()

mlflow/mlflow/catboost.py

Line 122 in f9b5745

mlflow_model = Model()

etc.

I move the set uuid code into Model constructor again.

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

dbczumar · 2021-12-15T08:34:22Z

        flavors=None,
        signature=None,  # ModelSignature
        saved_input_example_info: Dict[str, Any] = None,
-        model_uuid=None,


Instead of removing model_uuid from the constructor and modifying the attribute during load(), can we define a third type of value for model_uuid?

By default, model_uuid should be a function that generates a UUID, e.g. lambda: uuid.uuid4().hex

Model uuid could be None, indicating that the model has no ID

Model uuid could be a string, indicating that the model already has a UUID

In the constructor, we can check if the input is a function and, if it is, call it to generate an ID. Otherwise, set self.model_uuid = model_uuid.

sounds good

harupy · 2021-12-15T08:35:40Z

        self.signature = signature
        self.saved_input_example_info = saved_input_example_info
-        self.model_uuid = uuid.uuid4().hex if model_uuid is None else model_uuid
+        self.model_uuid = uuid.uuid4().hex


This basically means every Model instance has a different model_uuid. Is this really the desired behavior?

Every new constructed model has a different ID, this is desired.
We only need keep the rule that the model load back from saved model should load old ID back or set it None if saved model don't have ID

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

harupy · 2021-12-15T10:21:04Z

+        if callable(model_uuid):
+            self.model_uuid = model_uuid()
+        else:
+            self.model_uuid = model_uuid


Suggested change

if callable(model_uuid):

self.model_uuid = model_uuid()

else:

self.model_uuid = model_uuid

self.model_uuid = model_uuid() if callable(model_uuid) else model_uuid

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

dbczumar

LGTM! Thanks @WeichenXu123 !

init

e2dc085

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

WeichenXu123 mentioned this pull request Dec 15, 2021

Add model_uuid to Java Model #5165

Merged

29 tasks

github-actions Bot added rn/bug-fix Mention under Bug Fixes in Changelogs. area/artifacts Artifact stores and artifact logging labels Dec 15, 2021

add test

00db8ac

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

harupy reviewed Dec 15, 2021

View reviewed changes

update

1d279b6

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

WeichenXu123 commented Dec 15, 2021

View reviewed changes

update

c45d16d

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

dbczumar reviewed Dec 15, 2021

View reviewed changes

harupy reviewed Dec 15, 2021

View reviewed changes

update

fcba89d

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

harupy reviewed Dec 15, 2021

View reviewed changes

updates

2d4a3cb

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>

dbczumar approved these changes Dec 16, 2021

View reviewed changes

WeichenXu123 merged commit f9046f9 into mlflow:master Dec 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only generate model uuid when logging model#5167

Only generate model uuid when logging model#5167
WeichenXu123 merged 6 commits into
mlflow:masterfrom
WeichenXu123:fix-model-id

WeichenXu123 commented Dec 15, 2021 •

edited

Loading

Uh oh!

harupy Dec 15, 2021

Uh oh!

harupy commented Dec 15, 2021

Uh oh!

WeichenXu123 Dec 15, 2021

Uh oh!

dbczumar Dec 15, 2021

Uh oh!

WeichenXu123 Dec 15, 2021

Uh oh!

harupy Dec 15, 2021 •

edited

Loading

Uh oh!

WeichenXu123 Dec 15, 2021

Uh oh!

harupy Dec 15, 2021

Uh oh!

dbczumar left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	assert model_config.model_uuid is not None and _is_valid_uuid(model_config)
	assert model_config.model_uuid is not None
	assert _is_valid_uuid(model_config.model_uuid)

		model_uuid = uuid.uuid4().hex
		mlflow_model = cls(artifact_path=artifact_path, run_id=run_id, model_uuid=model_uuid)

Conversation

WeichenXu123 commented Dec 15, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes are proposed in this pull request?

How is this patch tested?

Does this PR change the documentation?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

Uh oh!

harupy Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

harupy commented Dec 15, 2021

Uh oh!

WeichenXu123 Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

dbczumar Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

WeichenXu123 Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

harupy Dec 15, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

WeichenXu123 Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

harupy Dec 15, 2021

Choose a reason for hiding this comment

Uh oh!

dbczumar left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

WeichenXu123 commented Dec 15, 2021 •

edited

Loading

harupy Dec 15, 2021 •

edited

Loading