[Proxied artifact operations] Implement REST API endpoints and artifact repository by harupy · Pull Request #4946 · mlflow/mlflow

harupy · 2021-10-26T07:34:54Z

Signed-off-by: harupy 17039389+harupy@users.noreply.github.com

What changes are proposed in this pull request?

Implement REST API endpoints + artifact repository.

How is this patch tested?

Unit tests

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

(Details in 1-2 sentences. You can just refer to another PR with a description if this PR is part of a larger change.)

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

harupy · 2021-10-28T01:35:05Z

+        return f.read()
+
+
+def test_log_artifact(tmpdir):


This is a minimal integration test to make sure an aritafct can be logged using the mlflow artifacts service.

dbczumar

@harupy This looks like an excellent start! Awesome stuff!

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: harupy <hkawamura0130@gmail.com>

dbczumar · 2021-11-04T21:21:02Z

+This example requires a wheel for `mlflow` to be stored in `dist`.
+
+```sh
+# Clean up existing wheels
+rm dist/*
+
+# Build a wheel for the development version of mlflow
+pip wheel --no-deps --wheel-dir dist ../..
+
+# Build a wheel for the latest version of mlflow on PyPI
+pip wheel --no-deps --wheel-dir dist mlflow
+```


Why are these steps required? If there's a way to avoid them, that would be nice in order to reduce complexity for users.

This is mainly for development purposes. We can simplify the workflow by preparing two Docker files (Dockerfile and Dockerfile.dev) and switching them with variable substitution.

docker-compose.yml would look like:

# docker-compose.yml tracking-server: build: context: . # Defaults to `Dockerfile` if `DOCKERFILE` environment variable doesn't exist. dockerfile: "${DOCKERFILE:-Dockerfile}" depends_on: - postgres - artifacts-server

README.md would look like:

This directory contains a set of files for demonstrating the MLflow Artifacts service. ## Run the example ```sh # Build services docker-compose build # Launch tracking and artifacts servers docker-compose up -d # Run `run.py` that uploads, downloads, and list artifacts docker-compose run -v ${PWD}/run.py:/app/run.py client python run.py ``` ## Explore the logging results: ```sh # Make sure both tracking and artifacts servers are running docker-compose ps ``` - MLflow UI is available at http://localhost:5000 to explore tracking results. - MinIO Console is available at http://localhost:9001 to explore logged artifacts. The login username and password are `user` and `password`. ## Reset tracking and artifacts servers ``` docker-compose down --volumes --remove-orphans ``` 👇👇👇 ## Development ``` pip wheel --no-deps --wheel-dir dist ../.. DOCKERFILE=Dockerfile.dev docker-compose build docker-compose run -v ${PWD}/run.py:/app/run.py client python run.py ```

What do you think about this approach?

Dockerfile and Dockerfile.dev would look like this.

Dockerfile:

FROM python:3.6 WORKDIR /app RUN pip install psycopg2 boto3 RUN pip install mlflow

Dockerfile.dev:

FROM python:3.6 WORKDIR /app RUN pip install psycopg2 boto3 RUN pip install mlflow COPY dist ./dist RUN pip install dist/mlflow-*.whl

dbczumar · 2021-11-04T21:23:17Z

+    metavar="URI",
+    default="./mlartifacts",
+    help=(
+        "The base artifact location from which to resolve artifact upload/download/list requests "


Love it! Can we also clarify that this only applies when the tracking server is configured to stream artifacts and the experiment artifact root is http or mlflow-artifacts?

In general, we should figure out how to tell users about the http and mlflow-artifacts URIs for --default-artifact-root as well.

Love it! Can we also clarify that this only applies when the tracking server is configured to stream artifacts and the experiment artifact root is http or mlflow-artifacts?

Sure!

In general, we should figure out how to tell users about the http and mlflow-artifacts URIs for --default-artifact-root as well.

Can we new sections for HTTP and mlflow-artifacts schemes in https://www.mlflow.org/docs/latest/tracking.html#artifact-stores?

In general, we should figure out how to tell users about the http and mlflow-artifacts URIs for --default-artifact-root as well.

That's a good idea. We should also make it easy for users to learn about these through the mlflow server CLI in a follow-up PR.

dbczumar · 2021-11-04T21:29:09Z

+        with open(tmp_path, "wb") as f:
+            chunk_size = 1024 * 1024  # 1 MB
+            while True:
+                chunk = request.stream.read(chunk_size)


Is there any timeout on the stream read operation? Else, we may get stuck here forever.

request.stream is an instance of gunicorn.http.body.Body.

gunicorn.http.body.Body.read doesn't seem to have an option related to timeout:

Source: https://github.com/benoitc/gunicorn/blob/ff58e0c6da83d5520916bc4cc109a529258d76e1/gunicorn/http/body.py#L202

def read(self, size=None): size = self.getsize(size) ...

Got it. This probably isn't too big a deal because we set gunicorn worker timeouts. Worst case, we may hang here until the worker times out, which is fine :).

dbczumar · 2021-11-04T21:53:50Z

+
+    def _download_file(self, remote_file_path, local_path):
+        url = posixpath.join(self.artifact_uri, remote_file_path)
+        with self._session.get(url, stream=True) as resp:


Is there a timeout we should specify here? Else, we may get stuck here

~~Yes, what about 10 minutes?~~. I'm checking the behavior of timeout when stream=True.

Found a related discussion:

psf/requests#1803 (comment)

Can we set timeout=10 (seconds)?

dbczumar · 2021-11-04T21:55:10Z

@@ -0,0 +1,80 @@
+syntax = "proto2";


Can we add a docstring to this proto file explaining which use cases it serves?

dbczumar

@harupy Looks great! Happy to approve once remaining comments are addressed and tests pass.

dbczumar · 2021-11-04T22:03:57Z

@@ -0,0 +1,45 @@
+This directory contains a set of files for demonstrating the MLflow Artifacts service.


Can we explain what the artifacts service does and provide a small example command that users could run manually to serve artifacts via mlflow server in the README?

Updated the README!

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

Signed-off-by: harupy <hkawamura0130@gmail.com>

dbczumar

LGTM again, thanks @harupy ! Really awesome!

dbczumar · 2021-11-08T21:00:34Z

+        with open(tmp_path, "wb") as f:
+            chunk_size = 1024 * 1024  # 1 MB
+            while True:
+                chunk = request.stream.read(chunk_size)


Got it. This probably isn't too big a deal because we set gunicorn worker timeouts. Worst case, we may hang here until the worker times out, which is fine :).

harupy · 2021-11-08T23:29:57Z

@dbczumar Thanks for reviewing this PR!

harupy changed the title ~~[Proxied artifact operations] Implement REST API endpoints + artifact repository~~ [WIP][Proxied artifact operations] Implement REST API endpoints + artifact repository Oct 26, 2021

github-actions Bot added the rn/feature Mention under Features in Changelogs. label Oct 26, 2021

harupy changed the title ~~[WIP][Proxied artifact operations] Implement REST API endpoints + artifact repository~~ [WIP][Proxied artifact operations] Implement REST API endpoints and artifact repository Oct 26, 2021

harupy commented Oct 28, 2021

View reviewed changes

Comment thread mlflow/protos/mlflow_artifacts.proto

harupy commented Oct 28, 2021

View reviewed changes

Comment thread mlflow/protos/mlflow_artifacts.proto

harupy commented Oct 28, 2021

View reviewed changes

Comment thread mlflow/server/handlers.py

dbczumar reviewed Oct 28, 2021

View reviewed changes

Comment thread mlflow/protos/mlflow_artifacts.proto Outdated

dbczumar reviewed Oct 28, 2021

View reviewed changes

harupy force-pushed the proxied-artifact-opertions-endpoints branch from d138f38 to d32c6a4 Compare October 29, 2021 15:48

harupy changed the title ~~[WIP][Proxied artifact operations] Implement REST API endpoints and artifact repository~~ [Proxied artifact operations] Implement REST API endpoints and artifact repository Oct 29, 2021

harupy requested a review from dbczumar November 1, 2021 00:03

harupy commented Nov 1, 2021

View reviewed changes

Comment thread examples/mlflow_artifacts/README.md Outdated

harupy and others added 17 commits November 1, 2021 22:50

initial commit

be7de3e

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

singular

5475bc0

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

fix response

c1762f7

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

add converter

3624797

Signed-off-by: harupy <hkawamura0130@gmail.com>

wip

265b9b6

Signed-off-by: harupy <hkawamura0130@gmail.com>

remove pagination token

5f79e76

Signed-off-by: harupy <hkawamura0130@gmail.com>

add test_http_artifact_repo.py

fcffde5

Signed-off-by: harupy <hkawamura0130@gmail.com>

wip

6034b94

Signed-off-by: harupy <hkawamura0130@gmail.com>

enhance tests

86843f8

Signed-off-by: harupy <hkawamura0130@gmail.com>

lint

440ba51

Signed-off-by: harupy <hkawamura0130@gmail.com>

proto

782f91f

Signed-off-by: harupy <hkawamura0130@gmail.com>

clean up

ab3117a

Signed-off-by: harupy <hkawamura0130@gmail.com>

Add end2end example

4c4d804

Signed-off-by: harupy <hkawamura0130@gmail.com>

rename import

ae9c5cb

Signed-off-by: harupy <hkawamura0130@gmail.com>

run mlflow artifacts example

ea8dfaa

Signed-off-by: harupy <hkawamura0130@gmail.com>

fix example

f7952e2

Signed-off-by: harupy <hkawamura0130@gmail.com>

run step

fe371a2

Signed-off-by: harupy <hkawamura0130@gmail.com>

dbczumar reviewed Nov 4, 2021

View reviewed changes

workaround for windows

23c6d11

Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

harupy commented Nov 5, 2021

View reviewed changes

Comment thread mlflow/store/artifact/http_artifact_repo.py Outdated

harupy added 8 commits November 5, 2021 14:15

set timeout

418778f

Signed-off-by: harupy <hkawamura0130@gmail.com>

try sh

192a1a5

Signed-off-by: harupy <hkawamura0130@gmail.com>

fix assert_called_with

0917687

Signed-off-by: harupy <hkawamura0130@gmail.com>

skip test_mlflow_artifacts_example on windows

15c201f

Signed-off-by: harupy <hkawamura0130@gmail.com>

add comment on proto file

6216b36

Signed-off-by: harupy <hkawamura0130@gmail.com>

remove datetime

68431a5

Signed-off-by: harupy <hkawamura0130@gmail.com>

update example

7ae7156

Signed-off-by: harupy <hkawamura0130@gmail.com>

add a few more comments

626ac67

Signed-off-by: harupy <hkawamura0130@gmail.com>

harupy commented Nov 5, 2021

View reviewed changes

Comment thread examples/mlflow_artifacts/README.md

harupy added 5 commits November 5, 2021 23:32

update example

ff67b4f

Signed-off-by: harupy <hkawamura0130@gmail.com>

improve artifacts-destination description

a234ec2

Signed-off-by: harupy <hkawamura0130@gmail.com>

fix

3328f1d

Signed-off-by: harupy <hkawamura0130@gmail.com>

add link

970687a

Signed-off-by: harupy <hkawamura0130@gmail.com>

Fix wording

17f9f9e

Signed-off-by: harupy <hkawamura0130@gmail.com>

harupy requested a review from dbczumar November 6, 2021 03:11

dbczumar mentioned this pull request Nov 8, 2021

[DOC-FIX] Doc missing for REST API POST 2.0/mlflow/artifacts/credentials-for-read #4839

Closed

3 tasks

dbczumar approved these changes Nov 8, 2021

View reviewed changes

harupy merged commit a05360c into mlflow:master Nov 8, 2021

harupy deleted the proxied-artifact-opertions-endpoints branch November 8, 2021 23:51

BenWilson2 mentioned this pull request Nov 10, 2021

Add server option for serving only artifacts and proxied serving mode #5045

Merged

29 tasks

		@@ -0,0 +1,45 @@
		This directory contains a set of files for demonstrating the MLflow Artifacts service.

Conversation

harupy commented Oct 26, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes are proposed in this pull request?

How is this patch tested?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dbczumar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dbczumar Nov 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harupy Nov 5, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harupy Nov 5, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harupy Nov 5, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harupy Nov 5, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harupy Nov 5, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dbczumar left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dbczumar left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

harupy commented Nov 8, 2021

harupy commented Oct 26, 2021 •

edited

Loading

dbczumar Nov 4, 2021 •

edited

Loading

harupy Nov 5, 2021 •

edited

Loading

harupy Nov 5, 2021 •

edited

Loading

harupy Nov 5, 2021 •

edited

Loading

harupy Nov 5, 2021 •

edited

Loading

harupy Nov 5, 2021 •

edited

Loading

dbczumar left a comment •

edited

Loading