[Endpoints] [3/x] Entities base definitions by BenWilson2 · Pull Request #19004 · mlflow/mlflow

BenWilson2 · 2025-11-24T17:57:49Z

🥞 Stacked PR

Use this link to review incremental changes.

stack/endpoints/crypto [Files changed]
- stack/endpoints/entities [Files changed]
  - stack/endpoints/abstract [Files changed]
    - stack/endpoints/sql-store [Files changed]
      - stack/endpoints/rest [Files changed]
        
        stack/endpoints/rest-2 [Files changed]
        
        stack/endpoints/cache [Files changed]
        
        stack/endpoints/litellm [Files changed]
        
        stack/endpoints/ui-apis [Files changed]
        
        stack/endpoints/ui-create-endpoint [Files changed]
        stack/endpoints/ui-tabs [Files changed]
        stack/endpoints/key-management [Files changed]
        stack/endpoints/passphrase [Files changed]

Related Issues/PRs

#xxx

What changes are proposed in this pull request?

Adds the base entity definitions for secrets and endpoints. Note that proto interfaces are handled in a later PR in this stack.

How is this PR tested?

Existing unit/integration tests
New unit/integration tests
Manual tests

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

How should the PR be classified in the release notes? Choose one:

rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Should this PR be included in the next patch release?

Yes should be selected for bug fixes, documentation updates, and other small changes. No should be selected for new features and larger changes. If you're unsure about the release classification of this PR, leave this unchecked to let the maintainers decide.

Yes (this PR will be cherry-picked and included in the next patch release)
No (this PR will be included in the next minor release)

github-actions · 2025-11-26T03:49:53Z

Documentation preview for c017f38 is available at:

https://pr-19004--mlflow-docs-preview.netlify.app/docs/latest/

More info

Ignore this comment if this PR does not change the documentation.
The preview is updated when a new commit is pushed to this PR.
This comment was created by this workflow run.
The documentation was built by this workflow run.

B-Step62 · 2025-12-03T08:07:46Z

mlflow/entities/endpoint.py

+    last_updated_at: int
+    created_by: str | None = None
+    last_updated_by: str | None = None
+    endpoint_count: int = 0


Can we make this a lazily loaded property? IIUC this requires querying the mapping table, but we don't use the referenced endpoint count in every use case.

The endpoint count can be removed - we can infer this with the length of the mappings present in the model binding table. This is safe to delete.
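As a rough illustration of the reply above, a count can be derived from the mappings on access instead of being stored as a field. This is only a sketch: `Mapping` and `EntitySketch` are hypothetical stand-ins, not the PR's classes, and it assumes the mappings are already loaded on the entity.

```python
from dataclasses import dataclass, field


@dataclass
class Mapping:
    # Hypothetical stand-in for one row of the model binding table.
    mapping_id: str
    endpoint_id: str


@dataclass
class EntitySketch:
    # Hypothetical entity holding only the fields needed for this illustration.
    entity_id: str
    mappings: list[Mapping] = field(default_factory=list)

    @property
    def endpoint_count(self) -> int:
        # Derived on access from the mappings rather than stored as a field.
        return len(self.mappings)
```

With this shape there is no stored counter to keep in sync when mappings are added or removed.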

B-Step62 · 2025-12-03T08:09:04Z

mlflow/entities/endpoint.py

+    mapping_id: str
+    endpoint_id: str
+    model_definition_id: str
+    model_definition: ModelDefinition | None


Why do we need to store this within the mapping entity?

The goal here is to minimize query complexity on the front end. We use this entity directly in the UI when populating the model information card for an endpoint; without the model definition embedded, we would need one query per model attached to an endpoint (N queries) and then client-side joins to populate the page.
While this wouldn't be my design choice for SDK/CLI APIs, I intentionally went with this approach for the UI to minimize front-end complexity and reduce page render time.

Example: EndpointDetailsPage.tsx directly accesses this entity as returned.

B-Step62 · 2025-12-03T08:09:52Z

mlflow/entities/endpoint.py

+    name: str
+    created_at: int
+    last_updated_at: int
+    model_mappings: list[EndpointModelMapping] = field(default_factory=list)


I think this should also be queried lazily

The primary use of this in the UI is to display critical information in order to populate the reference fields that are directly displayed. We don't really have a use case in the UI display for non-eager return of this in the main endpoint listing page.
Where we directly need this:

Provider filtering (providers are mapped to models, not to endpoints, as an endpoint can fall back to using a model from a different provider)

Displayed model contents for each endpoint for link references to the models details page entries

Rendering of named models and underlying provider model names

If scale is a concern, I think we can introduce pagination in the future (i.e. users create thousands of endpoints), but the alternative (lazy loading) for the UI would be many queries being issued in order to populate the required aspects of the list page, dramatically increasing the front end complexity and forcing spin-loading elements within the page.

Thanks for the clarification! It makes sense to include the model mappings in the entity to show the model names in the UI and allow filtering based on them.
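A minimal sketch of the provider filtering discussed in this thread, assuming the mappings are returned eagerly with each endpoint. `MappingSketch`, `EndpointSketch`, and `filter_by_provider` are hypothetical names, and the `provider` attribute is assumed to come from the embedded model definition.

```python
from dataclasses import dataclass, field


@dataclass
class MappingSketch:
    model_name: str
    # Assumption: the provider is carried by the embedded model definition.
    provider: str


@dataclass
class EndpointSketch:
    name: str
    model_mappings: list[MappingSketch] = field(default_factory=list)


def filter_by_provider(endpoints: list[EndpointSketch], provider: str) -> list[EndpointSketch]:
    """Keep endpoints where at least one mapped model uses the given provider."""
    return [
        endpoint
        for endpoint in endpoints
        if any(m.provider == provider for m in endpoint.model_mappings)
    ]
```

Because providers attach to models, an endpoint with a fallback model from a second provider matches a filter on either provider.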

B-Step62 · 2025-12-03T08:10:53Z

mlflow/entities/endpoint.py

+        last_updated_at: Timestamp (milliseconds) when the binding was last updated.
+        created_by: User ID who created the binding.
+        last_updated_by: User ID who last updated the binding.
+        endpoint_name: Optional endpoint name, populated via JOIN for display.


Suggested change

endpoint_name: Optional endpoint name, populated via JOIN for display.

endpoint_name: Optional endpoint name, populated via JOIN for display.

Where do we plan to display this?

The endpoint name is displayed on the main list page, in the model mappings detail pages, and in the key mapping details pages.
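To illustrate what "populated via JOIN" could mean for `endpoint_name`, here is a small sketch using an in-memory SQLite database. The table and column names are assumptions for illustration; the actual SQL store schema is not shown in this PR.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative only.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE endpoints (endpoint_id TEXT PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE bindings (binding_id TEXT PRIMARY KEY, endpoint_id TEXT NOT NULL);
    INSERT INTO endpoints VALUES ('e1', 'chat-prod');
    INSERT INTO bindings VALUES ('b1', 'e1'), ('b2', 'e-missing');
    """
)

# LEFT JOIN keeps bindings whose endpoint row is absent, which is why the
# field is optional: endpoint_name comes back as None for those rows.
rows = conn.execute(
    """
    SELECT b.binding_id, b.endpoint_id, e.name AS endpoint_name
    FROM bindings AS b
    LEFT JOIN endpoints AS e ON e.endpoint_id = b.endpoint_id
    ORDER BY b.binding_id
    """
).fetchall()
```

A single joined query like this returns the display name with each binding, so a list page would not need a separate lookup per row.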

B-Step62 · 2025-12-03T08:11:36Z

mlflow/entities/endpoint.py

+    created_by: str | None = None
+    last_updated_by: str | None = None
+    endpoint_name: str | None = None
+    model_mappings: list[EndpointModelMapping] = field(default_factory=list)


Why is this a part of a binding entity?

This was originally intended to fulfill the 'filter the models for a single-model use resource_id' but it's overly complicated and doesn't need to be overloaded in this. We don't use this field in the UI and splitting out the resource-based binding information into a separate API is far simpler and more tailored to what is needed. I'm going to simplify this and then add the dedicated API for resources to fetch their binding information.

B-Step62 · 2025-12-03T08:14:33Z

mlflow/entities/gateway_endpoint.py

Can we name this gateway_endpoint.py or add a docstring at the top explaining that these models are used within Gateway? The terms "endpoint", "model", and "resource" are pretty generic without that context.

yeah I'll prefix these with gateway so that it's easier to see their relationships ontologically

B-Step62 · 2025-12-03T08:20:46Z

mlflow/entities/endpoint.py

+
+
+@dataclass
+class ModelConfig(_MlflowObject):


What is the purpose of these entities? I casually read through the remaining stack but didn't find a usage. If we need these, can we define this somewhere in more limited scope than general entities to avoid confusion with ModelDefinition and Endpoint entities?

These are the interfaces for the downstream access of decrypted secrets server-side. They won't be exposed in the rest interface (non-public). The sql store will have the implementation that returns these with an internal RPC.

Can we move these entities into a different module? All classes under mlflow/entities are for the public interface. Also, they don't need to be _MlflowObject subclasses then, because we probably want to be stricter about the print and serde methods rather than using the base methods.

yep! Since these are entirely UI-only I'll migrate them to make it clear that they're isolated. Good call.
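One way to be "stricter about the print and serde methods" for an internal type is a plain dataclass that keeps the plaintext out of `repr()` and out of its serialized form. This is a sketch under assumptions: `DecryptedCredential` and `to_safe_dict` are hypothetical names, not classes or methods from this PR.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DecryptedCredential:
    # Hypothetical internal type; not one of the PR's actual classes.
    secret_id: str
    provider: str
    # repr=False keeps the plaintext out of repr(), and therefore out of
    # log lines and tracebacks that print the object.
    secret_value: str = field(repr=False)

    def to_safe_dict(self) -> dict[str, str]:
        """Explicit serialization that never includes the plaintext value."""
        return {"secret_id": self.secret_id, "provider": self.provider}
```

Spelling out the serialization explicitly, rather than inheriting generic base methods, makes it harder to leak a newly added sensitive field by accident.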

B-Step62 · 2025-12-03T08:22:11Z

tests/entities/test_gateway_endpoint.py

I'm not sure these tests (test_endpoint.py and test_secrets.py) are meaningful.... they are basically only testing default dataclass constructor right...?

Yeah in this PR they're nonsense stubs. The full test suite is added with the addition of the from_proto() / to_proto() methods in the rest branch.

Can we remove these tests? I saw rest branch adds more tests, but my point was we don't want to have non-useful tests, because that makes the code lengthy and can hide important details.

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

Copilot

Pull request overview

This PR introduces base entity definitions for gateway endpoints and secrets management, along with comprehensive cryptographic utilities for secure secret storage using envelope encryption. The implementation adds support for storing encrypted API keys and credentials with KEK/DEK encryption schemes.

Key Changes:

Adds cryptographic utilities for AES-256-GCM encryption with KEK/DEK envelope encryption
Introduces entity definitions for gateway endpoints, model definitions, and secrets
Implements CLI commands for KEK rotation operations
Provides comprehensive test coverage for cryptography and entity classes
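For readers unfamiliar with the passphrase-to-KEK step mentioned in this overview, the following sketch derives an AES-256-sized key with PBKDF2 using only the Python standard library. It is not the code in `mlflow/utils/cryptography.py`: the function name `derive_kek`, the SHA-256 hash choice, the 16-byte salt, and the iteration count are all assumptions. Wrapping a DEK with AES-256-GCM would additionally need a third-party library and is not shown.

```python
import hashlib
import os


def derive_kek(passphrase: str, salt: bytes, iterations: int = 600_000) -> bytes:
    """Derive a 32-byte (AES-256 sized) key from a passphrase with PBKDF2-HMAC-SHA256."""
    return hashlib.pbkdf2_hmac(
        "sha256", passphrase.encode("utf-8"), salt, iterations, dklen=32
    )


# The salt is random per deployment and stored alongside the ciphertext;
# it is not secret, but it must be reused to re-derive the same key.
salt = os.urandom(16)
kek = derive_kek("example passphrase", salt)
```

In an envelope scheme, a key like this would encrypt per-secret DEKs rather than the secrets themselves, so rotating the KEK only requires re-wrapping the DEKs.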

Reviewed changes

Copilot reviewed 12 out of 12 changed files in this pull request and generated 2 comments.


| File | Description |
| --- | --- |
| `mlflow/utils/cryptography.py` | Core cryptographic utilities implementing AES-256-GCM encryption, PBKDF2 key derivation, and envelope encryption with KEK/DEK pattern |
| `mlflow/entities/gateway_secrets.py` | Entity definition for encrypted secrets with LLM provider credentials |
| `mlflow/entities/secrets.py` | Duplicate/unused entity file with minor docstring differences |
| `mlflow/entities/gateway_endpoint.py` | Entity definitions for endpoints, model definitions, mappings, and configurations |
| `mlflow/entities/__init__.py` | Updated exports to include new gateway and secret entities |
| `mlflow/cli/cryptography.py` | CLI commands for KEK rotation operations with comprehensive user guidance |
| `mlflow/cli/__init__.py` | Integration of cryptography CLI commands |
| `tests/utils/test_cryptography.py` | Comprehensive test coverage for cryptographic functions |
| `tests/entities/test_gateway_secrets.py` | Test coverage for secret entity creation and attributes |
| `tests/entities/test_gateway_endpoint.py` | Test coverage for endpoint and model definition entities |
| `tests/cli/test_cryptography.py` | Test coverage for CLI commands with mocked database operations |
| `docs/api_reference/api_inventory.txt` | API documentation inventory updates |


Copilot · 2025-12-04T01:59:47Z

mlflow/entities/secrets.py

+from dataclasses import dataclass
+
+from mlflow.entities._mlflow_object import _MlflowObject
+
+
+@dataclass
+class GatewaySecret(_MlflowObject):
+    """
+    Represents an encrypted secret for authenticating with LLM providers.
+
+    Store encrypted API keys and authentication credentials using envelope encryption.
+    The actual secret value is encrypted with a DEK (Data Encryption Key), which is itself
+    encrypted by a KEK (Key Encryption Key) for secure storage.
+
+    Args:
+        secret_id: Unique identifier for this secret.
+        secret_name: User-friendly name for the secret (must be unique).
+        masked_value: Masked version of the secret for display (e.g., "sk-...xyz123").
+        provider: LLM provider this secret is for (e.g., "openai", "anthropic").
+        created_at: Timestamp (milliseconds) when the secret was created.
+        last_updated_at: Timestamp (milliseconds) when the secret was last updated.
+        created_by: User ID who created the secret.
+        last_updated_by: User ID who last updated the secret.
+    """
+
+    secret_id: str
+    secret_name: str
+    masked_value: str
+    created_at: int
+    last_updated_at: int
+    provider: str | None = None
+    created_by: str | None = None
+    last_updated_by: str | None = None


Minor documentation inconsistency: Line 11 says "Store encrypted API keys..." while the identical class in gateway_secrets.py says "Secrets store encrypted API keys...". While mlflow/entities/secrets.py exists, it's not imported in __init__.py - only gateway_secrets.py is imported. This file appears to be unused and should likely be removed to avoid confusion, or the import should be updated to use this file instead.

Suggested change

# FILE REMOVED: This file was unused and duplicated functionality from gateway_secrets.py.

Let's remove this file
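The `masked_value` field documented in the `GatewaySecret` docstring in this thread uses the example "sk-...xyz123". A masking helper that produces that shape might look like the sketch below; `mask_secret` and its default lengths are assumptions, not the PR's implementation.

```python
def mask_secret(value: str, prefix_len: int = 3, suffix_len: int = 6) -> str:
    """Return a display-safe form of a secret, such as "sk-...xyz123"."""
    if len(value) <= prefix_len + suffix_len:
        # Too short to reveal any part without exposing most of it.
        return "***"
    return f"{value[:prefix_len]}...{value[-suffix_len:]}"
```

For example, `mask_secret("sk-abcdefghijklmnopxyz123")` returns `"sk-...xyz123"`, while a very short value is hidden entirely.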

Copilot · 2025-12-04T01:59:47Z

mlflow/cli/cryptography.py

+@click.group("crypto", help="Commands for managing MLflow's cryptographic passphrase.")
+def commands():
+    """
+    MLflow cryptopgraphic management CLI. Allows for the management of the envelope


There's a typo in the docstring: "cryptopgraphic" should be "cryptographic".

Suggested change

MLflow cryptopgraphic management CLI. Allows for the management of the envelope

MLflow cryptographic management CLI. Allows for the management of the envelope

B-Step62

Looks good once #19004 (comment) and #19004 (comment) are addressed!

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

TomeHirata · 2025-12-05T02:29:24Z

mlflow/entities/gateway_endpoint.py

+        endpoint_id: ID of the endpoint.
+        model_definition_id: ID of the model definition.
+        model_definition: The full model definition (populated via JOIN).
+        weight: Routing weight for traffic distribution (default 1).


@BenWilson2 What is the unit of weight here? Is it a percentage?
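The question above is not answered in this thread. Purely as an illustration of one possible reading, if weights were relative values rather than percentages, each model's share of traffic would be its weight divided by the sum of all weights. The helper `traffic_shares` is hypothetical and does not describe how the gateway actually routes.

```python
def traffic_shares(weights: dict[str, int]) -> dict[str, float]:
    """Convert relative routing weights into fractional traffic shares."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    # Under the relative-weight assumption, shares always sum to 1.0.
    return {name: weight / total for name, weight in weights.items()}
```

Under this reading, the default weight of 1 on every mapping would mean an even split, and weights of 1 and 3 would mean 25% and 75%.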

BenWilson2 changed the title ~~WIP~~ [Endpoints] [3/x] Entities base definitions Nov 24, 2025

BenWilson2 marked this pull request as ready for review November 24, 2025 18:22

BenWilson2 mentioned this pull request Nov 24, 2025

[Endpoints] [8/x] Add credential cache #19014

Merged

29 tasks

github-actions bot added area/tracking Tracking service, tracking client APIs, autologging rn/feature Mention under Features in Changelogs. labels Nov 24, 2025

BenWilson2 force-pushed the stack/endpoints/entities branch 3 times, most recently from f06dead to cba8c42 Compare November 26, 2025 03:39

BenWilson2 mentioned this pull request Nov 26, 2025

[Endpoints] [11/x] Main Gateway page and create endpoint #19042

Closed

29 tasks

This was referenced Nov 27, 2025

[Endpoints] [12/x] Add Left hand nav and management stub #19070

Closed

[Endpoints] [13/x] Add the API Key management pages #19071

Closed

BenWilson2 force-pushed the stack/endpoints/entities branch from cba8c42 to 58189cc Compare November 27, 2025 06:31

BenWilson2 mentioned this pull request Nov 27, 2025

[Endpoints] [14/x] Models management UI #19074

Closed

29 tasks

BenWilson2 force-pushed the stack/endpoints/entities branch 2 times, most recently from e821af0 to 7c62891 Compare December 2, 2025 02:14

BenWilson2 added the team-review Trigger a team review request label Dec 2, 2025

github-actions bot requested review from B-Step62, daniellok-db, harupy, kevin-lyn and serena-ruan December 2, 2025 16:03

github-actions bot requested review from TomeHirata, WeichenXu123 and xq-yin December 2, 2025 16:03

BenWilson2 mentioned this pull request Dec 2, 2025

[Endpoints] [14/x] Add encryption availability endpoints #19176

Closed

29 tasks

BenWilson2 force-pushed the stack/endpoints/entities branch 2 times, most recently from c02afd1 to 384adf3 Compare December 3, 2025 01:52

B-Step62 reviewed Dec 3, 2025


BenWilson2 force-pushed the stack/endpoints/entities branch 2 times, most recently from 14fce99 to 09aba4e Compare December 4, 2025 00:27

BenWilson2 requested a review from B-Step62 December 4, 2025 00:30

Add crypto capabilities

1da3509

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

BenWilson2 force-pushed the stack/endpoints/entities branch from 09aba4e to 3c17d6c Compare December 4, 2025 01:54

Copilot AI review requested due to automatic review settings December 4, 2025 01:54

Copilot started reviewing on behalf of BenWilson2 December 4, 2025 01:55 View session

Copilot finished reviewing on behalf of BenWilson2 December 4, 2025 01:56

Copilot AI reviewed Dec 4, 2025


B-Step62 approved these changes Dec 4, 2025


WIP

c017f38

Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>

BenWilson2 force-pushed the stack/endpoints/entities branch from 3c17d6c to c017f38 Compare December 4, 2025 21:12

BenWilson2 added this pull request to the merge queue Dec 4, 2025

Merged via the queue into mlflow:master with commit 0944a18 Dec 4, 2025
50 checks passed

BenWilson2 deleted the stack/endpoints/entities branch December 4, 2025 21:55

TomeHirata reviewed Dec 5, 2025
