
[Endpoints] [9/x] Add provider, model, and configuration handling#19009

Merged
BenWilson2 merged 1 commit into mlflow:master from BenWilson2:stack/endpoints/litellm
Dec 11, 2025

Conversation

@BenWilson2
Member

@BenWilson2 BenWilson2 commented Nov 24, 2025

🥞 Stacked PR

Use this link to review incremental changes.


Related Issues/PRs

#xxx

What changes are proposed in this pull request?

Adds backend APIs for fetching providers, models from providers, and provider-specific configurations needed to fulfill a request. Adds a new dependency to the [genai] optional installation to support these backend APIs.
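As a rough illustration of the three lookups described above, the shape of the API might resemble the sketch below. All names, providers, and models here are hypothetical placeholders; the actual implementation delegates to LiteLLM rather than a static registry.

```python
# Hypothetical sketch of the three backend lookups: providers, models per
# provider, and provider-specific configuration. A tiny static registry
# stands in for the LiteLLM-backed implementation; all values are placeholders.
_REGISTRY = {
    "openai": {
        "models": ["gpt-4o", "gpt-4o-mini"],
        "config": {"api_key_env": "OPENAI_API_KEY"},
    },
    "anthropic": {
        "models": ["claude-3-5-sonnet"],
        "config": {"api_key_env": "ANTHROPIC_API_KEY"},
    },
}


def list_providers() -> list[str]:
    """Return the names of all supported providers."""
    return sorted(_REGISTRY)


def list_models(provider: str) -> list[str]:
    """Return the models available from a single provider."""
    return _REGISTRY.get(provider, {}).get("models", [])


def get_provider_config(provider: str) -> dict[str, str]:
    """Return the provider-specific configuration needed to fulfill a request."""
    return _REGISTRY.get(provider, {}).get("config", {})
```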

How is this PR tested?

  • Existing unit/integration tests
  • New unit/integration tests
  • Manual tests

Does this PR require documentation update?

  • No. You can skip the rest of this section.
  • Yes. I've updated:
    • Examples
    • API references
    • Instructions

Release Notes

Is this a user-facing change?

  • No. You can skip the rest of this section.
  • Yes. Give a description of this change to be included in the release notes for MLflow users.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

  • area/tracking: Tracking Service, tracking client APIs, autologging
  • area/models: MLmodel format, model serialization/deserialization, flavors
  • area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry
  • area/scoring: MLflow Model server, model deployment tools, Spark UDFs
  • area/evaluation: MLflow model evaluation features, evaluation metrics, and evaluation workflows
  • area/gateway: MLflow AI Gateway client APIs, server, and third-party integrations
  • area/prompts: MLflow prompt engineering features, prompt templates, and prompt management
  • area/tracing: MLflow Tracing features, tracing APIs, and LLM tracing functionality
  • area/projects: MLproject format, project running backends
  • area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
  • area/build: Build and test infrastructure for MLflow
  • area/docs: MLflow documentation pages

How should the PR be classified in the release notes? Choose one:

  • rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
  • rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
  • rn/feature - A new user-facing feature worth mentioning in the release notes
  • rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
  • rn/documentation - A user-facing documentation change worth mentioning in the release notes

Should this PR be included in the next patch release?

Yes should be selected for bug fixes, documentation updates, and other small changes. No should be selected for new features and larger changes. If you're unsure about the release classification of this PR, leave this unchecked to let the maintainers decide.

What is a minor/patch release?
  • Minor release: a release that increments the second part of the version number (e.g., 1.2.0 -> 1.3.0).
    Bug fixes, doc updates and new features usually go into minor releases.
  • Patch release: a release that increments the third part of the version number (e.g., 1.2.0 -> 1.2.1).
    Bug fixes and doc updates usually go into patch releases.
  • Yes (this PR will be cherry-picked and included in the next patch release)
  • No (this PR will be included in the next minor release)

@BenWilson2 BenWilson2 changed the title from "dynamic providers" to "[Endpoints] [9/x] Add provider, model, and configuration handling" Nov 24, 2025
@BenWilson2 BenWilson2 marked this pull request as ready for review November 24, 2025 22:06
@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch from 6a9292e to 4c0e39f on November 24, 2025 22:19
@github-actions github-actions bot added the labels area/tracking (Tracking service, tracking client APIs, autologging) and rn/feature (Mention under Features in Changelogs) Nov 24, 2025
@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch 2 times, most recently from e176d7d to 00d7a50 on November 26, 2025 03:40
@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch from 00d7a50 to de3bd35 on November 27, 2025 06:31
@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch 2 times, most recently from 85037f1 to 0526203 on December 2, 2025 02:14
@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch 2 times, most recently from 93c2b51 to dca21ae on December 2, 2025 22:49
@github-actions
Contributor

github-actions bot commented Dec 2, 2025

Documentation preview for 146bd37 is available at:

More info
  • Ignore this comment if this PR does not change the documentation.
  • The preview is updated when a new commit is pushed to this PR.
  • This comment was created by this workflow run.
  • The documentation was built by this workflow run.

@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch 3 times, most recently from 0bcf261 to 2546def on December 3, 2025 22:36
@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch from 3c5d12d to 02deb34 on December 9, 2025 20:27
@BenWilson2
Member Author

@BenWilson2 I thought we would just rely on LiteLLM because we don't want to manage the custom mapping. Did we change the decision? At the very least we shouldn't have both. If we need to maintain the static mapping anyway, we don't need the litellm-based backend handler and can just share the config file between the UI and backend.

@B-Step62 litellm responds with a list of all optional valid fields, but doesn't organize them into "these are the ones that are used together", hence the need for the static mapping. It's not ideal, but it's better than hard-coding mappings for all providers, AND it allows us to validate whether the backend can support a given provider (which is why, for this version, I surmised that allowing an editable connection configuration might be a bad idea for an arbitrary provider: a user wouldn't know whether it's valid until after having spent time configuring an endpoint).

For the static mapping, I figured that it would be less confusing to users if we provided the auth groupings that are required for these providers instead of a full list of "choose the ones you know you need" to reduce the cognitive load when creating an endpoint with a given selectable configuration.

Open to hearing thoughts on alternatives here!
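The "auth groupings" idea described above could be sketched roughly as follows. The mapping contents, mode names, and field names here are illustrative assumptions, not the actual contents of MLflow's `_PROVIDER_AUTH_MODES`.

```python
# Hypothetical illustration of grouping credential fields into coherent auth
# modes, rather than presenting users with a flat list of every optional
# field. Provider, mode, and field names are assumptions for illustration.
_PROVIDER_AUTH_MODES = {
    "bedrock": [
        {
            "mode": "access_keys",
            "fields": ["aws_access_key_id", "aws_secret_access_key", "aws_region"],
        },
        {
            "mode": "iam_role",
            "fields": ["aws_role_arn", "aws_region"],
        },
    ],
}


def auth_fields(provider: str, mode: str) -> list[str]:
    """Return the set of credential fields that belong together for one auth mode."""
    for entry in _PROVIDER_AUTH_MODES.get(provider, []):
        if entry["mode"] == mode:
            return entry["fields"]
    raise ValueError(f"Unknown auth mode {mode!r} for provider {provider!r}")
```

The UI can then offer a small choice of complete, valid groupings instead of asking the user to pick individual fields.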

@BenWilson2
Member Author

Btw, for provider-specific configuration, can we reuse our configurations for the existing gateway? These classes already specify the necessary information to make an LLM call for a specific provider. We'd still need to rely on LiteLLM for models, though.

@TomeHirata the config that we have in Gateway doesn't have the other fields that the UI needs for displaying helper text and easily legible naming information. Those classes do contain the field mappings for validation, but I felt it would be better to keep the full config for these special-case providers in a single location (I also didn't want to link a soon-to-be-deprecated portion of the code base to a new feature, which would complicate cleanup in the future).

@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch 2 times, most recently from f076132 to dd53418 on December 9, 2025 22:45
@TomeHirata
Collaborator

the config that we have in Gateway doesn't have the other fields that we want to use for UI to display for helper text / easily legible naming or information. They do contain the field mappings for validation, but I felt that it might be better to contain the full config for these special case providers in a single location

Understood. To share the context behind my question: we'll have two types of providers in our gateway, first-tier support, for which MLflow owns schema unification, and second-tier support, for which we'll use LiteLLM. For the first-tier providers (openai, anthropic, ...) we'll still reuse the existing gateway provider implementation, and the per-provider configuration should be consistent between here and the gateway provider implementation. Not a blocker; we can add more auth modes to the gateway implementation later.

@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch 2 times, most recently from bc4092d to 728019d on December 10, 2025 20:10

@TomeHirata TomeHirata left a comment


@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch from 728019d to 249b195 on December 11, 2025 04:14
@B-Step62
Collaborator

B-Step62 commented Dec 11, 2025

@B-Step62 litellm responds with a list of all optional valid fields, but doesn't organize them into "these are the ones that are used together", hence the need to have the static mapping. It's not ideal, but it's better than having hard-mappings of all providers AND allows us to validate that the backend can support or not support a given provider (which is why for this version I surmised that allowing an editable connection configuration might be a bad idea for a random provider - a user wouldn't know until after having spent time configuring an endpoint that it's valid or not).

@BenWilson2 Hmm, I'm still not following the intention of the current implementation. According to the description you tried to add the missing information (combinations of optional fields), but reading the logic of _get_credential_fields, we never use the hard-coded mapping in conjunction with the LiteLLM response.

If I read the code correctly, what the current implementation does is:

  1. Get field configuration from LiteLLM via get_provider_fields.
  2. If it returns some configuration, return it as is.
  3. Otherwise, fall back to the static config.

Am I missing something? I don't see _PROVIDER_CREDENTIAL_MAPPING used anywhere else.

    get_provider_fields = _get_provider_fields()
    provider_fields = get_provider_fields(provider)

    if provider_fields and len(provider_fields) > 0:
        return [
            {
                "name": field["field_name"],
                "type": field.get("field_type", "string"),
                "description": field.get("field_description", ""),
                "required": True,
            }
            for field in provider_fields
        ]
    elif provider in _PROVIDER_CREDENTIAL_MAPPING:
        return _PROVIDER_CREDENTIAL_MAPPING[provider]

    return []

P.S. Re-reading your comment above, it seems we are talking about different mappings. What I referred to is _PROVIDER_CREDENTIAL_MAPPING; it seems you are talking about _PROVIDER_AUTH_MODES, and I understand we need the latter, as discussed offline.
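The fallback behavior described above can be exercised with a stub in place of the LiteLLM lookup. This is a self-contained sketch; the `_PROVIDER_CREDENTIAL_MAPPING` contents and the `credential_fields` wrapper are illustrative, not the actual MLflow code.

```python
# Stub-based sketch of the fallback described above: fields reported by the
# LiteLLM lookup win; the static mapping is consulted only when the lookup
# returns nothing. Mapping contents are illustrative.
_PROVIDER_CREDENTIAL_MAPPING = {
    "customprov": [
        {"name": "api_key", "type": "string", "description": "", "required": True},
    ],
}


def credential_fields(provider, get_provider_fields):
    """Resolve credential fields, preferring the (injected) LiteLLM lookup."""
    provider_fields = get_provider_fields(provider)
    if provider_fields:
        return [
            {
                "name": field["field_name"],
                "type": field.get("field_type", "string"),
                "description": field.get("field_description", ""),
                "required": True,
            }
            for field in provider_fields
        ]
    # Static fallback: reached only when the lookup yields no fields, so the
    # two sources are never combined -- the redundancy the thread points out.
    return _PROVIDER_CREDENTIAL_MAPPING.get(provider, [])
```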

@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch from 249b195 to 484ac23 on December 11, 2025 14:54
@BenWilson2
Member Author

@B-Step62 litellm responds with a list of all optional valid fields, but doesn't organize them into "these are the ones that are used together", hence the need to have the static mapping. It's not ideal, but it's better than having hard-mappings of all providers AND allows us to validate that the backend can support or not support a given provider (which is why for this version I surmised that allowing an editable connection configuration might be a bad idea for a random provider - a user wouldn't know until after having spent time configuring an endpoint that it's valid or not).

@BenWilson2 Hmm I'm still not following what is the intention of the current implementation.... according to the description you tried to add missing information (combination of optional fields), but if I read the logic of _get_credential_fields, we never use the hard-coded mapping in conjunction with the LiteLLM response.

If I read the code correctly, what current implementation does is

  1. Get field configuration from LiteLLM via get_provider_fields.
  2. If it returns some configuration, return it as is.
  3. Otherwise, fallback to the static config.

Am I missing something? I don't see _PROVIDER_CREDENTIAL_MAPPING used anywhere else.

    get_provider_fields = _get_provider_fields()
    provider_fields = get_provider_fields(provider)

    if provider_fields and len(provider_fields) > 0:
        return [
            {
                "name": field["field_name"],
                "type": field.get("field_type", "string"),
                "description": field.get("field_description", ""),
                "required": True,
            }
            for field in provider_fields
        ]
    elif provider in _PROVIDER_CREDENTIAL_MAPPING:
        return _PROVIDER_CREDENTIAL_MAPPING[provider]

    return []

P.S. Re-read your comment above, it seems we are talking about different mapping. What I referred to is _PROVIDER_CREDENTIAL_MAPPING. It seems you are talking about _PROVIDER_AUTH_MODES, and I understand we need the latter as discussed offline.

WOW, yeah, I was 100% referring to the auth modes mapping. Sorry about that!
We're:

  1. Requiring litellm to be installed
  2. Needing the config to be compatible with the library interface

Having a 'safe fallback' is pointless and is just another vector for us to keep updated as new providers are added (or dropped).

@B-Step62 thanks for refocusing my attention on this. I'm going to remove the typed dict entry and the fallback mapping, as it provides zero value and actively makes maintenance harder.

Thank you!

@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch 2 times, most recently from 3a3246b to a73fbd3 on December 11, 2025 16:02
Signed-off-by: Ben Wilson <benjamin.wilson@databricks.com>
@BenWilson2 BenWilson2 force-pushed the stack/endpoints/litellm branch from a73fbd3 to 146bd37 on December 11, 2025 16:10
@BenWilson2 BenWilson2 added this pull request to the merge queue Dec 11, 2025
Merged via the queue into mlflow:master with commit d4bcdf2 Dec 11, 2025
62 of 70 checks passed
@BenWilson2 BenWilson2 deleted the stack/endpoints/litellm branch December 11, 2025 17:20
@@ -4110,9 +4110,11 @@ message CreateGatewaySecret {
optional string secret_value = 2;
Collaborator


@BenWilson2 I think we should update the schema for secret_value here to support arbitrary key-value pairs?

@@ -22,7 +22,6 @@ def create_secret(
secret_name: str,
secret_value: str,
Collaborator


@BenWilson2 I think we should update the type hint here to str | dict[str, str]?
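The suggested type-hint change could look roughly like the sketch below. This is not the actual MLflow signature; the normalization step is a hypothetical design choice for illustration.

```python
from __future__ import annotations


def create_secret(secret_name: str, secret_value: str | dict[str, str]) -> dict:
    # Hypothetical sketch of the suggested hint: accept either a single secret
    # string or an arbitrary key-value mapping, and normalize the string form
    # into a one-entry mapping so downstream code handles only one shape.
    if isinstance(secret_value, str):
        secret_value = {"value": secret_value}
    return {"name": secret_name, "value": secret_value}
```

Normalizing at the boundary keeps the multi-field auth configurations discussed earlier in the thread (e.g. access key plus region) on the same code path as simple API-key secrets.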


Labels

area/tracking (Tracking service, tracking client APIs, autologging), rn/feature (Mention under Features in Changelogs), team-review (Trigger a team review request)


4 participants