Support search traces by prompt version by TomeHirata · Pull Request #18906 · mlflow/mlflow

TomeHirata · 2025-11-19T01:29:42Z

🛠 DevTools 🛠

Install mlflow from this PR

# mlflow
pip install git+https://github.com/mlflow/mlflow.git@refs/pull/18906/merge
# mlflow-skinny
pip install git+https://github.com/mlflow/mlflow.git@refs/pull/18906/merge#subdirectory=libs/skinny

For Databricks, use the following command:

%sh curl -LsSf https://raw.githubusercontent.com/mlflow/mlflow/HEAD/dev/install-skinny.sh | sh -s pull/18906/merge

Related Issues/PRs

n/a

What changes are proposed in this pull request?

This PR introduces a new prompts filter for search_traces. As part of this change, we migrate the linkage storage from a trace tag to the entity association table of the tracking store.

traces = mlflow.search_traces(filter_string="prompts = 'prompt-a/1'")

How is this PR tested?

Existing unit/integration tests
New unit/integration tests
Manual tests

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

How should the PR be classified in the release notes? Choose one:

rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Should this PR be included in the next patch release?

Yes should be selected for bug fixes, documentation updates, and other small changes. No should be selected for new features and larger changes. If you're unsure about the release classification of this PR, leave this unchecked to let the maintainers decide.

What is a minor/patch release?

Minor release: a release that increments the second part of the version number (e.g., 1.2.0 -> 1.3.0).
Bug fixes, doc updates and new features usually go into minor releases.
Patch release: a release that increments the third part of the version number (e.g., 1.2.0 -> 1.2.1).
Bug fixes and doc updates usually go into patch releases.

Yes (this PR will be cherry-picked and included in the next patch release)
No (this PR will be included in the next minor release)

- Removed the deprecated LINKED_PROMPTS_TAG_KEY from prompt constants.
- Updated all references to linked prompts in the codebase to use TraceTagKey.LINKED_PROMPTS.
- Added validation for prompts filter in search functionality to ensure only exact matches are allowed.
- Enhanced error handling for invalid prompts filter format.

This change centralizes the management of linked prompts and improves the clarity of the codebase.

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

github-actions · 2025-11-19T01:43:10Z

Documentation preview for 9d623ce is available at:

https://pr-18906--mlflow-docs-preview.netlify.app/docs/latest/

Changed Pages (1)

genai/tracing/search-traces (modified)

More info

Ignore this comment if this PR does not change the documentation.
The preview is updated when a new commit is pushed to this PR.
This comment was created by this workflow run.
The documentation was built by this workflow run.

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

docs/docs/genai/tracing/search-traces.mdx

B-Step62 · 2025-12-01T05:09:41Z

docs/docs/genai/tracing/search-traces.mdx

Can we add some documentation under /genai/prompt-registry/ as well? It may be worth having a single page that explains the link between prompts and traces, perhaps as a follow-up after adding the UI feature to jump from the prompt page to a list of traces.

Yeah, let me add the doc section after adding UI changes.

B-Step62 · 2025-12-01T05:17:31Z

mlflow/store/tracking/sqlalchemy_store.py


            if SearchTraceUtils.is_tag(key_type, comparator):
                entity = SqlTraceTag
+                # Special handling for prompts filter: only support exact match with name/version


Can we actually use the entity association somehow? Since traces are stored in the experiment, search query goes through tracking store anyway.

We could migrate the linkage to EntityAssociation, but what would the motivation be? Storing a more structured linkage?

For performance. The entity association table stores one row per prompt+trace combination, as opposed to a tag that combines multiple prompts into one row, so we can use an exact-match condition and benefit from the index on the destination ID (WHERE destination_type = 'prompt' AND destination_id = '<name/version>'). This should be much more efficient than LIKE on tag values.
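The contrast described above can be sketched with an in-memory SQLite database. This is only a toy illustration, not MLflow's schema: the table names, the index name, the `linked_prompts` tag key, and the JSON encoding of the tag value are all assumptions; only the `destination_type` / `destination_id` columns are taken from the comment.

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Tag-style storage (assumed layout): every linked prompt packed into one value.
    CREATE TABLE trace_tags (trace_id TEXT, key TEXT, value TEXT);
    -- Association-style storage (assumed layout): one row per trace/prompt pair.
    CREATE TABLE entity_associations (
        source_type TEXT, source_id TEXT,
        destination_type TEXT, destination_id TEXT
    );
    CREATE INDEX idx_destination
        ON entity_associations (destination_type, destination_id);
    """
)

# Hypothetical sample data: which prompt versions each trace used.
links = {"tr-1": ["prompt-a/1", "prompt-b/2"], "tr-2": ["prompt-c/3"]}
for trace_id, prompts in links.items():
    conn.execute(
        "INSERT INTO trace_tags VALUES (?, 'linked_prompts', ?)",
        (trace_id, json.dumps(prompts)),
    )
    conn.executemany(
        "INSERT INTO entity_associations VALUES ('trace', ?, 'prompt', ?)",
        [(trace_id, prompt) for prompt in prompts],
    )

# Substring search over the packed tag value.
like_sql = (
    "SELECT trace_id FROM trace_tags "
    "WHERE key = 'linked_prompts' AND value LIKE ? ORDER BY trace_id"
)
# Exact match on the association row; this predicate can use idx_destination.
exact_sql = (
    "SELECT source_id FROM entity_associations "
    "WHERE destination_type = 'prompt' AND destination_id = ? ORDER BY source_id"
)

like_hits = [row[0] for row in conn.execute(like_sql, ("%prompt-a/1%",))]
exact_hits = [row[0] for row in conn.execute(exact_sql, ("prompt-a/1",))]

# The last column of each EXPLAIN QUERY PLAN row is a human-readable description.
exact_plan = [
    row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + exact_sql, ("prompt-a/1",))
]
print(like_hits, exact_hits, exact_plan)
```

Both queries find the same trace in this toy data; the difference is in how the database gets there, which the printed query plan makes visible for the exact-match case.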

Understood, do we think the efficiency gain is worth breaking the backward compatibility right now? Current search does not require any data model change, and we can always switch to entity association if users report search slowness.

Understood, do we think the efficiency gain is worth breaking the backward compatibility right now?

Yes. Afaik we don't provide any place where users can see the link between tag and prompt on the OSS side, so I don't think it breaks anything. The migration becomes harder if we do it after adding the search functionality.

If you mean the Databricks side, we can keep tag logging for DBX. They also plan to migrate to the entity association table, fwiw.

I think consolidating the linkage in the entity association table and populating trace fields (either tag or metadata) from the entity association is clearer, since we can keep a single source of truth.

Yes, it removes duplication, but one downside is that every prompt load needs to make an additional SQL query, even if the trace is not linked to any prompt. Since tags are stored in TraceInfo, searching traces also triggers that for every prompt.

Also, unfortunately, the Databricks UI already shows prompt links based on the trace tag (which is different from OSS). This means that, even after dbx migrates to the entity table, we cannot remove the tag dependence, at least for a certain period 😔

This means that the end state will be:

Option 1 (yours):

Write: Set the tag in dbx only. Store a record in entity association table.

Read: Populate tag/metadata at load time.

Option 2:

Write: Set the tag and store to entity association table always.

Read: none

I'm not sure if state 1 is cleaner than 2. A single source of truth is nice in general, but not always the best. For example, we can think of it as a kind of database denormalization: we denormalize the info to the entity table for search optimization.

I got your point, but do you think the single additional query for retrieving associations is that problematic? As you said earlier, it should be quick since it relies on database indices, and as I implemented it, it does not cause an N+1 issue because it uses batch loading. Btw, I'm not necessarily objecting to option 2; we can go with dual writing if skipping one query is really worth the database denormalization.
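To make the batch-loading point concrete, here is a minimal sketch of fetching the prompt links for a whole page of traces with one `IN (...)` query instead of one query per trace. It is not the code from this PR: the `load_linked_prompts` helper, the table layout, and the sample IDs are hypothetical.

```python
import sqlite3
from collections import defaultdict


def load_linked_prompts(conn, trace_ids):
    """Return {trace_id: [prompt, ...]} for all given traces using a single query."""
    if not trace_ids:
        return {}
    placeholders = ", ".join("?" for _ in trace_ids)
    rows = conn.execute(
        "SELECT source_id, destination_id FROM entity_associations "
        "WHERE source_type = 'trace' AND destination_type = 'prompt' "
        f"AND source_id IN ({placeholders}) "
        "ORDER BY source_id, destination_id",
        list(trace_ids),
    )
    linked = defaultdict(list)
    for trace_id, prompt in rows:
        linked[trace_id].append(prompt)
    return dict(linked)


# Hypothetical schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entity_associations ("
    "source_type TEXT, source_id TEXT, destination_type TEXT, destination_id TEXT)"
)
conn.executemany(
    "INSERT INTO entity_associations VALUES ('trace', ?, 'prompt', ?)",
    [("tr-1", "prompt-a/1"), ("tr-1", "prompt-b/2"), ("tr-3", "prompt-a/1")],
)

# One round trip covers the whole page; tr-2 has no links and is simply absent.
page = load_linked_prompts(conn, ["tr-1", "tr-2", "tr-3"])
print(page)
```

The cost stays at one extra query per page of traces regardless of page size, which is the property the comment relies on.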

Btw, there are several TODO comments about removing the tag-based linkage. Didn't we want to remove the tag-based approach?

mlflow/mlflow/tracing/utils/prompt.py

Line 8 in b3f0e30

# TODO: Remove tag based linking once we migrate to LinkPromptsToTraces endpoint

I'm fine with populating it to tags if you feel strongly about it. For the TODO comment, I didn't expect to have "linked prompt" tab in the trace UI but just to have a button like "see linked prompts" that jumps to search page:)

I see, shall we defer the decision by implementing option 2 for now? Option 2 -> option 1 is feasible while the opposite is not.

mlflow/store/tracking/sqlalchemy_store.py

tests/store/tracking/test_sqlalchemy_store.py

mlflow/store/tracking/sqlalchemy_store.py

…y across documentation and codebase

- Changed all instances of 'prompts' to 'prompt' in the search_traces API documentation and related code.
- Updated error messages and test cases to reflect the new filter name.
- Ensured that the filter only supports exact matches with the format "name/version".

This change enhances clarity and consistency in the API usage.

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

…low into feat/prompt/search-traces

- Updated the filter validation logic to check for exactly one '/' in the prompt value, improving error handling for invalid inputs.
- Adjusted the test cases to include a new scenario for invalid prompt format.

This change enhances the robustness of the prompts filter functionality.

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

harupy · 2025-12-01T10:06:31Z

/review Only review mlflow/store/tracking/sqlalchemy_store.py and suggest a cleaner approach using match statement to validate the filter value.

🚀 Review workflow started

✅ Review completed.

Review Output

Perfect! I've successfully reviewed the PR and added a comment suggesting a cleaner approach using pattern matching.

Summary

I reviewed PR #18906 focusing on mlflow/store/tracking/sqlalchemy_store.py as requested. I found one style improvement opportunity:

Issue Found: The code uses the verbose approach of checking value.count("/") != 1 followed by value.split("/", 1) to validate and parse the prompt filter value format.

Suggestion: Replace this with pattern matching (match/case statement) which is cleaner, more concise, and aligns with MLflow's Python style guide recommendations for string splitting with fixed parts.

The review comment has been posted to the PR with a code suggestion that maintainers can apply with one click: #18906 (comment)

mlflow/store/tracking/sqlalchemy_store.py

- Introduced the `linkPromptsToTrace` RPC to associate multiple prompt versions with a trace, enhancing prompt tracking workflows.
- Implemented the necessary data structures and request handlers to support this functionality.
- Updated the tracking store interfaces to allow linking prompts to traces using entity associations.
- Added tests to verify the correct behavior of the new API and its integration with the existing system.

This change improves the ability to manage prompt versions in relation to traces, facilitating better tracking and organization of prompt usage.

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

…Store

- Added a method to populate the LINKED_PROMPTS tag from entity associations for traces that lack this tag, ensuring backward compatibility.
- Updated the get_trace_info method to call the new population method.
- Enhanced search_traces to include the LINKED_PROMPTS tag for relevant traces.
- Added unit tests to verify the correct population of LINKED_PROMPTS in various scenarios.

This change improves trace management by maintaining compatibility with existing data structures.

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

…traces Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

B-Step62

LGTM!

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

github-actions bot added the v3.6.1, area/tracing (MLflow Tracing and its integrations), and rn/feature (Mention under Features in Changelogs) labels Nov 19, 2025

TomeHirata requested review from B-Step62 and serena-ruan November 19, 2025 02:00

TomeHirata added 2 commits November 19, 2025 14:28

Enhance search_traces documentation to include linked prompts filter

159f7b5

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

Merge branch 'master' into feat/prompt/search-traces

a027905

B-Step62 reviewed Dec 1, 2025

harupy reviewed Dec 1, 2025

mlflow/store/tracking/sqlalchemy_store.py (outdated)

serena-ruan reviewed Dec 1, 2025

tests/store/tracking/test_sqlalchemy_store.py (outdated)

harupy reviewed Dec 1, 2025

tests/store/tracking/test_sqlalchemy_store.py (outdated)

harupy reviewed Dec 1, 2025

tests/store/tracking/test_sqlalchemy_store.py (outdated)

harupy reviewed Dec 1, 2025

mlflow/store/tracking/sqlalchemy_store.py (outdated)

TomeHirata added the v3.7.0 label Dec 1, 2025

TomeHirata added 3 commits December 1, 2025 18:44

Merge branch 'feat/prompt/search-traces' of github.com:TomeHirata/mlf…

b123831

…low into feat/prompt/search-traces

github-actions bot reviewed Dec 1, 2025

mlflow/store/tracking/sqlalchemy_store.py (outdated)

TomeHirata added 6 commits December 1, 2025 21:56

Update linkPromptsToTrace API documentation for clarity

511eecc

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

Enhance trace information population in SqlAlchemyStore

9b29c06

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

Merge remote-tracking branch 'mlflow/master' into feat/prompt/search-…

59788b3

…traces Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

Add back tag based logging

643a2a9

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

TomeHirata force-pushed the feat/prompt/search-traces branch from 0d368f2 to 643a2a9 Compare December 2, 2025 08:34

TomeHirata added 2 commits December 2, 2025 18:41

fix test

703102c

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

lint

9d623ce

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

TomeHirata requested a review from B-Step62 December 2, 2025 09:43

B-Step62 approved these changes Dec 3, 2025

TomeHirata added this pull request to the merge queue Dec 3, 2025

Merged via the queue into mlflow:master with commit 9671b90 Dec 3, 2025
63 of 65 checks passed

TomeHirata deleted the feat/prompt/search-traces branch December 3, 2025 07:57

BenWilson2 pushed a commit to BenWilson2/mlflow that referenced this pull request Dec 4, 2025

Support search traces by prompt version (mlflow#18906)

0d4632e

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>

BenWilson2 pushed a commit that referenced this pull request Dec 4, 2025

Support search traces by prompt version (#18906)

2f31e9c

Signed-off-by: Tomu Hirata <tomu.hirata@gmail.com>
