Improve scorer trace picker UX and validation by danielseong1 · Pull Request #20178 · mlflow/mlflow

danielseong1 · 2026-01-21T07:29:28Z

Related Issues/PRs

#xxx

What changes are proposed in this pull request?

This PR makes two adjustments to the scorer evaluation UI based on UX feedback:

Remove "Last trace/session" dropdown option - Users must now explicitly select traces/sessions via the modal picker. This simplifies the UI and makes trace selection more intentional.
Disable "Run judge" button for non-gateway models - Direct model endpoints are not supported for running judges from the UI, so the button is now disabled with an explanatory tooltip.

Key changes:

Simplified itemsToEvaluate state from { itemCount, itemIds } to just selectedItemIds: string[]
Replaced dropdown with a simple button that opens the selection modal
Added validation to disable run button when no traces are selected
Added validation to disable run button for direct (non-gateway) models

How is this PR tested?

Existing unit/integration tests
New unit/integration tests
Manual tests

Updated existing unit tests to reflect the simplified state shape. Manually verified the UI changes work correctly.

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

How should the PR be classified in the release notes? Choose one:

rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

Should this PR be included in the next patch release?

Yes (this PR will be cherry-picked and included in the next patch release)
No (this PR will be included in the next minor release)

🤖 Generated with Claude Code

github-actions · 2026-01-21T07:29:49Z

🛠 DevTools 🛠

Install mlflow from this PR

# mlflow
pip install git+https://github.com/mlflow/mlflow.git@refs/pull/20178/merge
# mlflow-skinny
pip install git+https://github.com/mlflow/mlflow.git@refs/pull/20178/merge#subdirectory=libs/skinny

For Databricks, use the following command:

%sh curl -LsSf https://raw.githubusercontent.com/mlflow/mlflow/HEAD/dev/install-skinny.sh | sh -s pull/20178/merge

github-actions · 2026-01-21T07:38:22Z

Documentation preview for 31bdb7e is available at:

https://pr-20178--mlflow-docs-preview.netlify.app/docs/latest/

More info

Ignore this comment if this PR does not change the documentation.
The preview is updated when a new commit is pushed to this PR.
This comment was created by this workflow run.
The documentation was built by this workflow run.

This PR makes two improvements to the scorer evaluation UI: 1. Remove "Last trace/session" dropdown option - Users must now explicitly select traces/sessions via the modal picker. This simplifies the UI and makes trace selection more intentional. 2. Disable "Run judge" button for non-gateway models - Direct model endpoints are not supported for running judges from the UI, so the button is now disabled with an explanatory tooltip. Key changes: - Simplified `itemsToEvaluate` state from `{ itemCount, itemIds }` to just `selectedItemIds: string[]` - Replaced dropdown with a simple button that opens the selection modal - Added validation to disable run button when no traces are selected - Added validation to disable run button for direct (non-gateway) models 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> Signed-off-by: Daniel Seong <daniel.leem.seong@gmail.com>

smoorjani

LGTM, thanks for the fixes!

Signed-off-by: Daniel Seong <daniel.leem.seong@gmail.com> Co-authored-by: Daniel Seong <daniel.leem.seong@gmail.com> Co-authored-by: Claude <noreply@anthropic.com>

github-actions bot added area/tracing MLflow Tracing and its integrations area/uiux Front-end, user experience, plotting, JavaScript, JavaScript dev server rn/feature Mention under Features in Changelogs. labels Jan 21, 2026

danielseong1 requested a review from hubertzub-db January 21, 2026 07:59

github-actions bot added area/tracing MLflow Tracing and its integrations and removed area/tracing MLflow Tracing and its integrations labels Jan 21, 2026

danielseong1 added the v3.9.0 label Jan 21, 2026

danielseong1 force-pushed the scorers-bugs branch from 1709b5b to 31bdb7e Compare January 21, 2026 08:03

github-actions bot added rn/bug-fix Mention under Bug Fixes in Changelogs. and removed rn/feature Mention under Features in Changelogs. area/tracing MLflow Tracing and its integrations labels Jan 21, 2026

danielseong1 requested a review from smoorjani January 21, 2026 18:00

smoorjani approved these changes Jan 21, 2026

View reviewed changes

github-actions bot assigned smoorjani Jan 22, 2026

danielseong1 added this pull request to the merge queue Jan 22, 2026

Merged via the queue into mlflow:master with commit 4737370 Jan 22, 2026
55 of 57 checks passed

danielseong1 deleted the scorers-bugs branch January 22, 2026 01:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve scorer trace picker UX and validation#20178

Improve scorer trace picker UX and validation#20178
danielseong1 merged 1 commit intomlflow:masterfrom
danielseong1:scorers-bugs

danielseong1 commented Jan 21, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 21, 2026

Install mlflow from this PR

Uh oh!

github-actions bot commented Jan 21, 2026 •

edited

Loading

Uh oh!

smoorjani left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

danielseong1 commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related Issues/PRs

What changes are proposed in this pull request?

How is this PR tested?

Does this PR require documentation update?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

Should this PR be included in the next patch release?

Uh oh!

github-actions bot commented Jan 21, 2026

Install mlflow from this PR

Uh oh!

github-actions bot commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

smoorjani left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

danielseong1 commented Jan 21, 2026 •

edited

Loading

github-actions bot commented Jan 21, 2026 •

edited

Loading