[bugfix] eval list default version + eval-task reason truncation #229

Merged
khushalsonawat merged 2 commits into dev from fix/th-4488-eval-list-default-version
May 5, 2026

@NVJKKartik
Contributor

Summary

Two backend fixes bundled on this branch:

  • TH-4488 — eval list default version: EvalTemplateListView was hardcoding version_count=1 / current_version="V1" for every template, so changing the default version was never reflected in the outer evals list. Added a fetch_version_metadata() helper in model_hub/utils/eval_list.py (bulk count + default version_number per page, no N+1) and wired it into the view. Falls back to V1/1 when a template has no version rows.
  • eval-task usage reason truncation: tracer/eval-task/get_usage/ was capping the per-log eval_explanation at 200 chars and appending "..." in the row payload, so the full reason was unrecoverable client-side. Now sends the full explanation; the table can handle visual truncation. The sibling detail.results_explanation was already untruncated.
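
To illustrate the first fix, here is a minimal in-memory sketch of the bulk lookup, not the actual helper. `VersionRow`, `fetch_version_metadata_sketch`, and the `is_default` field are hypothetical names; the real fetch_version_metadata() presumably runs a query against the version table. Falling back to V1 when rows exist but none is marked default is also an assumption; the PR only specifies the fallback for templates with no version rows.

```python
from collections import defaultdict
from typing import NamedTuple


class VersionRow(NamedTuple):
    # Hypothetical stand-in for one eval-template version row.
    template_id: str
    version_number: int
    is_default: bool


def fetch_version_metadata_sketch(template_ids, rows):
    """Return {template_id: {"version_count", "current_version"}} for one page."""
    wanted = set(template_ids)
    counts = defaultdict(int)
    defaults = {}
    # One pass over all version rows for the page, mirroring a single bulk
    # query rather than one query per template (the N+1 pattern).
    for row in rows:
        if row.template_id not in wanted:
            continue
        counts[row.template_id] += 1
        if row.is_default:
            defaults[row.template_id] = row.version_number

    meta = {}
    for tid in template_ids:
        count = counts.get(tid, 0)
        if count == 0:
            # Fallback described in the PR: no version rows -> V1 / 1.
            meta[tid] = {"version_count": 1, "current_version": "V1"}
        else:
            # Assumption: use V1 if no row is flagged as default.
            default_number = defaults.get(tid, 1)
            meta[tid] = {
                "version_count": count,
                "current_version": f"V{default_number}",
            }
    return meta


rows = [
    VersionRow("tmpl-a", 1, False),
    VersionRow("tmpl-a", 2, True),
    VersionRow("tmpl-a", 3, False),
]
page_meta = fetch_version_metadata_sketch(["tmpl-a", "tmpl-b"], rows)
```

With these rows, `tmpl-a` reports three versions with V2 as current, and `tmpl-b` (no rows) takes the V1/1 fallback.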

Test plan

  • pytest futureagi/model_hub/tests/test_eval_list.py -k version -v — covers the new test_list_reflects_default_version case (V1, V2 default, V3) and the no-versions fallback
  • Hit /model-hub/eval-templates/list/ after promoting a non-V1 version to default → list should now show the new default version
  • Hit /tracer/eval-task/get_usage/ for a task with long eval explanations → logs.items[].reason should be the full string, no trailing ...
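
The last check can be sketched as a before/after comparison of the row payload. This is only an illustration: `build_log_row_old` and `build_log_row_new` are hypothetical names, and applying the cap only to strings longer than 200 characters is an assumption about the previous behaviour.

```python
def build_log_row_old(eval_explanation: str, limit: int = 200) -> dict:
    # Previous behaviour as described in the PR: cap at 200 chars and
    # append "...". Assumption: short strings were passed through as-is.
    if len(eval_explanation) > limit:
        return {"reason": eval_explanation[:limit] + "..."}
    return {"reason": eval_explanation}


def build_log_row_new(eval_explanation: str) -> dict:
    # New behaviour: send the full explanation; the table truncates visually.
    return {"reason": eval_explanation}


long_explanation = "x" * 250
old_row = build_log_row_old(long_explanation)
new_row = build_log_row_new(long_explanation)
```

Here `old_row["reason"]` is 203 characters long and ends in "...", while `new_row["reason"]` equals the original 250-character string.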

NVJKKartik added 2 commits May 5, 2026 14:14
EvalTemplateListView was hardcoding version_count=1 and current_version="V1"
for every template, so changing the default version never showed up in the
outer evals list. Adds fetch_version_metadata() helper alongside the other
list-page utilities and uses the real default version_number / count.

get_usage was capping the per-log eval_explanation at 200 chars and appending
"...", so the frontend could never render the full reason. Send the full
explanation; the table can handle visual truncation.
@khushalsonawat merged commit 2cd1ad9 into dev on May 5, 2026
6 checks passed