[bugfix] eval list default version + eval-task reason truncation by NVJKKartik · Pull Request #229 · future-agi/future-agi

NVJKKartik · 2026-05-05T11:05:41Z

Summary

Two backend fixes bundled on this branch:

TH-4488 — eval list default version: EvalTemplateListView was hardcoding version_count=1 / current_version="V1" for every template, so changing the default version never reflected in the outer evals list. Added fetch_version_metadata() helper in model_hub/utils/eval_list.py (bulk count + default version_number per page, no N+1) and wired it into the view. Falls back to V1/1 when a template has no version rows.
eval-task usage reason truncation: tracer/eval-task/get_usage/ was capping the per-log eval_explanation at 200 chars and appending "..." in the row payload, so the full reason was unrecoverable client-side. Now sends the full explanation; the table can handle visual truncation. The sibling detail.results_explanation was already untruncated.

Test plan

pytest futureagi/model_hub/tests/test_eval_list.py -k version -v — covers the new test_list_reflects_default_version case (V1, V2 default, V3) and the no-versions fallback
Hit /model-hub/eval-templates/list/ after promoting a non-V1 version to default → list should now show the new default version
Hit /tracer/eval-task/get_usage/ for a task with long eval explanations → logs.items[].reason should be the full string, no trailing ...

EvalTemplateListView was hardcoding version_count=1 and current_version="V1" for every template, so changing the default version never showed up in the outer evals list. Adds fetch_version_metadata() helper alongside the other list-page utilities and uses the real default version_number / count.

get_usage was capping the per-log eval_explanation at 200 chars and appending "...", so the frontend could never render the full reason. Send the full explanation; the table can handle visual truncation.

NVJKKartik added 2 commits May 5, 2026 14:14

[bugfix] eval-task usage: stop truncating reason server-side

1bc1f0d

get_usage was capping the per-log eval_explanation at 200 chars and appending "...", so the frontend could never render the full reason. Send the full explanation; the table can handle visual truncation.

github-actions Bot added backend datasets tracing labels May 5, 2026

NVJKKartik mentioned this pull request May 5, 2026

[bugfix] eval list default version + eval-task reason truncation #230

Closed

3 tasks

khushalsonawat approved these changes May 5, 2026

View reviewed changes

khushalsonawat merged commit 2cd1ad9 into dev May 5, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] eval list default version + eval-task reason truncation#229

[bugfix] eval list default version + eval-task reason truncation#229
khushalsonawat merged 2 commits intodevfrom
fix/th-4488-eval-list-default-version

NVJKKartik commented May 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

NVJKKartik commented May 5, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants