Release v0.31.0#231
Merged
Merged
Conversation
- Bump version to 0.31.0 in version.txt and extension.yaml - Backfill CHANGELOG.md for v0.25.0 through v0.30.1 (gap since [0.24.0]) - Add v0.31.0 release notes Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Release v0.31.0
Added
.agent.md) eval support — Discover.agent.mdfiles alongsideSKILL.md, parse agent-specific frontmatter (tools,model,handoffs,mcp-servers,agents), auto-injecttool_constraintgrader from agenttools:field, complete worked example underexamples/custom-agent/, and new "Evaluating Custom Agents" docs guide (feat: support custom agent (.agent.md) file discovery and parsing #225 #226, closes feat: Support VS Code custom agent (.agent.md) evaluation #225)Fixed
_output_containsexpectations against file contents now work in CI without a real model. Mock response includes task metadata, file paths, and a 1KB content preview per resource (fix: mock engine echoes file content for CI evals (#227) #228, closes bug: Waza Evaluation CI fails on main — code-explainer mock eval returns 0% pass rate #227)waza serveno longer crashes when stdin is not a terminal — MCP stdio server only starts whenterm.IsTerminal()is true; piped input or background mode no longer kills the HTTP dashboard (fix: waza serve crashes when stdin is not a terminal #224)Changed
BenchmarkSpec→EvalSpec,TestRunner→EvalRunner. Not a breaking change for external consumers (types live ininternal/) (refactor: complete vocabulary renames — BenchmarkSpec→EvalSpec, TestRunner→EvalRunner (#166) #222)Documentation
Dependencies
Backfilled changelog entries
This release also backfills changelog entries for v0.25.0 through v0.30.1 — versions that shipped between 2026-04-21 and 2026-04-22 without changelog updates. The backfill is curated from git history per Keep-a-Changelog conventions.
Release process
After merge, push the tag to trigger the unified release workflow (
release.yml):The workflow will: build CLI binaries (6 platforms), build + pack the azd extension, create GitHub releases, publish to azd registry, and sync
registry.json.Checklist
version.txtandextension.yamlgo test ./...passesgo vet ./...cleannpm run buildin /site)Co-authored-by: Copilot 223556219+Copilot@users.noreply.github.com