Migrated from spboyer/waza#386
Summary
Map OpenAI Evals' modelgraded YAML schema to waza prompt grader config with concrete translation examples.
OpenAI Format (examples from their registry)
fact.yaml: prompt + choice_strings (ABCDE) + input_outputs
closedqa.yaml: prompt + eval_type (cot_classify) + choice_scores (Y:1.0, N:0.0)
battle.yaml: prompt + choice_scores + dual input_outputs for head-to-head comparison
Deliverable
Field-level mapping table and working translations of fact.yaml, closedqa.yaml, battle.yaml → waza eval.yaml.
Summary
Map OpenAI Evals' modelgraded YAML schema to waza
promptgrader config with concrete translation examples.OpenAI Format (examples from their registry)
fact.yaml: prompt + choice_strings (ABCDE) + input_outputs
closedqa.yaml: prompt + eval_type (cot_classify) + choice_scores (Y:1.0, N:0.0)
battle.yaml: prompt + choice_scores + dual input_outputs for head-to-head comparison
Deliverable
Field-level mapping table and working translations of fact.yaml, closedqa.yaml, battle.yaml → waza eval.yaml.
Parent: #385