[AMD][CI] Add GLM-5-MXFP4 accuracy and perf nightly tests for MI35x#21773
[AMD][CI] Add GLM-5-MXFP4 accuracy and perf nightly tests for MI35x#21773
Conversation
There was a problem hiding this comment.
Code Review
This pull request adds GSM8K accuracy evaluation and performance benchmarking scripts for the GLM-5-MXFP4 model on AMD MI35x GPUs. The review feedback suggests moving module-level environment variable configurations to setUpClass or passing them directly to the runner to avoid side effects. Other improvements include replacing ast.literal_eval with int() for more robust numerical parsing and adding a safety check for zero division when calculating Inter-Token Latency (ITL).
cb68ea7 to
899c460
Compare
ef0fc1b to
4cd9139
Compare
14ab3b2 to
8d43fd8
Compare
|
Addressed @1am9trash's review: added |
a3b92d6 to
81524b0
Compare
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
81524b0 to
b7c261c
Compare
b7c261c to
cfc8a76
Compare
HaiShaw
left a comment
There was a problem hiding this comment.
@michaelzhang-ai check comment.
|
@HaiShaw This follows the existing pattern used across all AMD perf tests ( |
874fa67 to
4542d8f
Compare
851996e to
2c9b9b5
Compare
Add nightly CI tests for amd/GLM-5-MXFP4 (Quark MXFP4 quantized) on MI35x GPUs with accuracy (GSM8K, threshold 0.90) and performance (bench_one_batch, 1024 in / 1024 out) benchmarks. Test files: - test/registered/amd/accuracy/mi35x/test_glm5_mxfp4_eval_mi35x.py - test/registered/amd/perf/mi35x/test_glm5_mxfp4_perf_mi35x.py Note: Workflow entries and engine fixes already merged via earlier PRs.
2c9b9b5 to
206b3d3
Compare
|
@amd-bot ci-status |
|
@HaiShaw All 12 errors are pre-existing AMD CI issues, none related to this PR:
The 4 warnings are all Our PR only adds 2 test files and modifies 2 workflow YAML files. It doesn't change any engine code, model code, or existing tests. |
CI Status for PR #21773PR: [AMD][CI] Add GLM-5-MXFP4 accuracy and perf nightly tests for MI35x This PR only modifies nightly workflow definitions and adds new nightly test files. It does not change any runtime code, PR CI test files, or test infrastructure. None of the CI failures below are related to this PR. AMD CI: 10 failures (0 likely related) | Others: 9 failures (0 related) AMD CI Failures
Other CI Failures
DetailsAll 19 failures are unrelated to this PR. This PR only modifies nightly AMD workflow definitions (adding GLM-5-MXFP4 test jobs, removing old GLM-5 jobs, reorganizing GLM-5.1 jobs) and adds two new nightly test files. None of the changed files are executed during PR CI. The failures fall into these pre-existing categories:
Verdict: No action needed from the PR author. All failures are pre-existing infrastructure or flaky test issues on
|
Summary
Model Details
Files Changed (4 files, +528/-130)
Test Runs
Test Plan