Record PR time benchmark results in JSON format#140493
huydhn wants to merge 12 commits into pytorch:main from
Conversation
🔗 Helpful Links 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140493
Note: Links to docs will display an error until the docs builds have been completed. ❗ 1 Active SEV: There is 1 currently active SEV. If your PR is affected, please view it below. ✅ No Failures as of commit d361a76 with merge base 4acd56e. This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot drci
@pytorchbot drci
To ease the process of gathering the benchmark metadata before uploading to the database, I'm adding a script `.github/scripts/benchmarks/gather_metadata.py` to gather this information and pass it to the upload script. From #5839, the benchmark metadata includes the following required fields:

```
-- Metadata
`timestamp` UInt64,
`schema_version` String DEFAULT 'v3',
`name` String,
-- About the change
`repo` String DEFAULT 'pytorch/pytorch',
`head_branch` String,
`head_sha` String,
`workflow_id` UInt64,
`run_attempt` UInt32,
`job_id` UInt64,
-- The raw records on S3
`s3_path` String,
```

I'm going to test this out with the PT2 compiler instruction count benchmark at pytorch/pytorch#140493

### Testing

https://github.com/pytorch/test-infra/actions/runs/11831746632/job/32967412160?pr=5918#step:5:105 gathers the metadata and uploads the benchmark results correctly.

Also, an actual upload at https://github.com/pytorch/pytorch/actions/runs/11831781500/job/33006545698#step:24:138
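Concretely, a gather step like this could be sketched as follows, assuming the standard GitHub Actions environment variables (`GITHUB_RUN_ID`, `GITHUB_SHA`, etc.); the `BENCHMARK_NAME` variable and the function name here are illustrative, not copied from the actual script:

```python
import json
import os
import time


def gather_metadata() -> dict:
    """Hypothetical sketch: collect the required benchmark metadata
    fields from standard GitHub Actions environment variables."""
    return {
        "timestamp": int(time.time()),
        "schema_version": "v3",
        # BENCHMARK_NAME is an assumed variable, not a GHA built-in
        "name": os.environ.get("BENCHMARK_NAME", ""),
        "repo": os.environ.get("GITHUB_REPOSITORY", "pytorch/pytorch"),
        "head_branch": os.environ.get("GITHUB_HEAD_REF", ""),
        "head_sha": os.environ.get("GITHUB_SHA", ""),
        "workflow_id": int(os.environ.get("GITHUB_RUN_ID", 0)),
        "run_attempt": int(os.environ.get("GITHUB_RUN_ATTEMPT", 1)),
        "job_id": int(os.environ.get("JOB_ID", 0)),
        "s3_path": "",  # filled in later by the upload script
    }


if __name__ == "__main__":
    print(json.dumps(gather_metadata(), indent=2))
```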
@pytorchbot rebase
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here.
Successfully rebased: force-pushed from 5216bd0 to 91eed3d.
opt_m(self.input)

def _write_to_json(self, output_dir: str):
    records = []
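The body of `_write_to_json` isn't visible in the quoted hunk; here is a hedged sketch of what a method like this could do, with a hypothetical stand-in class and an assumed record shape (the `benchmark`/`model`/`metric` keys follow the general style of the OSS benchmark database schema, but are not copied from the PR):

```python
import json
import os


class BenchmarkBase:
    """Hypothetical minimal stand-in for the benchmark base class."""

    def __init__(self, name: str, metric: str, value: float):
        self.name = name
        self.metric = metric
        self.value = value

    def _write_to_json(self, output_dir: str):
        # One record per metric; the record shape is an assumption,
        # not the exact schema used by the PR.
        records = [
            {
                "benchmark": {"name": "pr_time_benchmarks"},
                "model": {"name": self.name},
                "metric": {"name": self.metric, "benchmark_values": [self.value]},
            }
        ]
        os.makedirs(output_dir, exist_ok=True)
        with open(os.path.join(output_dir, f"{self.name}.json"), "w") as f:
            json.dump(records, f, indent=2)
```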
def properties():
    device = None
    is_dynamic = None
    type = None
    backend = None
device: str,
backend: str = "",
mode: str = "",
dynamic=False,
Can we default it to None? If the user does not pass it, we do not know that it's not dynamic.
Sure, let's default this and fullgraph to None.
backend: str = "",
mode: str = "",
dynamic=False,
fullgraph=False,
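To illustrate the reviewer's point about `None` defaults: `Optional[bool] = None` lets the code distinguish "the user did not specify" from "the user explicitly passed False". A small self-contained sketch (the helper name is made up):

```python
from typing import Optional


def describe_run(dynamic: Optional[bool] = None,
                 fullgraph: Optional[bool] = None) -> str:
    """Hypothetical helper: None means 'not specified', which is
    distinguishable from an explicit False."""
    def fmt(flag: Optional[bool]) -> str:
        return "unspecified" if flag is None else str(flag)
    return f"dynamic={fmt(dynamic)}, fullgraph={fmt(fullgraph)}"
```

With a `False` default, `describe_run(dynamic=False)` and `describe_run()` would be indistinguishable; with `None` they are not.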
dynamic=False,
fullgraph=False,
):
    self._model_type = model_type
_model_type, hmm, shall we rename it to category_name maybe? Or group_name? We do not always benchmark models, so it can be confusing.
Yeah, I'm looking for a better name for this variable. category_name sounds like a good choice.
self._backend = backend
self._mode = mode  # Training or inference
self._dynamic = dynamic
self._fullgraph = fullgraph
I do not think we need _fullgraph. We do not have benchmarks that run with and without fullgraph mode, and I also do not think it would be interesting to filter on this.
self._force_shape_pad = force_shape_pad

super().__init__(
    model_type=ModuleClass.__name__,
Maybe just use "basic_modules" for the group_name?
N = 100

def __init__(self):
    super().__init__(model_type="sum_floordiv", device="cpu")
I think backend=export for this?
@pytorchbot revert -m 'I think I missed something in the workflow setup as the test is failing in non-test CI jobs' -c weird
@pytorchbot successfully started a revert job. Check the current status here.
This reverts commit 783cd9c. Reverted #140493 on behalf of https://github.com/huydhn due to I think I missed something in the workflow setup as the test is failing in non-test CI jobs ([comment](#140493 (comment)))
@huydhn your PR has been successfully reverted.
I need to get pytorch/test-infra#5946 reviewed and landed before I can reland this one. This was my miss in setting up the upload GHA workflow.
It's ok if the benchmark results directory doesn't exist, for example, on non-benchmark jobs like `doc_test` https://github.com/pytorch/pytorch/actions/runs/11925173700/job/33237580806. I missed this after landing pytorch/pytorch#140493
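A minimal sketch of the kind of guard this fix implies, assuming the upload step receives a results directory path (the function name and message are illustrative, not the actual code in test-infra):

```python
import os


def should_upload(results_dir: str) -> bool:
    """Skip the upload gracefully when the benchmark results directory
    does not exist, e.g. on non-benchmark jobs like doc_test."""
    if not os.path.isdir(results_dir):
        print(f"{results_dir} does not exist, nothing to upload")
        return False
    return True
```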
@pytorchbot merge -f 'No need to run trunk jobs, pr_time_benchmark job has passed and pytorch/test-infra#5946 has landed'
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
I'm trying to make these benchmark results available on the OSS benchmark database, so that people can query them from outside. The first step is to also record the results in a JSON format compatible with the database schema defined in pytorch/test-infra#5839. Existing CSV files remain unchanged.

### Testing

The JSON results are uploaded as artifacts to S3 https://github.com/pytorch/pytorch/actions/runs/11809725848/job/32901411180#step:26:13, for example https://gha-artifacts.s3.amazonaws.com/pytorch/pytorch/11809725848/1/artifact/test-jsons-test-pr_time_benchmarks-1-1-linux.g4dn.metal.nvidia.gpu_32901411180.zip

Pull Request resolved: pytorch#140493 Approved by: https://github.com/laithsakka
…)" This reverts commit 783cd9c. Reverted pytorch#140493 on behalf of https://github.com/huydhn due to I think I missed something in the workflow setup as the test is failing in non-test CI jobs ([comment](pytorch#140493 (comment)))
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames