
Support production models with predictor benchmark#9855

Closed
highker wants to merge 1 commit into pytorch:master from highker:export-D8942996

Conversation

@highker commented Jul 26, 2018

Summary:
Pull Request resolved: pytorch#9855

Support production models with predictor benchmark
Two new flags are added:
`--update_prod`: pull production data (netdef, input types, input dims) from Hive and store it locally
`--use_prod`: run the benchmark on the local production data with the same workload as in production.
By default, 300 models will be loaded.
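The two-step workflow described above might look like the following sketch. The binary name `predictor_bench` is a placeholder (the actual build target is not named in this PR); only the two flags come from the description, and the `run` helper just prints the command instead of executing it:

```shell
# Placeholder binary name; the real benchmark target may differ.
BENCH_BIN=./predictor_bench

# Dry-run helper: print the command that would be executed.
run() { echo "would run: $*"; }

# Step 1: fetch production data (netdef, input types, input dims)
# from Hive and cache it locally. Re-run when production models change.
run "$BENCH_BIN" --update_prod

# Step 2: benchmark against the cached data, replaying the production
# workload (300 models by default).
run "$BENCH_BIN" --use_prod
```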

production vs benchmark
avg net run time:
(collected by prod: https://fburl.com/scuba/6lb91zfx and bench: https://fburl.com/ngjj1dc8)
**prod: `408us` vs bench: `543us`**
(With prod data distribution, this should be even closer)

framework overhead (as of 2018-07-22):
prod:
```
9.111%    BlackBoxPredictor::Run
4.602%    SimpleNet::Run
2.377%    Operator::Run
1.786%    BlackBoxPredictor::AllocateMemory
1.372%    Observable::StartAllObservers
1.358%    Observable::StartObserver
1.206%    Blob::GetMutable
```

bench:
```
8.577%    BlackBoxPredictor::operator()
3.276%    SimpleNet::Run
1.954%    Operator::Run
1.697%    BlackBoxPredictor::AllocateMemory
1.477%    Tensor::ShareData
1.230%    Blob::GetMutable
1.034%    Observable::StartObserver
```
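As a quick cross-check of the two profiles, the per-function percentages can be totaled with a short script (values transcribed from the listings above; note the prod and bench profiles do not list identical function sets):

```python
# Framework-overhead percentages transcribed from the prod profile above.
prod = {
    "BlackBoxPredictor::Run": 9.111,
    "SimpleNet::Run": 4.602,
    "Operator::Run": 2.377,
    "BlackBoxPredictor::AllocateMemory": 1.786,
    "Observable::StartAllObservers": 1.372,
    "Observable::StartObserver": 1.358,
    "Blob::GetMutable": 1.206,
}

# Framework-overhead percentages transcribed from the bench profile above.
bench = {
    "BlackBoxPredictor::operator()": 8.577,
    "SimpleNet::Run": 3.276,
    "Operator::Run": 1.954,
    "BlackBoxPredictor::AllocateMemory": 1.697,
    "Tensor::ShareData": 1.477,
    "Blob::GetMutable": 1.230,
    "Observable::StartObserver": 1.034,
}

# Total measured framework overhead in each setting.
print(f"prod total:  {sum(prod.values()):.3f}%")   # 21.812%
print(f"bench total: {sum(bench.values()):.3f}%")  # 19.245%
```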

Reviewed By: yinghai

Differential Revision: D8942996

fbshipit-source-id: 38a1a9790c048fb81e92aad2b2c82a1651b11e0c
@ezyang (Contributor) commented Jul 26, 2018

@pytorchbot retest this please

@highker (Author) commented Jul 26, 2018

@pytorchbot retest this please

@highker (Author) commented Jul 27, 2018

@ezyang (Contributor) commented Jul 27, 2018

@pytorchbot retest this please

jramseyer pushed a commit to jramseyer/pytorch that referenced this pull request Jul 30, 2018
goodlux pushed a commit to goodlux/pytorch that referenced this pull request Aug 15, 2018
@ezyang ezyang added the merged label Jun 26, 2019