[DOC] Model caching feature overview #5519
ilya-lavrenov merged 6 commits into openvinotoolkit:master from
Conversation
I'm very interested in this feature. Does the GPU plugin support the Import/Export capability?

Thanks a lot for your interest in this feature :-) Right now model caching is enabled in 'master' for the GNA (PR #4889) and Myriad (PR #4868) plugins, and is planned to be available in the next official release.
Thank you for the reply.
Is there any reason for that? Do you recommend not using the GPU plugin? Or are there any difficulties around model caching for the GPU plugin?
Of course, that is not a reason to avoid the GPU plugin :-) The plugin already has a lot of other useful features, and the GPU team has a lot of new performance improvements in the pipeline :-)
@nosovmik Thank you for the quick and kind reply.
Yes, I have a custom pose estimation model, and I want to run it on an embedded device (a robot). Since CPU resources are limited, I would like to run the model on the GPU. However, while the CPU plugin takes less than 1 second to load the model, the GPU plugin takes more than 30 seconds. So I want to cache the result of model optimization for the GPU plugin.
@ledmonster Did you get these 30 seconds for model loading with the kernels cache enabled? The Import/Export API is not implemented yet for GPU, but if you enable the CACHE_DIR option, the GPU plugin will cache the compiled OCL kernels. And since building the OCL kernels is the most expensive stage of model loading, this caching may significantly improve loading time.
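The mechanism described above is generic: key a compiled blob by a hash of the model contents, write it into the cache directory on the first load, and reuse it on later loads. A minimal Python sketch of the idea — this is not the actual GPU plugin code, and `fake_compile` is a hypothetical stand-in for the expensive OCL kernel build:

```python
import hashlib
import pickle
import tempfile
from pathlib import Path


def fake_compile(model_bytes: bytes) -> dict:
    # Hypothetical stand-in for the expensive kernel-build step.
    return {"kernels": model_bytes[::-1]}


def load_with_cache(model_bytes: bytes, cache_dir: str) -> dict:
    """Return the compiled blob, reusing a cached copy if one exists."""
    cache = Path(cache_dir)
    cache.mkdir(parents=True, exist_ok=True)
    # Key the cache entry by a hash of the model contents, so a changed
    # model never picks up a stale blob.
    key = hashlib.sha256(model_bytes).hexdigest()
    blob_file = cache / f"{key}.blob"
    if blob_file.exists():
        # Fast path: skip compilation entirely and load the cached blob.
        return pickle.loads(blob_file.read_bytes())
    # Slow path (first run): compile, then persist the blob for next time.
    compiled = fake_compile(model_bytes)
    blob_file.write_bytes(pickle.dumps(compiled))
    return compiled


with tempfile.TemporaryDirectory() as d:
    first = load_with_cache(b"my-model", d)   # compiles and writes the blob
    second = load_with_cache(b"my-model", d)  # served from the cache
    assert first == second
```

This is also why the very first load is slightly slower with caching enabled: it pays both the compile cost and the cost of writing the blob out.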
Thank you, I didn't know that. I'll try it next week. By the way, I can find that option in the API Reference, but it is not listed in the GPU plugin documentation. Is the document missing some config parameters?
By specifying the option, cache files (about 70 MB) were created on the first run, and initialization got faster on the second run. While the first run took about 40 sec, the second run took only 0.5 sec. Thank you so much!!
@andrew-zaytsev @ilya-lavrenov Can you please review this PR? Should I add someone else to this review? |
docs/IE_DG/Model_caching_overview.md
Outdated
> Please also note that the very first LoadNetwork (when the cache is not yet created) will take slightly longer to 'export' the compiled blob into a cache file
> ![caching_enabled]
> ## Even faster: use LoadNetwork(\<modelName\>)
modelName -> modelFileName? to be more specific
Renamed to modelPath as used in code snippets
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
- Moved code examples to snippets
- Added link to Model Caching overview from "Inference Engine Developer Guide"
- Few minor changes
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Docs: Model caching feature overview
* Update docs/IE_DG/Intro_to_Performance.md
  Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Apply suggestions from code review
  Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Review comments
  - Moved code examples to snippets
  - Added link to Model Caching overview from "Inference Engine Developer Guide"
  - Few minor changes
* Update docs/IE_DG/Intro_to_Performance.md
  Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>

Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>