[ML] Store compressed model definitions in ByteReferences #71679
davidkyle merged 9 commits into elastic:feature/pytorch-inference
Conversation
Pinging @elastic/ml-core (Team:ML)
7714b2e to 6dd2c1e
Total definition length is now tracked again, as we need to know the size of PyTorch models up front.
It might be nice to have a private method BytesArray base64Encode(String) and use it throughout
OK, so this will write out the raw bytes of the GZIP. Is this what we want or do we want to run the Base64 encoder?
I thought the guarantees around base64 character sizes were one of the reasons we could skip transforming into a string?
Binary data is stored in Lucene base64-encoded.
Ah, so since the mapping is binary we get that for free.
It's handled by the Jackson JSON generator, which is used by the various XContentBuilder::value(byte[] value) methods to write bytes.
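In plain JDK terms (a sketch, not Elasticsearch or Jackson code; the class and variable names here are invented for illustration), the effect is a base64 round trip at serialization time that leaves the raw GZIP bytes intact:

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.Arrays;
import java.util.Base64;
import java.util.zip.GZIPOutputStream;

// Sketch only: the generator's byte[] handling is roughly equivalent to
// base64-encoding on write and decoding on read, so the raw compressed
// bytes survive the round trip unchanged.
public class Base64RoundTrip {
    public static void main(String[] args) throws IOException {
        byte[] raw = "model definition".getBytes(java.nio.charset.StandardCharsets.UTF_8);

        // Compress, as the model definition is before persistence
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(bos)) {
            gz.write(raw);
        }
        byte[] compressed = bos.toByteArray();

        // What happens on write: raw bytes -> base64 ASCII text
        String persisted = Base64.getEncoder().encodeToString(compressed);

        // And on read: base64 ASCII text -> the same raw bytes
        byte[] decoded = Base64.getDecoder().decode(persisted);
        System.out.println(Arrays.equals(compressed, decoded)); // true
    }
}
```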
Might be good to indicate that the length is in UTF-8 bytes.
EDIT: Well, maybe not utf-8 bytes...but bytes or something
I added the fact the size is in bytes to the message in 7a59661
The feature branch contains changes to configure PyTorch models with a TrainedModelConfig and defines a format to store the binary models. The _start and _stop deployment actions control the model lifecycle, and the model can be evaluated directly with the _infer endpoint. Two types of NLP task are supported: Named Entity Recognition and Fill Mask. The feature branch consists of these PRs: #73523, #72218, #71679, #71323, #71035, #71177, #70713.
Binary data is stored in Lucene base64-encoded, but the same data held in a Java String uses 2 bytes (UTF-16) for each base64 character, consuming twice the memory required. This change uses ByteReferences to hold the binary data, which is not base64-encoded; the encoding must take place before the bytes can be persisted, and this is performed by the XContent classes.
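The arithmetic behind that memory claim can be sketched with plain JDK classes (the 1 MB figure and class name below are illustrative, not from the PR):

```java
import java.util.Base64;

// Sketch of the memory argument: a base64 character is always ASCII, but
// the PR's point is that a Java String stores UTF-16, so holding the
// encoded form in a String costs ~2 bytes per character, while a
// ByteReference over the raw compressed bytes pays 1 byte each.
public class Base64MemoryCost {
    public static void main(String[] args) {
        byte[] compressed = new byte[1_000_000]; // pretend 1 MB compressed model

        String base64 = Base64.getEncoder().encodeToString(compressed);

        long rawBytes = compressed.length;      // 1,000,000 bytes as raw bytes
        long stringChars = base64.length();     // 1,333,336 base64 characters
        long stringBytes = stringChars * 2;     // 2,666,672 bytes as UTF-16 chars

        // Base64 expansion (4/3) times UTF-16 (2x) exceeds double the raw size
        System.out.println(stringBytes > 2 * rawBytes); // true
    }
}
```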
For BWC I've added a new field mapping, binary_definition, to .ml-inference-*, which means the index version has to be incremented. Compatibility for HLRC and REST API users is preserved as the compressed_definition field still contains the base64-encoded representation.
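For illustration only, a minimal sketch of what the new mapping entry could look like, assuming the standard Elasticsearch binary field type (the actual mapping in the PR may name and nest things differently):

```json
{
  "properties": {
    "binary_definition": {
      "type": "binary"
    }
  }
}
```

Because binary fields are stored base64-encoded in Lucene, clients reading compressed_definition still see the base64 string, while the server-side code can work with the raw bytes.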