Simplify and Speed up some Compression Usage #60953

original-brownbear merged 3 commits into elastic:master from original-brownbear:faster-compression
Conversation
Use thread-local buffers and `Deflater`/`Inflater` instances to speed up compressing and decompressing from in-memory bytes. Not manually invoking `end()` on these should be safe, since their off-heap memory is eventually reclaimed by the finalizer thread, which should not be an issue for thread-locals that are not instantiated at a high frequency. This significantly reduces the amount of byte copying and object creation relative to the previous approach, which created a fresh temporary buffer (that was then resized multiple times during operations), copied bytes out of that buffer into a freshly allocated `byte[]`, needlessly used 4k stream buffers when working with bytes that are already in arrays (`writeTo` now handles efficient writing to the compression logic), etc. Relates #57284, which should be helped by this change to some degree. Also, I expect this change to speed up mapping/template updates a little, as those make heavy use of these code paths.
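The reuse pattern described above can be sketched roughly as follows. This is a hypothetical, self-contained illustration (class and field names are mine, not the actual Elasticsearch code): one `Deflater`/`Inflater` per thread, `reset()` between uses instead of allocating a fresh instance and calling `end()` each time.

```java
import java.util.Arrays;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

public class ReusableCompression {

    // nowrap = true produces/consumes raw DEFLATE data (no zlib header).
    private static final ThreadLocal<Deflater> DEFLATER =
        ThreadLocal.withInitial(() -> new Deflater(Deflater.DEFAULT_COMPRESSION, true));
    private static final ThreadLocal<Inflater> INFLATER =
        ThreadLocal.withInitial(() -> new Inflater(true));

    public static byte[] compress(byte[] input) {
        Deflater deflater = DEFLATER.get();
        deflater.reset();                  // reuse the per-thread instance
        deflater.setInput(input);
        deflater.finish();
        byte[] buffer = new byte[Math.max(64, input.length)];
        int total = 0;
        while (deflater.finished() == false) {
            if (total == buffer.length) {
                buffer = Arrays.copyOf(buffer, buffer.length * 2);
            }
            total += deflater.deflate(buffer, total, buffer.length - total);
        }
        return Arrays.copyOf(buffer, total);
    }

    public static byte[] decompress(byte[] compressed, int uncompressedLength) throws DataFormatException {
        Inflater inflater = INFLATER.get();
        inflater.reset();
        inflater.setInput(compressed);
        byte[] out = new byte[uncompressedLength];
        int total = 0;
        while (total < uncompressedLength && inflater.finished() == false) {
            total += inflater.inflate(out, total, out.length - total);
        }
        return out;
    }

    public static void main(String[] args) throws DataFormatException {
        byte[] original = "some mapping or template bytes".getBytes();
        byte[] roundTripped = decompress(compress(original), original.length);
        System.out.println(Arrays.equals(original, roundTripped)); // prints "true"
    }
}
```

Note that neither instance is ever `end()`-ed; as the description argues, that is acceptable here because only one instance exists per thread, so the native memory held until finalization stays bounded.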
Pinging @elastic/es-core-infra (:Core/Infra/Core)

Jenkins run elasticsearch-ci/packaging-sample-windows (known Jenkins+Windows issue)

Jenkins run elasticsearch-ci/1 (unrelated test failure #60954)
```diff
-        return Arrays.equals(uncompressed(), that.uncompressed());
+        return uncompressed().equals(uncompressed());
```
Suggested change:

```diff
-        return uncompressed().equals(uncompressed());
+        return uncompressed().equals(that.uncompressed());
```
also looking at the code above, I wonder why we don't compare crc32 first before comparing the compressed byte arrays
🤦 thanks for spotting.
> I wonder why we don't compare crc32 first before comparing the compressed byte arrays
I guess you could argue that it's really unlikely that, in the equal case (as in equal uncompressed bytes), the compressed bytes aren't actually equal, so comparing crc32 first would just add an extra int comparison in the equal case. So if we assume it's mostly the equal case here, then the current version is better; I have no clue if that's true though. Probably doesn't matter much in practice since byte array comparison in JDK9+ is blazing fast anyway via `jdk.internal.util.ArraysSupport#vectorizedMismatch`? :)
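For illustration, the crc32-first variant being discussed could look like the following. This is a hypothetical sketch (the class name and fields are mine, not the actual Elasticsearch `CompressedXContent` code): the cheap int comparison short-circuits most unequal pairs, while `Arrays.equals` settles the rest.

```java
import java.util.Arrays;
import java.util.zip.CRC32;

public class CompressedReference {
    private final byte[] compressed;
    private final int crc32;

    public CompressedReference(byte[] compressed) {
        this.compressed = compressed;
        CRC32 crc = new CRC32();
        crc.update(compressed, 0, compressed.length);
        this.crc32 = (int) crc.getValue();
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (o == null || getClass() != o.getClass()) return false;
        CompressedReference that = (CompressedReference) o;
        // Cheap int comparison first: rules out almost all unequal values
        // immediately. In the equal case it costs one extra comparison before
        // the (vectorized, on JDK 9+) Arrays.equals call.
        return crc32 == that.crc32 && Arrays.equals(compressed, that.compressed);
    }

    @Override
    public int hashCode() {
        return crc32;
    }
}
```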
> Probably doesn't matter much in practice
I agree that it doesn't matter much. Just a random thought that popped into my head :)
```java
    private static final ThreadLocal<Inflater> inflaterRef = ThreadLocal.withInitial(() -> new Inflater(true));
```
IMO it would be a good idea to add a comment about why these threadlocals are not used in the other methods of the class
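One plausible reason (an assumption on my part, worth stating in that comment): in the streaming methods the compressor's lifetime escapes the method, since the returned stream is owned by the caller, so a per-thread shared instance cannot safely back it. A minimal sketch of such a streaming path, with an illustrative class name of my own:

```java
import java.io.IOException;
import java.io.OutputStream;
import java.util.zip.Deflater;
import java.util.zip.DeflaterOutputStream;

public class StreamingCompression {

    // The returned stream escapes this method, so it must own a fresh Deflater
    // (and end() it on close) rather than borrow a shared thread-local one.
    public static OutputStream compressingStream(OutputStream out) {
        Deflater deflater = new Deflater(Deflater.DEFAULT_COMPRESSION, true);
        return new DeflaterOutputStream(out, deflater) {
            @Override
            public void close() throws IOException {
                try {
                    super.close(); // finishes the deflate stream
                } finally {
                    deflater.end(); // safe: this instance is not shared
                }
            }
        };
    }
}
```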
Thanks Jay!