Fix for incorrect values from CompositeByteBuf#component(int) by njhill · Pull Request #9416 · netty/netty

njhill · 2019-07-31T22:09:32Z

Motivation

The way that Components are added/stored in CompositeByteBuf was changed in #8437 to minimize slicing and runtime access indirection, with the obvious intention to preserve the effective behaviour of public methods.

@jingene discovered a discrepancy, reported in #9398, where the component(int) and componentAtOffset(int) methods can return an incorrect ByteBuf, i.e. one not equivalent to a slice of the added buffer at the time it was added (the previous behaviour). This can happen in particular if the added ByteBuf is a wrapped slice, e.g. a leak-aware or unreleasable buffer.

Upon further scrutiny, another subtle deviation was also noticed: when the added ByteBuf is a slice whose readable region does not cover the whole buffer, internalComponent(int) returns the original slice (with capacity != readableBytes) rather than a sliced slice.
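
To make the "capacity != readableBytes" deviation concrete, here is a minimal sketch by analogy. It uses java.nio.ByteBuffer rather than Netty's ByteBuf, with remaining() standing in for readableBytes(). The class and method names are hypothetical, and nothing here is taken from the CompositeByteBuf source.

```java
import java.nio.ByteBuffer;

// Analogy only: java.nio.ByteBuffer stands in for Netty's ByteBuf, with
// remaining() playing the role of readableBytes().
final class ComponentContractSketch {

    // A buffer of capacity 16 whose readable region is only bytes [4, 10).
    static ByteBuffer partiallyReadable() {
        ByteBuffer buf = ByteBuffer.allocate(16);
        buf.position(4);
        buf.limit(10);
        return buf;
    }

    // Expected result: a view covering exactly the readable region,
    // so capacity() == remaining().
    static ByteBuffer expectedComponent(ByteBuffer added) {
        return added.slice();
    }

    // The deviation: a view that still exposes the whole underlying
    // capacity, so capacity() != remaining().
    static ByteBuffer deviatingComponent(ByteBuffer added) {
        return added.duplicate();
    }
}
```

A caller that relies on the returned component's capacity matching its readable bytes would see different values in the two cases, which is the kind of behavioural difference described above.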

Modifications

Unfortunately, fixing this robustly required slightly more invasive changes than first expected.

A final ByteBuf srcBuf field has been added to the Component class, which always holds the originally-added buffer. This simplifies some of the existing fragile logic where we need access to this (e.g. when releasing).
Component has been made abstract with two final impls. Which of these is used depends on whether the srcBuf is already a slice of the component's range (or equivalent to one).
Correctly handle the various places where access to the pre-unwrapped buffer is important, including the problematic component(int) methods.
Extend the pre-unwrapping optimization to cover WrappedByteBufs, SwappedByteBufs, and duplicates in addition to slices.
Unit test for the original bug, provided by @jingene.
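
The shape of the Component split listed above can be sketched roughly as follows. This is a hypothetical model built on java.nio.ByteBuffer, not the code in this PR: the class names, the `view()` method and the `offset`/`length` fields are all assumptions made for illustration. The only parts taken from the description are the srcBuf field and the choice between two final implementations.

```java
import java.nio.ByteBuffer;

// Hypothetical model; names and fields are illustrative, not Netty's.
abstract class ComponentSketch {
    // Always the originally-added buffer (e.g. the one to release later).
    final ByteBuffer srcBuf;

    ComponentSketch(ByteBuffer srcBuf) {
        this.srcBuf = srcBuf;
    }

    // A view covering exactly this component's range.
    abstract ByteBuffer view();

    // Pick the implementation based on whether srcBuf already covers
    // exactly the component's range.
    static ComponentSketch of(ByteBuffer added) {
        boolean coversWholeBuffer =
                added.position() == 0 && added.limit() == added.capacity();
        return coversWholeBuffer
                ? new WholeBufferComponent(added)
                : new PartialBufferComponent(added);
    }
}

final class WholeBufferComponent extends ComponentSketch {
    WholeBufferComponent(ByteBuffer srcBuf) {
        super(srcBuf);
    }

    @Override
    ByteBuffer view() {
        ByteBuffer d = srcBuf.duplicate();
        d.clear(); // position = 0, limit = capacity; contents untouched
        return d;
    }
}

final class PartialBufferComponent extends ComponentSketch {
    // Range captured at add time, so later index changes on srcBuf
    // do not alter what this component exposes.
    private final int offset;
    private final int length;

    PartialBufferComponent(ByteBuffer srcBuf) {
        super(srcBuf);
        this.offset = srcBuf.position();
        this.length = srcBuf.remaining();
    }

    @Override
    ByteBuffer view() {
        ByteBuffer d = srcBuf.duplicate();
        d.clear();
        d.position(offset);
        d.limit(offset + length);
        return d.slice();
    }
}
```

In this model only the partial variant carries extra state, which mirrors the memory trade-off of having additional fields in just one of the two subtypes.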

Result

Correct behaviour of the (internal)component(AtOffset) methods in all cases
Reduced indirection when accessing components that were added from more kinds of ByteBufs (duplicates in particular)
Less fragile logic related to lazy-slicing components

I don't expect any noticeable performance impact from splitting Component into two subclasses, since most of the methods are still final in the superclass and the ones that aren't will benefit from bimorphic inlining. I will aim to benchmark nonetheless.

There could be a slight increase in memory use due to two additional fields in one of the two Component subtypes, but I think this is needed for correctness and is still much better than slicing components unconditionally when adding them.

Co-authored-by: jingene <jingene0206@gmail.com>

netty-bot · 2019-07-31T22:09:48Z

Can one of the admins verify this patch?

njhill · 2019-08-03T04:23:52Z

Performance comparison looks good

jingene · 2019-08-05T01:48:22Z

@njhill Reviewed. Thank you for your patch!

jingene · 2019-08-06T02:55:19Z

@njhill I'm sorry but... when can I use this patch?

njhill · 2019-08-06T05:03:24Z

@jingene it still needs additional review but hopefully will be in the next release 4.1.39.Final. I don't have an ETA for that though sorry, and it likely depends on whether @normanmaurer decides to come back from vacation or retire on the beach.

normanmaurer · 2019-08-07T07:54:04Z

@netty-bot test this please

normanmaurer · 2019-08-14T11:40:40Z

@njhill I wonder if all the complexity is really worth it, and whether we should instead remove some optimisations to make the code easier to reason about...

njhill · 2019-08-14T14:58:00Z

@normanmaurer sure I'll have a go at that

normanmaurer · 2019-08-19T07:27:02Z

@njhill let me know once there is something I should review / re-review.

njhill · 2019-08-19T17:36:16Z

@normanmaurer sure, hopefully I'll get to it in the next day or 2

jingene · 2019-08-27T06:36:29Z

@njhill ping..

njhill · 2019-08-27T17:05:19Z

@jingene apologies, I'm juggling a bunch of things; will try to do it today. Would definitely like this to get into the next release.


njhill · 2019-08-28T01:39:07Z

@normanmaurer I have pushed a commit with some simplification, that hopefully makes things a little clearer.

Unfortunately most of the changes are for correctness rather than performance. For example, the slice cache must be volatile or there could be thread-safety issues if multiple threads iterate over the components and/or call component(int) concurrently. An alternative could be to revert to always slicing up-front, but that means a lot more unnecessary allocations.

In either case I do think it makes sense to deal differently with added buffers whose entire capacity is readable since these would be quite common and don't need to be sliced at any stage.

normanmaurer · 2019-08-28T04:40:38Z

@netty-bot Test this please

normanmaurer · 2019-08-28T07:50:45Z

@njhill I see... sure, saving all the allocations is quite nice, but looking at the code it seems like a maintenance nightmare which is just waiting to break once we change some internals :/

njhill · 2019-08-28T19:24:19Z

@normanmaurer that's a fair comment! Are you referring to this PR or the current CompositeByteBuf impl in general? If the former I wonder if you could elaborate on which aspects in particular are of concern. I'm assuming it's the new Component subclass, which we can get rid of but I'm actually not sure how much simpler the resulting code would be, and we would need to choose between:

Unconditional upfront slices of components when adding them, despite not being needed in most cases
Slice every time if/when components are accessed directly. This would disadvantage cases where components of a particular CBB are accessed repeatedly, though maybe that's rare and thus the best option?
Retain lazy slice-caching behaviour similar to the existing (unsafe) logic, but introduce a volatile read every time components are accessed directly (including internalComponent)
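
For comparison, the three options above can be modelled with a small sketch that counts how many slices each strategy creates. It is a toy model on java.nio.ByteBuffer under stated assumptions: the class names and the `slicesCreated` counter are hypothetical, and the returned views are shared rather than defensively copied. It is not how CompositeByteBuf is implemented.

```java
import java.nio.ByteBuffer;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical model of the three options; not Netty code.
abstract class SliceStrategy {
    final AtomicInteger slicesCreated = new AtomicInteger();
    private final ByteBuffer src;

    SliceStrategy(ByteBuffer added) {
        this.src = added.duplicate(); // private copy of the indices at add time
    }

    final ByteBuffer newSlice() {
        slicesCreated.incrementAndGet();
        return src.slice();
    }

    abstract ByteBuffer component();
}

// Option 1: slice unconditionally when the buffer is added.
final class EagerSlice extends SliceStrategy {
    private final ByteBuffer slice;

    EagerSlice(ByteBuffer added) {
        super(added);
        this.slice = newSlice();
    }

    @Override
    ByteBuffer component() {
        return slice;
    }
}

// Option 2: slice on every direct access.
final class PerAccessSlice extends SliceStrategy {
    PerAccessSlice(ByteBuffer added) {
        super(added);
    }

    @Override
    ByteBuffer component() {
        return newSlice();
    }
}

// Option 3: slice lazily and cache the result in a volatile field.
final class LazyCachedSlice extends SliceStrategy {
    private volatile ByteBuffer cached;

    LazyCachedSlice(ByteBuffer added) {
        super(added);
    }

    @Override
    ByteBuffer component() {
        ByteBuffer s = cached; // volatile read on every access
        if (s == null) {
            // Racing threads may each create a slice; the slices are
            // equivalent, and the volatile write publishes one safely.
            s = newSlice();
            cached = s;
        }
        return s;
    }
}
```

With three single-threaded accesses per strategy, the eager and lazy variants each create one slice and the per-access variant creates three; the lazy variant creates none if the component is never accessed directly.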

njhill · 2019-08-30T02:53:48Z

@normanmaurer @jingene I've opened #9525 in place of this, PTAL!

@jingene

Motivation

This is a "simpler" alternative to netty#9416 which fixes the same CompositeByteBuf bugs described there, originally reported by @jingene in netty#9398.

Modifications

- Add fields to Component class for the original buffer along with its adjustment, which may be different to the already-stored unwrapped buffer. Use it in appropriate places to ensure correctness and equivalent behaviour to that prior to the earlier optimizations
- Add comments explaining purpose of each of the Component fields
- Unwrap more kinds of buffers in newComponent method to extend scope of the existing indirection-reduction optimization
- De-duplicate common buffer consolidation logic
- Unit test for the original bug provided by @jingene

Result

- Correct behaviour / fixed bugs
- Some code deduplication / simplification
- Unwrapping optimization applied to more types of buffers

The downside is increased mem footprint from the two new fields, and additional allocations in some specific cases, though those should be rare.

Co-authored-by: jingene <jingene0206@gmail.com>


njhill added the defect label Jul 31, 2019

njhill mentioned this pull request Jul 31, 2019

CompositeByteBuf#newComponent() failed to resolve source index #9398

Closed

njhill mentioned this pull request Aug 21, 2019

Simplify EventLoop abstractions for timed scheduled tasks #9470

Merged

njhill and others added 2 commits August 27, 2019 16:54

Attempt to simplify

434b6e6

njhill force-pushed the cbb-component-fix branch from 77ecc66 to 434b6e6 Compare August 28, 2019 01:24

njhill mentioned this pull request Aug 30, 2019

Fix for incorrect values from CompositeByteBuf#component(int) #9525

Merged

njhill closed this Aug 30, 2019

njhill deleted the cbb-component-fix branch September 4, 2019 04:08
