Fix SocketAsyncEventArgs' handling of ExecutionContext by stephentoub · Pull Request #30712 · dotnet/corefx

stephentoub · 2018-06-28T01:14:41Z

SocketAsyncEventArgs has a few issues with ExecutionContext, presumably stemming from the fact that capturing ExecutionContext in .NET Framework is not a cheap operation. As a result, when this code was written, it was optimized to avoid calls to ExecutionContext.Capture: the SAEA holds onto a captured ExecutionContext for as long as possible, re-capturing only when the SAEA is used with a different socket instance or when an event handler is changed. That has several problems. First, it largely violates the purpose of ExecutionContext, which is to flow information from the point where the async operation begins to the continuation/callback. If the context is captured only when the Socket or handler changes, it isn't tied to the location where the async operation begins, so data such as that in an AsyncLocal doesn't properly flow across the async point. Second, because a SocketAsyncEventArgs exists precisely so that it can be cached and reused, it can keep state in an ExecutionContext alive well beyond when it should be kept alive: it holds onto the ExecutionContext instance until either the Socket or handler is changed.

This commit fixes this behavior. Since ExecutionContext.Capture in .NET Core is relatively cheap (no allocation, primarily just a ThreadStatic access), we now just always capture the context when starting an operation, and then clear it out when completing the operation.
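The difference between the two capture strategies can be sketched as follows. This is a minimal illustration, not the actual SocketAsyncEventArgs code: `ReusableOperation`, `Demo`, `RequestId`, and the `captureOnEveryStart` switch are all hypothetical names introduced here, and the sketch assumes .NET Core or later.

```csharp
using System;
using System.Threading;

// Hypothetical stand-in for a reusable operation object; NOT the real SocketAsyncEventArgs.
sealed class ReusableOperation
{
    private readonly bool _captureOnEveryStart;
    private ExecutionContext _context;

    public ReusableOperation(bool captureOnEveryStart) => _captureOnEveryStart = captureOnEveryStart;

    public bool HoldsContext => _context != null;

    // Called at the point where the async operation begins.
    public void Start()
    {
        // Old strategy: capture once and keep it. New strategy: capture on every start.
        if (_captureOnEveryStart || _context == null)
        {
            _context = ExecutionContext.Capture();
        }
    }

    // Synchronous completion: no callback is invoked, but the context must still be dropped.
    public void CompleteSynchronously()
    {
        if (_captureOnEveryStart)
        {
            _context = null;
        }
    }

    // Asynchronous completion: run the callback under the captured context, then drop it.
    public void Complete(Action callback)
    {
        ExecutionContext context = _context;
        if (_captureOnEveryStart)
        {
            _context = null;
        }

        if (context == null)
        {
            callback(); // flow was suppressed, so there is nothing to restore
        }
        else
        {
            ExecutionContext.Run(context, state => ((Action)state)(), callback);
        }
    }
}

static class Demo
{
    public static readonly AsyncLocal<string> RequestId = new AsyncLocal<string>();

    // Reuses one instance for two operations and returns what the second callback observes.
    public static string SecondCallbackSees(ReusableOperation op)
    {
        string seen = null;

        RequestId.Value = "first";
        op.Start();
        op.CompleteSynchronously();

        RequestId.Value = "second";
        op.Start();
        op.Complete(() => seen = RequestId.Value);

        return seen;
    }

    static void Main()
    {
        var stale = new ReusableOperation(captureOnEveryStart: false);
        Console.WriteLine(SecondCallbackSees(stale)); // prints "first"
        Console.WriteLine(stale.HoldsContext);        // prints "True"

        var fresh = new ReusableOperation(captureOnEveryStart: true);
        Console.WriteLine(SecondCallbackSees(fresh)); // prints "second"
        Console.WriteLine(fresh.HoldsContext);        // prints "False"
    }
}
```

With the capture-once strategy the second callback observes the AsyncLocal value from the first operation and the instance keeps referencing that context afterwards. Capturing on every start and clearing on completion avoids both problems.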

Fixes #30670
Depends on dotnet/coreclr#18670 (the new SocketsHttpHandler test won't pass until that's merged and consumed into corefx)

cc: @geoffkizer, @davidsh, @kouvel

smbecker · 2018-06-29T19:12:54Z

Any chance that this will be back-ported to release/2.1? Or release/2.2 at the very least?

stephentoub · 2018-06-29T20:13:00Z

> Any chance that this will be back-ported to release/2.1? Or release/2.2 at the very least?

I don't think this would make the bar for 2.1, though @danmosemsft can correct me if I'm wrong.

@karelz, can you comment on 2.2?

karelz · 2018-06-30T19:17:39Z

What is the impact of the bug? Which scenarios are impacted? How bad is it? How common are the scenarios?
The bug bar for 2.2 is not clear yet. @danmosemsft likely knows more at this point.

stephentoub · 2018-06-30T20:57:19Z

> What is the impact of the bug?

If there are objects in AsyncLocals when you make an HTTP request that requires opening a new HTTP connection, those objects may be kept alive until the next time that same pooled state in the connection manager is reused to open another connection.
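The retention described here follows from the fact that a captured ExecutionContext is a snapshot that references whatever the AsyncLocals held at capture time. A minimal sketch, assuming .NET Core or later; `RetentionDemo`, `Payload`, and `s_cachedContext` are hypothetical names standing in for the pooled state, not the real connection manager:

```csharp
using System;
using System.Threading;

static class RetentionDemo
{
    static readonly AsyncLocal<byte[]> Payload = new AsyncLocal<byte[]>();

    // Hypothetical stand-in for pooled state that outlives a single request.
    static ExecutionContext s_cachedContext;

    public static bool CachedContextStillReachesPayload()
    {
        var buffer = new byte[1024];
        Payload.Value = buffer;

        // The snapshot taken here includes the current AsyncLocal value.
        s_cachedContext = ExecutionContext.Capture();

        // The caller is done with the buffer and clears its own AsyncLocal...
        Payload.Value = null;

        // ...but the cached snapshot still references it.
        byte[] seen = null;
        ExecutionContext.Run(s_cachedContext, _ => seen = Payload.Value, null);
        return ReferenceEquals(seen, buffer);
    }

    static void Main()
    {
        Console.WriteLine(CachedContextStillReachesPayload()); // prints "True"
    }
}
```

As long as the cached field keeps pointing at the snapshot, the garbage collector cannot reclaim the buffer, even though the code that created it no longer references it.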

Fully addressing it also requires dotnet/coreclr#18670.

smbecker · 2018-07-01T01:10:48Z

In our case, it caused a pretty severe memory leak that crashed our servers since we are opening connections to a variety of sources all the time. However, for now we are working around it using https://github.com/dotnet/corefx/issues/30670#issuecomment-400776344.

smbecker · 2018-07-01T01:12:50Z

Another workaround is to disable the SocketsHttpHandler using environment variables, but that is not very desirable.

geoffkizer · 2018-07-01T01:32:34Z

src/System.Net.Sockets/src/System/Net/Sockets/SocketAsyncEventArgs.cs

            FinishOperationSyncFailure(socketError, bytesTransferred, flags);

-            if (_context == null)
+            if (context == null)


Since this pattern is repeated a couple times, couldn't we encapsulate it into something like "InvokeCompletion"?

And couldn't we move the clearing of the execution context there?

> And couldn't we move the clearing of the execution context there?

We need to clear it even on synchronous completion, in which case we're not invoking a callback.

stephentoub · 2018-07-01T02:29:40Z

> it caused a pretty severe memory leak that crashed our servers since we are opening connections to a variety of sources all the time

This is surprising to me. We only cache Environment.ProcessorCount event arg instances, so at worst I'd expect that many ExecutionContexts to be kept alive beyond when they could otherwise be collected. Can you elaborate on how this resulted in such a severe leak?

benaadams · 2018-08-27T11:31:51Z

Requested backport to 2.2 https://github.com/dotnet/corefx/issues/31969#issuecomment-416197153

Commit migrated from dotnet/corefx@851a53b

stephentoub added the area-System.Net.Sockets label Jun 28, 2018

stephentoub added this to the 3.0 milestone Jun 28, 2018

stephentoub self-assigned this Jun 28, 2018

geoffkizer reviewed Jul 1, 2018

geoffkizer approved these changes Jul 2, 2018

stephentoub merged commit 851a53b into dotnet:master Jul 3, 2018

stephentoub deleted the socketsecflow branch July 3, 2018 17:21
