add streaming types to ASF scaffold and APIs by thrau · Pull Request #6552 · localstack/localstack

thrau · 2022-07-29T14:57:26Z

This PR updates ASF to correctly handle smithy streaming types. This is one of the last big changes to ASF, and will unblock the S3 migration.

The changes include:

updated scaffold to generate streaming types (see details)
re-generated community APIs
updated serializer and parser to deal with streaming types
added several tests on different layers
update lambda and API gateway providers /cc @dfangl @dominikschubert
changed the MetricHandler a bit (see details below) /cc @steffyP

some details on the API

After several discussions with @alexrashed, we decided not to make the service-level shapes the streaming types, but to only type hint the members of request/response objects.

For Requests, which are set by the server, we specify typing.IO[bytes], which will essentially be set to werkzeug.Request.stream by the parser. This will require the most changes, as we need to update all implementing services that expect payload to be bytes, to use payload.read() which will be a stream.
For Responses, which are set by the developer, we specify Union[bytes, IO[bytes], Iterable[bytes], making it possible to deal with the three cases outlined in implement AWS streaming trait for ASF #6527. These three types are also supported by the werkzeug.Response object, so nothing needs to be done in the serializer, and no changes are necessary im services.

metric handler change

with the new streaming trait, ServiceRequest dictionaries can contain file-like or other IO objects that cannot be serialized. in record_parsed_request we were copying the entire request using deep copy, and that was now raising errors. on certain requests.

it seems we weren't really using the entire request, but just collecting the names of the parameters that were in the request, so i simplified the logic to just collect the parameter names rather than the entire request, which is now a harmless call to list(request.keys()).

if we at some point need the entire request in the metric collection, we'll need to revisit this change.

limitations

http request IO is still a problem. when you consume an incoming request payload, e.g., in s3 PutObject, using body.read(), you are consuming the stream underlying the http request, which is shared with werkzeug's Request object. that means, if you at a later point want to call request.data on the request object, you'll get an empty byte buffer back. although this is expected and correct behavior, it's also very inconvenient and i can see a lot of hours going into debugging of weird "why is my request empty" issues.

since quite a bit of the code base still builds on the assumption of always having access to the raw http request data, we may need to make some compromises with respect to performance and memory usage. ideally, we never read from the stream until we need it, because there's always a chance that we are going to proxy large payloads to some backend (like invoking a lambda), in which case you don't want to load everything into memory, keep it there, and then flush it to the outgoing socket again. but if you want reliable access to request.data, even though we've previously run body.read(), then we're going to have to store the stream data somewhere, and make it seekable, e.g., through TemporarySpooledFiles, which would solve the problem of loading large payloads into memory (if they exceed a certain size the file is rolled to disk, otherwise it is kept in memory), but obviously still has the overhead when proxying. 🤷

fixes #6527

coveralls · 2022-07-29T15:36:35Z

Coverage decreased (-0.01%) to 91.573% when pulling e2bc2fd on asf-streaming-api into 736e7ad on master.

github-actions · 2022-07-29T16:09:09Z

LocalStack integration with Pro

      3 files       3 suites 1h 4m 40s ⏱️
1 163 tests 1 121 ✔️ 42 💤 0 ❌
1 524 runs 1 451 ✔️ 73 💤 0 ❌

Results for commit 0756cc0.

♻️ This comment has been updated with latest results.

alexrashed

Really nice set of changes! This is basically fixing the last real big limitation we had in ASF! 🥳
I only had really small nitpicks and only on some tests.
It's cool that the parser and serializer basically didn't need to be changed.
The newly generated API seems great, and I double-checked that all method signatures that are implemented have also been adjusted.

tests/unit/aws/protocol/test_serializer.py

tests/integration/test_moto.py

thrau · 2022-08-03T14:27:47Z

tested merge with #6583 and seems to work fine

steffyP

metric changes look good! 😄 thank you!

thrau requested a review from alexrashed as a code owner July 29, 2022 14:57

thrau temporarily deployed to localstack-ext-tests July 29, 2022 14:57 Inactive

thrau force-pushed the asf-streaming-api branch from bfe6bad to 8cd4e0e Compare July 29, 2022 14:59

thrau temporarily deployed to localstack-ext-tests July 29, 2022 15:00 Inactive

thrau force-pushed the asf-streaming-api branch from 8cd4e0e to 5bb8dfa Compare July 29, 2022 16:38

thrau temporarily deployed to localstack-ext-tests July 29, 2022 16:38 Inactive

thrau requested review from dfangl and dominikschubert as code owners July 29, 2022 19:47

thrau temporarily deployed to localstack-ext-tests July 29, 2022 19:47 Inactive

thrau temporarily deployed to localstack-ext-tests July 29, 2022 20:03 Inactive

thrau marked this pull request as draft July 29, 2022 20:09

thrau temporarily deployed to localstack-ext-tests July 29, 2022 20:24 Inactive

thrau requested a review from steffyP July 29, 2022 20:40

thrau force-pushed the asf-streaming-api branch from 7b61188 to 743b056 Compare July 29, 2022 20:53

thrau temporarily deployed to localstack-ext-tests July 29, 2022 20:54 Inactive

thrau temporarily deployed to localstack-ext-tests July 29, 2022 23:44 Inactive

thrau force-pushed the asf-streaming-api branch from e6f111f to b826826 Compare July 29, 2022 23:59

thrau temporarily deployed to localstack-ext-tests July 30, 2022 00:00 Inactive

thrau force-pushed the asf-streaming-api branch from b826826 to 61e3336 Compare July 30, 2022 13:11

thrau temporarily deployed to localstack-ext-tests July 30, 2022 13:11 Inactive

thrau marked this pull request as ready for review July 30, 2022 19:29

thrau temporarily deployed to localstack-ext-tests July 30, 2022 20:12 Inactive

alexrashed approved these changes Aug 1, 2022

View reviewed changes

alexrashed mentioned this pull request Aug 1, 2022

Update ASF APIs #6563

Closed

thrau force-pushed the asf-streaming-api branch from e2bc2fd to ddd3fbf Compare August 3, 2022 14:26

thrau temporarily deployed to localstack-ext-tests August 3, 2022 14:26 Inactive

update scaffold to generate streaming types

8fe5107

thrau added 7 commits August 3, 2022 16:55

update scaffold to generate streaming types

8fe5107

regenerate api specs with streaming types

50e2efe

implement streaming trait in ASF

28d4ff5

add tracing for itest-lambda-provider

e284eb6

update MetricHandler to not deep-copy service request objects

a7b825a

encapsulate create_input_stream method in parser

daeb797

parameterize tests

0756cc0

thrau force-pushed the asf-streaming-api branch from ddd3fbf to 0756cc0 Compare August 3, 2022 14:55

thrau temporarily deployed to localstack-ext-tests August 3, 2022 14:56 Inactive

steffyP approved these changes Aug 3, 2022

View reviewed changes

thrau merged commit 33ad9a8 into master Aug 3, 2022

thrau deleted the asf-streaming-api branch August 3, 2022 17:49

localstack locked and limited conversation to collaborators Aug 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add streaming types to ASF scaffold and APIs#6552

add streaming types to ASF scaffold and APIs#6552
thrau merged 7 commits intomasterfrom
asf-streaming-api

thrau commented Jul 29, 2022 •

edited

Loading

Uh oh!

coveralls commented Jul 29, 2022 •

edited

Loading

Uh oh!

github-actions bot commented Jul 29, 2022 •

edited

Loading

Uh oh!

alexrashed left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thrau commented Aug 3, 2022

Uh oh!

steffyP left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

thrau commented Jul 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

some details on the API

metric handler change

limitations

Uh oh!

coveralls commented Jul 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

LocalStack integration with Pro

Uh oh!

alexrashed left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thrau commented Aug 3, 2022

Uh oh!

steffyP left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

thrau commented Jul 29, 2022 •

edited

Loading

coveralls commented Jul 29, 2022 •

edited

Loading

github-actions bot commented Jul 29, 2022 •

edited

Loading