ratelimit: new per descriptor hits-addend support and dynamic hits addend by wbpcode · Pull Request #37567 · envoyproxy/envoy

wbpcode · 2024-12-09T04:20:38Z

Commit Message: api: new per descriptor hits-addend support and dynamic hits addend
Additional Description:

Currently, a custom hits_addend can be read from envoy.ratelimit.hits_addend. However, if multiple rate limit filters require custom hits_addend values, the single envoy.ratelimit.hits_addend cannot meet the requirement.
We also cannot support different hits_addend values for different descriptors in the same request.

This API change tries to meet the above two requirements.

Risk Level: low.
Testing: n/a.
Docs Changes: n/a.
Release Notes: n/a.
Platform Specific Features: n/a.

repokitteh-read-only · 2024-12-09T04:20:48Z

CC @envoyproxy/api-shepherds: Your approval is needed for changes made to (api/envoy/|docs/root/api-docs/).
envoyproxy/api-shepherds assignee is @abeyad
CC @envoyproxy/api-watchers: FYI only for changes made to (api/envoy/|docs/root/api-docs/).

abeyad

@wbpcode should we add the implementation using this API to the PR so we can see how it'll be used?

abeyad · 2024-12-09T20:42:15Z

api/envoy/config/route/v3/route_components.proto

+    // For example, the ``%BYTES_RECEIVED%`` format string will be replaced with the number of bytes
+    // received in the request.
+    //
+    // Only one of the ``number`` or ``format`` fields can be set.


what will happen if both are set?

These are oneof semantics (although oneof is not recommended by our style guide).

If both are set, the configuration will be rejected.
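A minimal sketch of that rejection rule, for illustration only. `HitsAddendConfig` and `validateHitsAddend` are hypothetical stand-ins and not the generated proto type or Envoy's actual validation code:

```cpp
#include <cstdint>
#include <optional>
#include <string>

// Hypothetical stand-in for the hits addend configuration message.
struct HitsAddendConfig {
  std::optional<uint64_t> number; // fixed number of hits to add
  std::string format;             // substitution format string, empty when unset
};

// Returns an error message when the configuration must be rejected,
// or std::nullopt when it is acceptable.
std::optional<std::string> validateHitsAddend(const HitsAddendConfig& config) {
  const bool has_number = config.number.has_value();
  const bool has_format = !config.format.empty();
  if (has_number && has_format) {
    return "only one of 'number' or 'format' can be set";
  }
  return std::nullopt;
}
```

Running a check like this at configuration load time means a conflicting configuration never reaches the request path.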

abeyad · 2024-12-09T20:52:17Z

api/envoy/extensions/common/ratelimit/v3/ratelimit.proto

+  // Optional hits_addend for the rate limit descriptor. If set the value will override the
+  // request level hits_addend.
+  // [#not-implemented-hide:]
+  google.protobuf.UInt32Value hits_addend = 3;


In the request level, it can be an integer or a format string. How come here it is only an integer?

The format string is used to extract a number based on the request and stream info. See the comment on the format field:

// Substitution format string to extract the number of hits to add to the rate limit descriptor.
// The same :ref:`format specifier <config_access_log_format>` as used for
// :ref:`HTTP access logging <config_access_log>` applies here.
//
// .. note::
//   The format string must contain only single valid substitution field that will be replaced
//   with a non-negative number.
//
// For example, the ``%BYTES_RECEIVED%`` format string will be replaced with the number of bytes
// received in the request.
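To illustrate the "non-negative number" requirement, here is a rough sketch of turning an already-substituted value into a hits addend. `parseHitsAddend` is a hypothetical helper; it assumes the substitution result arrives as text, which may not match how the filter receives it:

```cpp
#include <charconv>
#include <cstdint>
#include <optional>
#include <string_view>
#include <system_error>

// Hypothetical helper: parses the text produced by the substitution field.
// Returns std::nullopt unless the whole input is a non-negative integer.
std::optional<uint64_t> parseHitsAddend(std::string_view formatted) {
  if (formatted.empty()) {
    return std::nullopt;
  }
  uint64_t value = 0;
  const char* begin = formatted.data();
  const char* end = begin + formatted.size();
  const auto [ptr, ec] = std::from_chars(begin, end, value);
  // Reject parse errors (including a leading '-') and trailing garbage.
  if (ec != std::errc() || ptr != end) {
    return std::nullopt;
  }
  return value;
}
```

With this sketch, `parseHitsAddend("1024")` yields 1024, while `"-5"`, `"12abc"`, and an empty string all yield no value.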

wbpcode · 2024-12-10T02:35:19Z

@wbpcode should we add the implementation using this API to the PR so we can see how it'll be used?

SGTM.

wbpcode · 2024-12-10T14:38:07Z

cc @abeyad I have completed the local rate limit version of the per-descriptor custom hits addend support.

When a request arrives, we generate a list of descriptors based on the configuration. If hits_addend is configured, the filter also generates a dynamic hits_addend value based on the request info and configuration.

When a descriptor matches a rule, the previous implementation always used the fixed value 1 as the hits addend. The new implementation uses the custom hits_addend generated in the previous step.
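The fallback described above can be sketched as follows. `PopulatedDescriptor` and `tokensToConsume` are hypothetical names used only for this illustration:

```cpp
#include <cstdint>
#include <optional>

// Hypothetical, simplified view of a descriptor generated for one request.
struct PopulatedDescriptor {
  // Set only when hits_addend is configured and could be evaluated.
  std::optional<uint64_t> hits_addend;
};

// Number of tokens to take from the matched rule's bucket:
// the custom addend when present, otherwise the previous fixed value 1.
uint64_t tokensToConsume(const PopulatedDescriptor& descriptor) {
  return descriptor.hits_addend.value_or(1);
}
```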

wbpcode · 2024-12-10T14:55:15Z

envoy/ratelimit/ratelimit.h

+/**
+ * A single rate limit request descriptor. See ratelimit.proto.
+ * This is generated from the request based on the configured rate limit actions.
+ */
+struct Descriptor : public DescriptorBase {
+  absl::optional<RateLimitOverride> limit_ = absl::nullopt;
+  absl::optional<uint32_t> hits_addend_ = absl::nullopt;
+};
+
+using LocalDescriptor = DescriptorBase;


In the previous implementation, the class LocalDescriptor had two usages:

as the key of the rate limit rules, where every descriptor is associated with a token bucket.

as the output of descriptor population in the local rate limit filter, where it is used to find a matching rate limit rule.

To support a per-descriptor hits_addend, however, we need a new class to represent the output of descriptor population. We chose to enhance and re-use the Descriptor class. (The Descriptor class is the output of the global rate limit filter. Re-using it should also simplify adding similar support to the global rate limit filter later.)
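A simplified sketch of that split, assuming a rule table keyed by the base type. The field names, `Rule`, and `findRule` are hypothetical and do not mirror the real definitions in envoy/ratelimit/ratelimit.h:

```cpp
#include <cstdint>
#include <map>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical simplified base: just the key/value entries.
struct DescriptorBase {
  std::vector<std::pair<std::string, std::string>> entries;
};

// Ordering so that the base type can serve as a map key.
inline bool operator<(const DescriptorBase& lhs, const DescriptorBase& rhs) {
  return lhs.entries < rhs.entries;
}

// Output of descriptor population: the entries plus per-descriptor extras.
struct Descriptor : public DescriptorBase {
  std::optional<uint64_t> hits_addend;
};

// Key of the rule table.
using LocalDescriptor = DescriptorBase;

struct Rule {
  uint64_t max_tokens;
};

// Looks up the rule for a populated descriptor. Only the base part
// (the entries) takes part in the comparison; hits_addend is ignored.
const Rule* findRule(const std::map<LocalDescriptor, Rule>& rules, const Descriptor& descriptor) {
  const auto it = rules.find(descriptor);
  return it == rules.end() ? nullptr : &it->second;
}
```

Keeping hits_addend out of the key means two requests with the same entries but different addends still share one token bucket.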

wbpcode · 2024-12-10T18:15:22Z

/retest

abeyad

/lgtm api

arkodg · 2024-12-10T23:31:02Z

api/envoy/config/route/v3/route_components.proto

+  //   :ref:`Route.typed_per_filter_config<envoy_v3_api_field_config.route.v3.Route.typed_per_filter_config>`, etc.
  Override limit = 4;
+
+  // An optional hits addend to be appended to the descriptor produced by this rate limit


Instead of appending to the descriptor, it should be used to populate the hits_addend field in the RLS request:
https://www.envoyproxy.io/docs/envoy/latest/api-v3/service/ratelimit/v3/rls.proto#service-ratelimit-v3-ratelimitrequest

A single RateLimit message is used to generate a single descriptor. The RateLimit.hits_addend field here is also descriptor-level configuration, so the value generated from RateLimit.hits_addend should populate the per-descriptor Descriptor.hits_addend rather than the request-level RateLimitRequest.hits_addend.

Shouldn't this be one level up, under https://www.envoyproxy.io/docs/envoy/latest/api-v3/config/route/v3/route_components.proto#config-route-v3-ratelimit, so it can set the hits_addend field of https://www.envoyproxy.io/docs/envoy/latest/api-v3/service/ratelimit/v3/rls.proto#service-ratelimit-v3-ratelimitrequest?
The rate limit request holds a list of descriptors.

The https://www.envoyproxy.io/docs/envoy/latest/api-v3/config/route/v3/route_components.proto#config-route-v3-ratelimit is what I meant by a single RateLimit.

Every route/vhost has a list of RateLimit configurations. Every RateLimit generates a single descriptor (a single descriptor is composed of multiple descriptor entries). All these descriptors together compose the rate limit request.

But note that every descriptor is evaluated independently and may match completely different rules. One of the most important goals of this PR is to support a per-descriptor hits addend, so that we can limit different resources, such as QPS or bandwidth, in the same request. You can check this PR's description for its goals. I have no plan to enhance the request-level hits addend because it can be replaced by the descriptor-level hits addend.
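As a rough illustration of limiting QPS and bandwidth in the same request, the sketch below builds two descriptors with different addends. `SketchDescriptor`, `buildDescriptors`, and the `resource` entry key are hypothetical and chosen only for this example:

```cpp
#include <cstdint>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical simplified descriptor: entries plus an optional per-descriptor addend.
struct SketchDescriptor {
  std::vector<std::pair<std::string, std::string>> entries;
  std::optional<uint64_t> hits_addend;
};

// Builds the descriptors for one request. Each descriptor is meant to be
// matched against its own rule, independently of the other.
std::vector<SketchDescriptor> buildDescriptors(uint64_t bytes_received) {
  std::vector<SketchDescriptor> descriptors;

  // QPS limit: no custom addend, so each request counts as one hit.
  SketchDescriptor qps;
  qps.entries.emplace_back("resource", "qps");
  descriptors.push_back(qps);

  // Bandwidth limit: the addend is the number of bytes received.
  SketchDescriptor bandwidth;
  bandwidth.entries.emplace_back("resource", "bandwidth");
  bandwidth.hits_addend = bytes_received;
  descriptors.push_back(bandwidth);

  return descriptors;
}
```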

wbpcode · 2024-12-11T03:05:51Z

Hi @mattklein123, could you take a look when you get some free time? Thanks. This PR only contains local rate limit related changes. I will create a new PR to support global rate limit after this is done.

Signed-off-by: wangbaiping(wbpcode) <wangbaiping@bytedance.com>

wbpcode · 2024-12-11T16:37:40Z

Finally, I reduced the code changes by 50%. I have tried my best to reduce the complexity of the review. orz.

mattklein123

/wait

mattklein123 · 2024-12-12T03:58:19Z

api/envoy/config/route/v3/route_components.proto

+    // Substitution format string to extract the number of hits to add to the rate limit descriptor.
+    // The same :ref:`format specifier <config_access_log_format>` as used for
+    // :ref:`HTTP access logging <config_access_log>` applies here.
+    //
+    // .. note::
+    //   The format string must contain only single valid substitution field that will be replaced
+    //   with a non-negative number.
+    //
+    // For example, the ``%BYTES_RECEIVED%`` format string will be replaced with the number of bytes
+    // received in the request.
+    //
+    // One of the ``number`` or ``format`` fields should be set but not both.
+    string format = 2 [(validate.rules).string = {prefix: "%" suffix: "%" ignore_empty: true}];


What is the behavior if this is invalid and/or is a format string that doesn't lead to an integer? Please clarify.

mattklein123 · 2024-12-12T04:03:26Z

source/extensions/filters/common/local_ratelimit/local_ratelimit_impl.cc

  do {
    // expected_tokens is either initialized above or reloaded during the CAS failure below.
-    if (expected_tokens == 0) {
+    if (expected_tokens < to_consume) {


I didn't look at this carefully but did you audit everything to make sure there can be no underflow or overflow conditions?

Good point. I am sure underflow or overflow won't happen because we only consume tokens when expected_tokens is at least to_consume, and both are uint32_t.

~~But I think this reminds me to add a stricter check on the value range of to_consume.~~

Using a uint64_t directly would be much simpler. The max to_consume is limited to 1000000000. If even uint64_t is not enough in this case, then just let it explode.
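A minimal sketch of the compare-and-swap loop under discussion, written against a bare `std::atomic` rather than the real token bucket class. `tryConsume` is a hypothetical name, and the relaxed memory order is an assumption of this sketch:

```cpp
#include <atomic>
#include <cstdint>

// Tries to take to_consume tokens from the bucket.
// Returns false, leaving the bucket untouched, when not enough tokens remain.
bool tryConsume(std::atomic<uint64_t>& tokens, uint64_t to_consume) {
  uint64_t expected_tokens = tokens.load(std::memory_order_relaxed);
  do {
    // expected_tokens is either initialized above or reloaded by a failed
    // compare-exchange below.
    if (expected_tokens < to_consume) {
      return false;
    }
    // Here expected_tokens >= to_consume, so the subtraction cannot underflow.
  } while (!tokens.compare_exchange_weak(expected_tokens, expected_tokens - to_consume,
                                         std::memory_order_relaxed));
  return true;
}
```

Because the guard runs on every iteration with the freshly reloaded value, a concurrent consumer cannot push the counter below zero between the check and the swap.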

mattklein123 · 2024-12-12T04:05:03Z

source/extensions/filters/common/ratelimit_config/ratelimit_config.cc

+    if (hits_addend.has_number_value()) {
+      descriptor.hits_addend_ = static_cast<uint32_t>(hits_addend.number_value());
+    } else {
+      ENVOY_LOG(warn, "hits_addend must be a number");


This can heavily log spam if there is a config mistake. This should be a warn every of some type. Optimally we would catch as much as possible at config load time which isn't happening.

Currently we have no way to check the provider's expected return type. I can change the level to debug and add a clearer description to the API.

Signed-off-by: wangbaiping(wbpcode) <wangbaiping@bytedance.com>

wbpcode · 2024-12-12T12:35:34Z

Added more comments about the behavior of hits_addend.
Added more tests for unexpected hits_addend values.

mattklein123 · 2024-12-18T03:44:43Z

source/extensions/filters/common/ratelimit_config/ratelimit_config.cc

+    if (success) {
+      descriptor.hits_addend_ = static_cast<uint64_t>(hits_addend);
+    } else {
+      ENVOY_LOG(debug, "Invalid hits_addend: {}", hits_addend_value.DebugString());


I think this is going to be very confusing to debug and I would go back to WARN but use one of the power of 2 or timed variants for that, but up to you.

Will change this in the next PR. This is part of a series of changes.
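For context, the "power of 2" idea keeps a warning visible without flooding the log: only the 1st, 2nd, 4th, 8th, ... occurrence is emitted. A standalone sketch of that sampling decision follows. `shouldLogPow2` is a hypothetical helper and does not use Envoy's logging facilities:

```cpp
#include <atomic>
#include <cstdint>

// Returns true on the 1st, 2nd, 4th, 8th, ... call made with the same counter.
bool shouldLogPow2(std::atomic<uint64_t>& counter) {
  const uint64_t occurrence = counter.fetch_add(1, std::memory_order_relaxed) + 1;
  // A positive integer is a power of two when it has exactly one bit set.
  return (occurrence & (occurrence - 1)) == 0;
}
```

With this sampling, sixteen consecutive invalid values would produce five log lines instead of sixteen.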

wbpcode · 2024-12-18T03:47:33Z

/retest

wbpcode · 2024-12-18T05:03:19Z

/retest

wbpcode · 2024-12-18T08:21:15Z

/retest

repokitteh-read-only bot added the api label Dec 9, 2024

repokitteh-read-only bot assigned abeyad Dec 9, 2024

abeyad reviewed Dec 9, 2024

wbpcode requested a review from mattklein123 as a code owner December 10, 2024 13:57

wbpcode changed the title ~~api: new per descriptor hits-addend support and dynamic hits addend~~ ratelimit: new per descriptor hits-addend support and dynamic hits addend Dec 10, 2024

wbpcode force-pushed the dev-per-descriptor-hits-adden-support branch from 2e2b2c3 to dc93699 Compare December 10, 2024 16:36

abeyad previously approved these changes Dec 10, 2024

repokitteh-read-only bot removed the api label Dec 10, 2024

arkodg reviewed Dec 10, 2024

mathetake mentioned this pull request Dec 11, 2024

http ratelimit: option to reduce budget on stream done #37548

Merged

wbpcode assigned mattklein123 and unassigned abeyad Dec 11, 2024

wbpcode dismissed abeyad’s stale review via d3f28fd December 11, 2024 07:36

wbpcode requested a review from zuercher as a code owner December 11, 2024 10:36

wbpcode requested a review from kyessenov as a code owner December 11, 2024 13:55

rate limit: add per-descriptor custom hits addend support

774066d

Signed-off-by: wangbaiping(wbpcode) <wangbaiping@bytedance.com>

wbpcode force-pushed the dev-per-descriptor-hits-adden-support branch from 18489a3 to 774066d Compare December 11, 2024 16:32

Merge branch 'main' of https://github.com/envoyproxy/envoy into dev-per-descriptor-hits-adden-support

b904206

mattklein123 requested changes Dec 12, 2024

repokitteh-read-only bot added the waiting label Dec 12, 2024

address comments

c22ebe4

Signed-off-by: wangbaiping(wbpcode) <wangbaiping@bytedance.com>

repokitteh-read-only bot removed the waiting label Dec 12, 2024

repokitteh-read-only bot added the api label Dec 12, 2024

mathetake mentioned this pull request Dec 12, 2024

Make default hitsAddend minimum value configurable envoyproxy/ratelimit#729

Closed

Merge branch 'main' of https://github.com/envoyproxy/envoy into dev-per-descriptor-hits-adden-support

692c9af

wbpcode mentioned this pull request Dec 16, 2024

global rate limit: supported ratelimits in the typed per filter config #37684

Merged

wbpcode requested a review from mattklein123 December 17, 2024 03:51

mattklein123 approved these changes Dec 18, 2024

repokitteh-read-only bot removed the api label Dec 18, 2024

wbpcode enabled auto-merge (squash) December 18, 2024 03:47

Merge branch 'main' of https://github.com/envoyproxy/envoy into dev-per-descriptor-hits-adden-support

2e9538b

wbpcode merged commit cac9b87 into envoyproxy:main Dec 18, 2024

wbpcode mentioned this pull request Dec 30, 2024

per descriptor hits_addend support #37347

Closed

wbpcode deleted the dev-per-descriptor-hits-adden-support branch December 30, 2024 10:16

zirain mentioned this pull request Jan 3, 2025

extproc: sets token usage into filter metadata envoyproxy/ai-gateway#62

Merged
