runtime: make random number generation produce 64 random bits by akonradi · Pull Request #1799 · envoyproxy/envoy

akonradi · 2017-10-03T18:47:50Z

The random number generator uses the std::ranlux48 generator, which produces 48-bit random numbers. This breaks the hashing HTTP load balancer, which assumes that all 64 bits are random; without this patch, it routes all requests to one upstream host. Signed-off-by: Alex Konradi <akonradi@google.com>

mattklein123 · 2017-10-03T18:50:54Z

source/common/runtime/runtime_impl.h

-  uint64_t random() override { return threadLocalGenerator()(); }
+  uint64_t random() override {
+    auto& gen = threadLocalGenerator();
+    return (gen() << 48) | gen();


This is not free and I'm not hugely in favor of calling this twice every time we want a random number. Can you explain the underlying issue more?

The hashing load balancer distributes its N hosts over 1024 buckets, then for each request, selects the nearest bucket to the hash value for the request. If no hash policy matches, the "hash" is randomly generated. A randomly-generated hash, though, has 0's as the top 16 bits. Since the "nearest" bucket is the one that has the smallest value greater than the request's has, the result is that the lowest-valued bucket is always selected, so only one host is ever routed to.

I agree that this is not free, and is probably not the best way to fix this particular degenerate case. It is, however, consistent with the declared API. I don't know whether there are other places where we rely on having 64 random bits, though.

If perf is a concern what do you think of splitting/renaming? If we had a random48() and random64() it'd solve the perf problem and increase clarity

Renaming is an option. Not sure if it's worth doing a bulk rename through the entire code base, but could add a new random64() function or something.

Another option is to switch the underlying pseudo random generator to one that spits out 64 bits. I'm not super familiar with the algorithms but we could look into this.

Another option is to special case the ring hash LB and just swap the returned bytes (just flip each 32-bit quad). That requires knowledge of the underlying algorithm but is then localized to the actual problem site.

Was there a reason to select std::ranlux48 in the first place? Or was this arbitrary?

I think I did some research and this was recommended as a good balance of randomness and speed.

Regardless of what is done elsewhere, we should either rename/add or change the documentation for random() because it's misleading.

Yup agreed. Can you maybe look and see if there is a different algorithm in std:: that we can use to get 64-bits. Also, there might be something in boringssl. cc @PiotrSikora

I've added generic RandomGeneratorImpl::random_bytes() as part work on UUID, but it was later simplified and inlined into RandomGeneratorImpl::uuid().

We might want to revisit that code (if it's not slower than ranlux48) instead of rolling our own pseudo-randomness.

There are no checks in Envoy that test for statistical randomness but this doesn't break any of the functionality tests. Signed-off-by: Alex Konradi <akonradi@google.com>

mattklein123

This change seems reasonable to me but all of these parameters basically mean nothing to me without doing a bunch of research. Is there anyone out there that is qualified to review quickly?

mattklein123 · 2017-10-04T21:29:24Z

Not hearing anyone excited about reviewing this. Will take a look in a bit. :)

mattklein123 · 2017-10-04T21:54:04Z

@rshriram is going to inquire with some math people at IBM research.

alyssawilk · 2017-10-05T13:11:07Z

We've been asking around on the Google crypto side as well :-)

…

On Wed, Oct 4, 2017 at 5:54 PM, Matt Klein ***@***.***> wrote: @rshriram <https://github.com/rshriram> is going to inquire with some math people at IBM research. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#1799 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ARYFvT-TYOP-VVjPsHJldTPImY-9xuZgks5so_6bgaJpZM4Psl_T> .

mattklein123 · 2017-10-05T15:29:23Z

@akonradi I think you meant to close this one. :)

**Description** This PR adds more struct definitions and their corresponding JSONMarshal and JSONUnmarshal methods according to the Anthropic Messages API. These are metrics that are provided in the latest Anthropic messages API. Currently, these metrics/metadata are not being tracked for tracing purposes. Eventually, these metrics should be used in openinference tracing as seen in https://github.com/envoyproxy/ai-gateway/blob/fefb039dc230c2da481254d88b27ed034bcf45a4/internal/tracing/openinference/anthropic/messages.go#L154 It does not introduce any tracing as that should be opened in a new issue. This means that another issue should be opened to include tracking these metrics. **Related Issues/PRs (if applicable)** Fixes #1799 **Special notes for reviewers (if applicable)** Claude code was used to write most of the code here, but I manually checked all of code that it generated. One thing that I noticed is that some of the previous API schema for Anthropic implemented in this repository were moved to beta status by Anthropic. This is something that the maintainers should discuss. 1. https://platform.claude.com/docs/en/api/messages#body-messages-content 2. https://platform.claude.com/docs/en/api/messages/create 3. https://platform.claude.com/docs/en/api/beta/messages#body-messages-content 4. https://platform.claude.com/docs/en/api/beta/messages/create --------- Signed-off-by: Chang Min <changminbark@gmail.com> Co-authored-by: Ignasi Barrera <ignasi@tetrate.io> Co-authored-by: Gavrish Prabhu <gavrish.prabhu@nutanix.com>

mattklein123 reviewed Oct 3, 2017

View reviewed changes

Use 64-bit ranlux generator

725ac61

There are no checks in Envoy that test for statistical randomness but this doesn't break any of the functionality tests. Signed-off-by: Alex Konradi <akonradi@google.com>

mattklein123 reviewed Oct 4, 2017

View reviewed changes

PiotrSikora mentioned this pull request Oct 5, 2017

perf: optimize random number generation and populate all 64 bits. #1812

Merged

mattklein123 closed this Oct 5, 2017

akonradi deleted the rand-bugfix branch October 9, 2017 17:06

Conversation

akonradi commented Oct 3, 2017

Uh oh!

mattklein123 Oct 3, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akonradi Oct 3, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alyssawilk Oct 3, 2017

Choose a reason for hiding this comment

Uh oh!

mattklein123 Oct 3, 2017

Choose a reason for hiding this comment

Uh oh!

htuch Oct 3, 2017

Choose a reason for hiding this comment

Uh oh!

mattklein123 Oct 3, 2017

Choose a reason for hiding this comment

Uh oh!

akonradi Oct 3, 2017

Choose a reason for hiding this comment

Uh oh!

mattklein123 Oct 3, 2017

Choose a reason for hiding this comment

Uh oh!

PiotrSikora Oct 3, 2017

Choose a reason for hiding this comment

Uh oh!

mattklein123 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mattklein123 commented Oct 4, 2017

Uh oh!

mattklein123 commented Oct 4, 2017

Uh oh!

alyssawilk commented Oct 5, 2017 via email

Uh oh!

mattklein123 commented Oct 5, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

mattklein123 Oct 3, 2017 •

edited

Loading

akonradi Oct 3, 2017 •

edited

Loading

mattklein123 left a comment •

edited

Loading