High performance `getledgerentry` by SirTyson · Pull Request #4350 · stellar/stellar-core

SirTyson · 2024-06-06T19:51:24Z

Description

Resolves #4306

Note: the following interface is now outdated. Please refer to docs/software/commands.md for the up to date interface. The performance measurements are still accurate.

Previous interface:

getledgerentry core endpoint is now high performance, non blocking and served by a multi-threaded HTTP server that does not interact with the main thread. This enables down stream systems to query this endpoint at high rates without captive-core nodes losing sync. Note that this endpoint is served by a different port and separated from stellar-core's other endpoints. The following config options have been added supporting this feature:

RPC_HTTP_PORT = 11627 # default listening port
RPC_THREADS = 4 # default threads serving getledgerentry endpoint
RPC_SNAPSHOT_LEDGERS = 5 # default number of ledgers retained in history

The HTTP request string is as follows:

getledgerentry?key=Base64&ledgerSeq=NUM

key is required, and is the Base64 XDR of the LedgerKey being queried. ledgerSeq is optional. If not set, stellar-core will return the LedgerEntry based on the most recent ledger. If ledgerSeq is set, stellar-core will return an entry based on a historical ledger snapshot at the given ledger. The return payload is a JSON object in the following format:

"ledger": ledgerSeq, // Required
"state": ["not_found" | "live" | "dead"], // Required
"entry": Base64 // Optional

ledger is the ledgerSeq that the query is based on, and is always returned. state returns "live" if a live LedgerEntry was found, or "dead" if the LedgerEntry does not exist. Additionally, if ledgerSeq is set to a snapshot that stellar-core does not currently have, "not_found" is returned. Finally, if state==live, "entry" is returned with the Base64 XDR encoding of the full LedgerEntry.

To measure performance, I used a parallel go script (thanks @Shaptic) with stellar/go/clients/stellarcore to send requests at a very high rate over local host. 1 million LedgerEntries of type ContractCode, ContractData, and Trustline were randomly sampled from the BucketList (such that all entries exist) for these requests, and no caching was used. Test was ran on test-core-003a.dev.stellar002 with a captive-core instance in sync with pubnet with the following benchmarks:

Request Rate: 2731 / sec
total queries: 100910, failed: 0, success: 100910
success rate: 100.000000
average latency: 366.163µs
min latency: 256.861µs

Request Rate: 11836 / sec
total queries: 103390, failed: 0, success: 103390
success rate: 100.000000
average latency: 422.453µs
min latency: 204.412µs

Request Rate: 18245 / sec
total queries: 100740, failed: 0, success: 100740
success rate: 100.000000
average latency: 548.105µs
min latency: 214.905µs

Checklist

Reviewed the contributing document
Rebased on top of master (no merge commits)
Ran clang-format v8.0.0 (via make format or the Visual Studio extension)
Compiles
Ran all tests
If change impacts performance, include supporting evidence per the performance document

SirTyson · 2024-06-19T01:13:27Z

I've now added a batch load endpoint called getledgerentrybatch. This is a POST method and requires the following body:

ledgerSeq=NUM&key=Base64&key=Base64...

ledgerSeq is an optional value that follows the same semantics as the getledgerentry endpoint. It is followed by one or more key to be queried. The return value is a JSON payload as follows:

{
"entries": [
  {"entry": "Base64-LedgerKey", "state": "dead"}, // dead entry
  {"entry": "Base64-LedgerEntry", "state": "live"}, // live entry
],
"ledger": ledgerSeq
}

If a ledgerSeq is queried but is not available, the return payload is as follows:

{"ledger": ledgerSeq, "state": "not_found"}

2opremio · 2024-06-19T17:33:25Z

I know this is just a prototype, but it would be more ergonomic to send JSON in the POST body..

Also, from the example response it seems like "entry" could point to both an entry or a key, I would suggest always providing a key and optionally providing an entry field which can be omitted, as follows:

{
"entries": [
  {"key": "Base64-LedgerKey", "state": "dead"}, // dead entry
  {"key": "Base64-LedgerKey", "entry": "Base64-LedgerEntry", "state": "live"}, // live entry
],
"ledger": ledgerSeq
}

Regarding:

{"ledger": ledgerSeq, "state": "not_found"}

Could you simply use a 404 HTTP status code instead?

2opremio · 2024-06-19T17:51:43Z

Additionally, how can I distinguish TTL'ed entries? Are TTL'ed entries the ones with "dead" status? I will also need the entry body for TTL'ed entries.

To be clear, what I need is a way to implement SnapshotSourceWithArchive from the endpoint you provide.

SirTyson · 2024-06-19T17:57:37Z

Additionally, how can I distinguish TTL'ed entries? Are TTL'ed entries the ones with "dead" status?

I will need the entry for those as well.

If an entry has been evicted, it will be reported as DEAD. If the entry is expired but not evicted, it will be returned as LIVE. This is just a raw key-value lookup that doesn’t enforce TTLs. To determine if a key is dead or not, you’ll need to load both the entry key and the TTL key. Here, live means “the key exists on the BucketList” and dead means “key does not exist on the BucketList” and is unrelated to TTL. I believe this is the same interface as your get_including_archived.

2opremio · 2024-06-19T18:00:04Z

Ah, ok, so live means not evicted (but possibly expired), and I need to query TTL entries separately (the more reason to have a batch endpoint)

2opremio · 2024-06-19T18:01:28Z

If "dead" means not found, we can simply omit those entries and assume omitted entries where not found.

Then we can get rid of the state field altogether (since present entries are implicitly live)

janewang · 2024-08-07T16:34:55Z

What is the maximum number of ledger entry keys this endpoint could accept?

SirTyson · 2024-08-07T16:44:59Z

What is the maximum number of ledger entry keys this endpoint could accept?

I haven't tested the maximum entries for a single query. However I doubt it will be a limiting factor, given we achieved a request rate of 20k RPS with an average latency of 548.105 us for point loads, and bulk loads are more efficient.

janewang · 2024-08-07T17:07:35Z

@SirTyson We currently support 200 keys which is more than sufficient.
Btw the current interface is here and I hope this is a non-breaking change for downstream clients: https://developers.stellar.org/docs/data/rpc/api-reference/methods/getLedgerEntries

MonsieurNicolas · 2024-08-09T19:14:47Z

@janewang this endpoint will not be directly exposed to clients. This is the backend that rpc will call, so if more encodings or other protocols/semantics need to be supported in clients this would be done in rpc not core.

src/main/QueryServer.h

src/main/CommandHandler.h

src/main/QueryServer.h

src/main/ApplicationImpl.cpp

src/bucket/BucketListSnapshot.cpp

src/bucket/BucketSnapshotManager.cpp

src/main/CommandHandler.h

src/main/ApplicationImpl.cpp

src/main/QueryServer.h

src/bucket/BucketListSnapshot.cpp

src/bucket/BucketSnapshotManager.cpp

dmkozh · 2024-08-16T20:50:57Z

LGTM, could you please rebase and squash?

SirTyson · 2024-08-16T20:55:20Z

LGTM, could you please rebase and squash?

Done

Shaptic

Bit of a uninformed drive-by review but I wanted to try to understand the new behavior

docs/software/commands.md

docs/stellar-core_example.cfg

lib/httpthreaded/reply.cpp

lib/httpthreaded/request.hpp

src/bucket/test/BucketIndexTests.cpp

src/main/QueryServer.cpp

SirTyson · 2024-08-20T23:02:48Z

All your comments should be addressed @Shaptic

src/bucket/BucketListSnapshot.h

SirTyson force-pushed the http-threaded-2 branch from 4836bb2 to a81b48a Compare June 19, 2024 01:06

SirTyson force-pushed the http-threaded-2 branch from a81b48a to 1dfcd9f Compare August 1, 2024 20:49

SirTyson marked this pull request as ready for review August 9, 2024 18:30

SirTyson force-pushed the http-threaded-2 branch from 8e801ee to c1c7dbf Compare August 9, 2024 18:31

SirTyson requested review from dmkozh and marta-lokhova August 9, 2024 18:31

SirTyson force-pushed the http-threaded-2 branch from c1c7dbf to 486f145 Compare August 9, 2024 18:42

dmkozh reviewed Aug 9, 2024

View reviewed changes

This was referenced Aug 13, 2024

clients/stellarcore: Add support for Core's new HTTP endpoints stellar/go-stellar-sdk#5426

Closed

Replace getLedgerEntries() with a proxy to Core's /getledgerentry. stellar/stellar-rpc#269

Closed

dmkozh reviewed Aug 15, 2024

View reviewed changes

SirTyson force-pushed the http-threaded-2 branch from 155dfa9 to 6341749 Compare August 16, 2024 20:55

This was referenced Aug 19, 2024

Epic: Protocol 22 Changes stellar/go-stellar-sdk#5433

Closed

Epic: Protocol 23 Changes stellar/stellar-rpc#267

Closed

Shaptic reviewed Aug 19, 2024

View reviewed changes

SirTyson force-pushed the http-threaded-2 branch from 6341749 to 1cdd7aa Compare August 20, 2024 23:03

dmkozh reviewed Aug 21, 2024

View reviewed changes

src/bucket/BucketListSnapshot.h Outdated Show resolved Hide resolved

Added multithreaded http server library

62999e0

SirTyson force-pushed the http-threaded-2 branch from 1cdd7aa to 86cf443 Compare August 21, 2024 22:05

Added Query server with getledgerentry support

de2c587

SirTyson force-pushed the http-threaded-2 branch from b51449f to de2c587 Compare August 21, 2024 22:36

dmkozh approved these changes Aug 21, 2024

View reviewed changes

dmkozh enabled auto-merge August 21, 2024 22:36

dmkozh added this pull request to the merge queue Aug 21, 2024

Merged via the queue into stellar:master with commit 6170bc4 Aug 22, 2024

SirTyson mentioned this pull request Sep 3, 2024

clients/stellarcore: Add support for Core's getledgerentryraw endpoint stellar/go-stellar-sdk#5455

Closed

marta-lokhova mentioned this pull request Sep 9, 2024

Switch read-only LedgerTxn to BucketList snapshots #4431

Merged

SirTyson mentioned this pull request Jan 15, 2025

Add state archival getledgerentry endpoint #4623

Closed

6 tasks

Conversation

SirTyson commented Jun 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

SirTyson commented Jun 19, 2024

Uh oh!

2opremio commented Jun 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

2opremio commented Jun 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SirTyson commented Jun 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

2opremio commented Jun 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

2opremio commented Jun 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

janewang commented Aug 7, 2024

Uh oh!

SirTyson commented Aug 7, 2024

Uh oh!

janewang commented Aug 7, 2024

Uh oh!

MonsieurNicolas commented Aug 9, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dmkozh commented Aug 16, 2024

Uh oh!

SirTyson commented Aug 16, 2024

Uh oh!

Shaptic left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SirTyson commented Aug 20, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

SirTyson commented Jun 6, 2024 •

edited

Loading

2opremio commented Jun 19, 2024 •

edited

Loading

2opremio commented Jun 19, 2024 •

edited

Loading

SirTyson commented Jun 19, 2024 •

edited

Loading

2opremio commented Jun 19, 2024 •

edited

Loading

2opremio commented Jun 19, 2024 •

edited

Loading