
add weighted reservoir sampling (AE-S)#44

Merged
tbg merged 1 commit into cockroachdb:master from tbg:split
Sep 10, 2014

Conversation

@tbg
Member

@tbg tbg commented Sep 10, 2014

Implemented algorithm A-Res from http://utopia.duth.gr/~pefraimi/research/data/2007EncOfAlg.pdf: a reservoir sampling algorithm in which each element's chance of appearing in the final sample is proportional to the (positive) weight assigned to it.
If the need arises, we will fairly easily be able to implement A-ExpJ on top of this (though I don't think we'll ever need it) to save some random numbers.
The careful reviewer will observe that I really wanted a priority queue rather than an abstract heap (to get "peek"), but in this version the exposition is clearer and the performance hit is no more than a slightly worse constant depending on the reservoir size.
A-Res will be used to generate the split key when one is needed, taking the size of key-value pairs as their weight. Other metrics are possible, allowing for a range of interesting split key criteria.

@tbg tbg assigned andybons and spencerkimball and unassigned andybons Sep 10, 2014
@andybons
Contributor

lgtm 👍

@andybons
Contributor

I’m convinced Toby just likes implementing stuff from papers he finds on the internet ;)

Member

I'm always a little confused by this. If a type which is actually a slice, such as WeightedValueHeap, is used, does it even make sense to refer to it via a pointer? Or is it better to just do func (h WeightedValueHeap)...? Is there some standardized way of treating these types?
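The usual Go convention: methods that only read a slice-backed type can take a value receiver (the slice header is copied but still points at the same backing array), while methods that grow or shrink it must take a pointer receiver, since append can reallocate and always updates the header's length. A minimal sketch (the method set here is illustrative, not the PR's exact code):

```go
package main

import "fmt"

// WeightedValueHeap is a slice-backed type, as in the PR under review.
type WeightedValueHeap []float64

// Len only reads the slice, so a value receiver suffices: the copied
// header still references the same backing array.
func (h WeightedValueHeap) Len() int { return len(h) }

// Push must use a pointer receiver: append may reallocate the backing
// array and always changes the header's length, and the caller's
// variable needs to observe both.
func (h *WeightedValueHeap) Push(x float64) { *h = append(*h, x) }

func main() {
	var h WeightedValueHeap
	h.Push(1.5) // Go takes &h automatically for addressable values
	h.Push(2.5)
	fmt.Println(h.Len()) // prints 2
}
```

This is also why container/heap's Push and Pop are conventionally defined on the pointer type while Len, Less, and Swap are defined on the value type.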

@toddlipcon

Curious why this is useful for split key determination. Doesn't this imply that, if all of the key-values are the same length, then a split becomes uniformly random? It seems to me that, in that case, you'd want the split to be preferentially near the middle of the data (if not exactly near the middle).

util/sampling.go Outdated
Member

Instead of this method, why not just have the plain old NewWeightedReservoirSample allow minHeap to be nil, in which case it creates a WeightedValueHeap as the default...

Member Author

good point, doing that

@spencerkimball
Member

LGTM

@spencerkimball
Member

@toddlipcon: the idea is to end up with a weighted sampling of all the keys in a range by the total size in bytes of the key + value. Let's say you kept a sample of 100 key/values. When finished, you'd pull all 100 keys out of the sample, sort them, and take the 50th as the split key.
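The procedure Spencer describes — pull the sampled keys out, sort them, take the middle one — can be sketched in a few lines of Go (`chooseSplitKey` is a hypothetical helper name, not code from this PR):

```go
package main

import (
	"fmt"
	"sort"
)

// chooseSplitKey picks the median of a weighted key sample: sort the
// sampled keys and return the middle one, per the procedure above.
func chooseSplitKey(sampledKeys []string) string {
	// Copy before sorting so the caller's sample is left untouched.
	sorted := append([]string(nil), sampledKeys...)
	sort.Strings(sorted)
	return sorted[len(sorted)/2]
}

func main() {
	sample := []string{"carrot", "apple", "banana", "egg", "date"}
	fmt.Println(chooseSplitKey(sample)) // prints "carrot"
}
```

Because the sample is weighted by key+value byte size, the median of the sample approximates the key that splits the range's bytes, not its key count, in half.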

@toddlipcon

That still depends on a linear scan over all of the keys in order to construct the reservoir sample. If you're going to scan the whole range, couldn't you do this exactly with the simple algorithm (pseudocode):

total_size = sum the byte size of all files in the range
scanner = new InternalScanner()
scanned_size = 0
while scanner.stats.bytes_scanned() < total_size / 2:
    scanner.next()

or something of that sort?

Or, assuming you have a tree structure key index in your SSTables, you could probably do this with a logarithmic number of seeks instead of a linear scan.
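Todd's scan-to-midpoint pseudocode, assuming the total byte size is known up front, amounts to something like the following Go sketch (the `kv` type and `findSplitByScan` name are illustrative, not CockroachDB code):

```go
package main

import "fmt"

// kv is a hypothetical key/value pair; its "size" is the byte length
// of the key plus the value.
type kv struct{ key, value string }

// findSplitByScan returns the first key at which the cumulative byte
// size of scanned pairs reaches half the total, mirroring the
// pseudocode above.
func findSplitByScan(pairs []kv) string {
	var total int
	for _, p := range pairs {
		total += len(p.key) + len(p.value)
	}
	scanned := 0
	for _, p := range pairs {
		scanned += len(p.key) + len(p.value)
		// Integer-safe form of: scanned >= total / 2.
		if 2*scanned >= total {
			return p.key
		}
	}
	return "" // empty range
}

func main() {
	pairs := []kv{{"a", "xx"}, {"b", "xx"}, {"c", "xx"}}
	fmt.Println(findSplitByScan(pairs)) // prints "b"
}
```

The trade-off discussed below is that this needs an accurate total size before the scan starts, which the sampling approach does not.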

@spencerkimball
Member

Yep, that would work; it certainly has the benefit of simplicity and would take half the time on the scan. We don't have a particularly accurate signal for total bytes in the range, and we don't have access (or at least don't want to try to ghetto-rig access) to the low-level RocksDB SST files. Still, the split key does not have to be anywhere near an exact choice, so we could use RocksDB's estimate for the size of the range.

Tobias, up to you how you want to do it.


@tbg
Member Author

tbg commented Sep 10, 2014

Leaving it as is for now, to be reconsidered as stats evolve.

address feedback from @spencerkimball

final commit before merge
tbg added a commit that referenced this pull request Sep 10, 2014
add weighted reservoir sampling (AE-S)
@tbg tbg merged commit 87044be into cockroachdb:master Sep 10, 2014
@tbg tbg deleted the split branch September 16, 2014 08:56
soniabhishek pushed a commit to soniabhishek/cockroach that referenced this pull request Feb 15, 2017
ebembi-crdb added a commit to ebembi-crdb/generated-diagrams that referenced this pull request Feb 19, 2026
Source commit: 4c61bb667fdd42452a5040622cc3512ec4ba7d21
Source PR: cockroachdb/cockroach#44