box: speed up `tuple_new()` for large sparse tuples by 3.5x by Gumix · Pull Request #9793 · tarantool/tarantool

Gumix · 2024-03-11T16:41:33Z

The first 2 patches add the bench_tuple_new<FORMAT_SPARSE> benchmark, that creates sparse tuples (1K fields each, 990 of which are nils).

The next 2 patches speed up the benchmark from 7 μs to 2 μs per iteration.

Needed for tarantool/tarantool-ee#711

coveralls · 2024-03-11T16:55:31Z

coverage: 86.948% (-0.06%) from 87.007%
when pulling dad86b1 on Gumix:iverbin/gh-711-optimize-validation-of-sparse-tuples-in-memcs
into 3978d54
on tarantool:master.

perf/tuple.cc

src/box/tuple_format.c

Currently `class MpData` generates msgpack data with a predefined format, let's call it `FORMAT_BASIC`. This patch allows to extend it with other formats. No functional changes. Needed for tarantool/tarantool-ee#711 NO_DOC=perf test NO_TEST=perf test NO_CHANGELOG=perf test

perf/tuple.cc

changelogs/unreleased/box-speed-up-tuple_new.md

src/box/tuple_format.c

Implement `class MpData<FORMAT_SPARSE>`, which generates 1000 fields, 10 of them contain unsigned integers, while the remaining are null. Needed for tarantool/tarantool-ee#711 NO_DOC=perf test NO_TEST=perf test NO_CHANGELOG=perf test

It is possible to skip MP_NIL by mp_decode_nil(), which is faster than mp_next(). This patch improves bench_tuple_new<FORMAT_SPARSE> by 2.2x. NO_WRAP $ taskset 0x2 ~/benchmark/tools/compare.py benchmarks \ ./tuple.perftest.old ./tuple.perftest.new \ --benchmark_min_warmup_time=10 \ --benchmark_repetitions=30 \ --benchmark_report_aggregates_only=true \ --benchmark_filter=tuple_new\<FORMAT_SPARSE\> [...] Comparing ./tuple.perftest.old to ./tuple.perftest.new Benchmark Time CPU Time Old Time New CPU Old CPU New ------------------------------------------------------------------------------------------------------------------------------------ bench_tuple_new<FORMAT_SPARSE>_mean -0.5525 -0.5525 6985 3126 6985 3126 bench_tuple_new<FORMAT_SPARSE>_median -0.5445 -0.5444 6838 3115 6838 3115 bench_tuple_new<FORMAT_SPARSE>_stddev -0.8368 -0.8367 541 88 541 88 bench_tuple_new<FORMAT_SPARSE>_cv -0.6354 -0.6352 0 0 0 0 NO_WRAP Needed for tarantool/tarantool-ee#711 NO_DOC=perf improvement NO_TEST=perf improvement NO_CHANGELOG=next commit

If the number of tuple fields is less than `format->min_field_count`, then some required field is missed, i.e., there is no need to update the `required_fields` bitmap during msgpack decoding. This optimization is valid only if tuple format doesn't contain fields accessed by JSON paths. This patch improves bench_tuple_new by 15-50%, depending on field count. NO_WRAP $ taskset 0x2 ~/benchmark/tools/compare.py benchmarks \ ./tuple.perftest.old ./tuple.perftest.new \ --benchmark_min_warmup_time=10 \ --benchmark_repetitions=30 \ --benchmark_report_aggregates_only=true \ --benchmark_filter=tuple_new [...] Comparing ./tuple.perftest.old to ./tuple.perftest.new Benchmark Time CPU Time Old Time New CPU Old CPU New ------------------------------------------------------------------------------------------------------------------------------------ bench_tuple_new<FORMAT_BASIC>_mean -0.1469 -0.1470 126 107 126 107 bench_tuple_new<FORMAT_BASIC>_median -0.1428 -0.1429 124 106 124 106 bench_tuple_new<FORMAT_BASIC>_stddev +0.0589 +0.0600 4 5 4 5 bench_tuple_new<FORMAT_BASIC>_cv +0.2412 +0.2427 0 0 0 0 bench_tuple_new<FORMAT_SPARSE>_mean -0.3754 -0.3753 3104 1939 3104 1939 bench_tuple_new<FORMAT_SPARSE>_median -0.3749 -0.3747 3071 1920 3071 1920 bench_tuple_new<FORMAT_SPARSE>_stddev -0.3482 -0.3482 85 55 85 55 bench_tuple_new<FORMAT_SPARSE>_cv +0.0434 +0.0434 0 0 0 0 NO_WRAP Needed for tarantool/tarantool-ee#711 NO_DOC=perf improvement

sergepetrenko

Thanks for the patch!

src/box/tuple_format.c

Gumix requested a review from a team as a code owner March 11, 2024 16:41

Gumix requested review from locker and sergepetrenko March 11, 2024 17:19

Gumix assigned locker and sergepetrenko Mar 11, 2024

locker requested changes Mar 12, 2024

View reviewed changes

perf/tuple.cc Outdated Show resolved Hide resolved

perf/tuple.cc Show resolved Hide resolved

src/box/tuple_format.c Show resolved Hide resolved

src/box/tuple_format.c Show resolved Hide resolved

src/box/tuple_format.c Outdated Show resolved Hide resolved

locker assigned Gumix and unassigned locker Mar 12, 2024

Gumix unassigned sergepetrenko Mar 12, 2024

Gumix requested a review from a team as a code owner March 12, 2024 18:33

Gumix changed the title ~~box: speed up tuple_new() for large sparse tuples by 2~3x~~ box: speed up tuple_new() for large sparse tuples by 3.5x Mar 12, 2024

Gumix requested a review from locker March 12, 2024 19:26

Gumix assigned locker and unassigned Gumix Mar 12, 2024

locker approved these changes Mar 13, 2024

View reviewed changes

perf/tuple.cc Outdated Show resolved Hide resolved

changelogs/unreleased/box-speed-up-tuple_new.md Outdated Show resolved Hide resolved

src/box/tuple_format.c Outdated Show resolved Hide resolved

locker assigned Gumix and unassigned locker Mar 13, 2024

Gumix added 3 commits March 13, 2024 15:23

perf: add sparse format to tuple perf test

c2dc84a

Implement `class MpData<FORMAT_SPARSE>`, which generates 1000 fields, 10 of them contain unsigned integers, while the remaining are null. Needed for tarantool/tarantool-ee#711 NO_DOC=perf test NO_TEST=perf test NO_CHANGELOG=perf test

Gumix assigned sergepetrenko and unassigned Gumix Mar 13, 2024

sergepetrenko approved these changes Mar 13, 2024

View reviewed changes

src/box/tuple_format.c Outdated Show resolved Hide resolved

p7nov approved these changes Mar 14, 2024

View reviewed changes

ochaplashkin approved these changes Mar 14, 2024

View reviewed changes

sergepetrenko added the full-ci Enables all tests for a pull request label Mar 14, 2024

sergepetrenko merged commit 26bf1cb into tarantool:master Mar 18, 2024

Gumix mentioned this pull request May 13, 2024

Refactor and speed up recovery #5787

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

box: speed up `tuple_new()` for large sparse tuples by 3.5x#9793

box: speed up `tuple_new()` for large sparse tuples by 3.5x#9793
sergepetrenko merged 4 commits intotarantool:masterfrom
Gumix:iverbin/gh-711-optimize-validation-of-sparse-tuples-in-memcs

Gumix commented Mar 11, 2024 •

edited

Loading

Uh oh!

coveralls commented Mar 11, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sergepetrenko left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

Gumix commented Mar 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coveralls commented Mar 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sergepetrenko left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Gumix commented Mar 11, 2024 •

edited

Loading

coveralls commented Mar 11, 2024 •

edited

Loading