Reduce comparator objects init cost in BlockIter by XinyuZeng · Pull Request #9611 · facebook/rocksdb

XinyuZeng · 2022-02-21T03:17:48Z

This PR solves the problem discussed in #7149. By storing the pointer of InternalKeyComparator as icmp_ in BlockIter, the object size remains the same. And for each call to CompareCurrentKey, there is no need to create Comparator objects. One can use icmp_ directly or use the "user_comparator" from the icmp_.

Test Plan: with #9903,

$ TEST_TMPDIR=/dev/shm python3.6 ../benchmark/tools/compare.py benchmarks ./db_basic_bench ../rocksdb-pr9611/db_basic_bench --benchmark_filter=DBGet/comp_style:0/max_data:134217728/per_key_size:256/enable_statistics:0/negative_query:0/enable_filter:0/mmap:1/iterations:262144/threads:1 --benchmark_repetitions=50
...
Comparing ./db_basic_bench to ../rocksdb-pr9611/db_basic_bench                                                                                                                                                                                            
Benchmark                                                                                                                                                               Time             CPU      Time Old      Time New       CPU Old       CPU New
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
...
DBGet/comp_style:0/max_data:134217728/per_key_size:256/enable_statistics:0/negative_query:0/enable_filter:0/mmap:1/iterations:262144/threads:1_pvalue                 0.0001          0.0001      U Test, Repetitions: 50 vs 50
DBGet/comp_style:0/max_data:134217728/per_key_size:256/enable_statistics:0/negative_query:0/enable_filter:0/mmap:1/iterations:262144/threads:1_mean                  -0.0483         -0.0483          3924          3734          3924          3734
DBGet/comp_style:0/max_data:134217728/per_key_size:256/enable_statistics:0/negative_query:0/enable_filter:0/mmap:1/iterations:262144/threads:1_median                -0.0713         -0.0713          3971          3687          3970          3687
DBGet/comp_style:0/max_data:134217728/per_key_size:256/enable_statistics:0/negative_query:0/enable_filter:0/mmap:1/iterations:262144/threads:1_stddev                -0.0342         -0.0344           225           217           225           217
DBGet/comp_style:0/max_data:134217728/per_key_size:256/enable_statistics:0/negative_query:0/enable_filter:0/mmap:1/iterations:262144/threads:1_cv                    +0.0148         +0.0146             0             0             0             0
OVERALL_GEOMEAN                                                                                                                                                      -0.0483         -0.0483             0             0             0             0

facebook-github-bot · 2022-04-24T05:38:08Z

@ajkr has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2022-04-24T05:38:35Z

@XinyuZeng has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2022-04-25T18:13:40Z

@ajkr has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

ajkr

LGTM. Updated the "Test Plan" to show results of microbenchmark

facebook-github-bot · 2022-04-26T00:59:29Z

@ajkr has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

ajkr · 2022-04-26T01:02:36Z

table/block_based/block.h


-  UserComparatorWrapper ucmp() { return UserComparatorWrapper(raw_ucmp_); }
+    icmp_ =
+        std::make_unique<InternalKeyComparator>(raw_ucmp, false /* named */);


Can InternalKeyComparator wrap a UserComparatorWrapper?

It seems like InternalKeyComparator already has a UserComparatorWrapper as a private member. Calling icmp_->user_comparator() refers to it.

Summary: I tried evaluating #9611 using DBGet microbenchmarks but mostly found the change is well within the noise even for hundreds of repetitions; meanwhile, the InternalKeyComparator CPU it saves is 1-2% according to perf so it should be measurable. In this PR I tried adding a mmap mode that will bypass compression/checksum/block cache/file read to focus more on the block lookup paths, and also increased the Get() count. Pull Request resolved: #9903 Reviewed By: jay-zhuang, riversand963 Differential Revision: D35907375 Pulled By: ajkr fbshipit-source-id: 69490d5040ef0863e1ce296724104d0aa7667215

XinyuZeng · 2022-04-30T15:00:42Z

hey @ajkr, do you mind share the compare.py script you used in the Test Plan? Thanks!

ajkr · 2022-05-01T04:58:38Z

hey @ajkr, do you mind share the compare.py script you used in the Test Plan? Thanks!

Sure, it's a script provided in google benchmark -- https://github.com/google/benchmark/blob/main/tools/compare.py

riversand963 · 2022-07-12T23:05:28Z

@XinyuZeng can you take a look at #10340 ?

XinyuZeng · 2022-07-13T01:49:52Z

@riversand963 Through a quick look, I am not sure why #9611 introduces more allocations using glibc because it should reduce the number of allocations compared to the commit before it. Before the commit, each call to CompareCurrentKey will have an allocation. But I do agree with #10342:

As a side effect, internal key comparator was made configurable too. This introduces overhead to this simple wrapper. For example, every InternalKeyComparator will have an std::vector attached to it, which consumes memory and possible allocation overhead too.

This is also what I profiled and found when I did #9611

XinyuZeng added 2 commits February 20, 2022 23:34

keep icmp pointer in BlockIter instead of raw_ucmp

72bb50f

delete commented code

db51180

facebook-github-bot added the CLA Signed label Feb 21, 2022

Merge branch 'main' into reduce_cmp_init

2a60c1b

XinyuZeng closed this Apr 24, 2022

ajkr reopened this Apr 25, 2022

ajkr mentioned this pull request Apr 25, 2022

Add mmap DBGet microbench parameters #9903

Closed

ajkr approved these changes Apr 26, 2022

View reviewed changes

ajkr reviewed Apr 26, 2022

View reviewed changes

ajkr mentioned this pull request May 1, 2022

Fix PinSelf() read-after-free in DB::GetMergeOperands() #9507

Closed

facebook-github-bot closed this in 8b74cea May 4, 2022

siying mentioned this pull request Jul 12, 2022

Make InternalKeyComparator not configurable #10342

Closed

andrew-kryczka mentioned this pull request Oct 27, 2025

eliminate per-iterator heap allocation by constructing InternalKeyComparator in-place #14044

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce comparator objects init cost in BlockIter#9611

Reduce comparator objects init cost in BlockIter#9611
XinyuZeng wants to merge 3 commits intofacebook:mainfrom
XinyuZeng:reduce_cmp_init

XinyuZeng commented Feb 21, 2022 •

edited by ajkr

Loading

Uh oh!

facebook-github-bot commented Apr 24, 2022

Uh oh!

facebook-github-bot commented Apr 24, 2022

Uh oh!

facebook-github-bot commented Apr 25, 2022

Uh oh!

ajkr left a comment

Uh oh!

facebook-github-bot commented Apr 26, 2022

Uh oh!

ajkr Apr 26, 2022

Uh oh!

XinyuZeng Apr 26, 2022

Uh oh!

XinyuZeng commented Apr 30, 2022

Uh oh!

ajkr commented May 1, 2022

Uh oh!

riversand963 commented Jul 12, 2022

Uh oh!

XinyuZeng commented Jul 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

XinyuZeng commented Feb 21, 2022 • edited by ajkr Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented Apr 24, 2022

Uh oh!

facebook-github-bot commented Apr 24, 2022

Uh oh!

facebook-github-bot commented Apr 25, 2022

Uh oh!

ajkr left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Apr 26, 2022

Uh oh!

ajkr Apr 26, 2022

Choose a reason for hiding this comment

Uh oh!

XinyuZeng Apr 26, 2022

Choose a reason for hiding this comment

Uh oh!

XinyuZeng commented Apr 30, 2022

Uh oh!

ajkr commented May 1, 2022

Uh oh!

riversand963 commented Jul 12, 2022

Uh oh!

XinyuZeng commented Jul 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

XinyuZeng commented Feb 21, 2022 •

edited by ajkr

Loading