memory reporting of clients argv and scripts #5159

oranagra · 2018-07-22T07:15:06Z

This PR contains several commits handling related things. please feel free to cherry pick just the first 2 commits, or any other portions you like.

Track and report memory used by clients argv.
this is very useful in case clients started sending a command and didn't
complete it. in which case the first args of the command are already
trimmed from the query buffer.
report memory used by scripts (not the lua lib itself).
this is useful in case a bad client is flooding the server with many
unique scripts, and their code in redis takes a lot of ram. please use Adds memory information about the scripts' cache to INFO #4883 instead
Include internal sds fragmentation in MEMORY reporting
performance and memory reporting improvment - sds take control of it's internal frag
this commit has two aspects:
A) improve memory reporting for all the places that use sdsAllocSize to compute
memory used by a string, in this case it'll include the internal fragmentation.
B) reduce the need for realloc calls by making the sds implicitly take over
the internal fragmentation of the block it allocated.

itamarhaber · 2018-07-22T11:37:20Z

@oranagra I humbly beg that you compare your 2 to mine - https://github.com/antirez/redis/pull/4883 - iterating is slow :)

oranagra · 2018-07-22T11:47:27Z

@itamarhaber thanks, indeed my script counting code would have slowed down INFO in cases were there are many many scripts.
Removing this commit from my branch. @antirez please take #4883 instead.

antirez · 2018-07-23T16:47:13Z

Thank you, other one merged.

antirez · 2018-07-23T16:47:50Z

Reopening this one as I think there is more meat to merge.

oranagra · 2018-07-23T16:50:10Z

@antirez yes, this PR contained 4 commits, one of which was overlapping another PR (which was better in some aspects), so i removed that commit from this PR (now contains 3 commits).
you may decide to merge just the first one, first two, or all three.

oranagra · 2018-07-31T15:27:10Z

@antirez a reminder about this one in case you meant to review and forgot. also do you want me to resolve that minor conflict that was created by merging the other PR, or will you handle that in case you merge?

antirez · 2018-07-31T15:32:34Z

Thanks for bumping this on the list! I'll check tomorrow, a rebase would be appreciated, and sorry for breaking the patch merging other stuff.

antirez · 2018-07-31T15:33:29Z

P.S. I wish Github could warn users about "merging this will make X,Y,Z impossible to merge". For instance a few days ago I merged a typos PR that ruined a number of other PRs...

track and report memory used by clients argv. this is very usaful in case clients started sending a command and didn't complete it. in which case the first args of the command are already trimmed from the query buffer.

…s internal frag This commit has two aspects: 1) improve memory reporting for all the places that use sdsAllocSize to compute memory used by a string, in this case it'll include the internal fragmentation. 2) reduce the need for realloc calls by making the sds implicitly take over the internal fragmentation of the block it allocated.

oranagra · 2018-07-31T15:47:24Z

resolved (push -f). the conflicts were created by another PR i submitted (which i intended you to merge only if you decide not to merge this one).

i share the same pain about merge conflicts (mainly when cherry picking between branches).. many times i prefer not to fix types and formatting problems (white space and others), or even not to refactor ugly code (if it's bug free), so that it won't cause conflicts.

antirez · 2018-10-02T08:33:02Z

Hi @oranagra, I was checking the first commit implementing the argv memory reporting.
Here we are ending with two similar functions:

size_t getStringObjectSize(robj *o);
size_t getStringObjectSdsUsedMemory(robj *o):

The first is introduced by your commit, the second is already in the source tree.
The difference between the two should be that one includes external fragmentation and one does not.

In turn the two calls two different functions to return the memory used by an SDS string:

size_t sdsAllocSize(sds s);
size_t sdsZmallocSize(sds s);

The first returns the SDS logical memory used: header + size of the allocation including the final null term. The second instead reports the size of the allocation where the SDS is stored.

I wonder if we need both (probably yes, because sometimes we want to understand the total memory usage and sometimes just what the higher level API tried to allocate), but anyway the naming scheme of such functions look very confusing. So I think here there is a good chance to fix this.

I would change them in the following way:

sdsAllocSize() -> sdsLogicalSize()
sdsZmallocSize() -> sdsAllocationSize()
getStringObjectSize() -> getStringObjectLogicalSize()
getStringObjectSdsUsedMemory() -> getStringObjectAllocationSize()

WDYT?

oranagra · 2018-10-02T10:54:13Z

@antirez thanks for reviewing.
to me, considering we have sdslen, and sdsfree, i would expect their sum to be named "alloc", but maybe "size" would be ok too (i'm worried it "size" may be confused with "len").
also, considering we have many calls to zmalloc_size in the project (to get the real allocated size including internal fragmentation), it sounds logical to me that the sds function doing that will be named "zmalloc", this would not look good in the sds repo, but this may be the reason why this function is in networking.c and not sds.c.

anyway, this is just naming, so after the above arguments, feel free to decide what sounds best to you, and either rename yourself, or ask me to fix the PR.

however, like i did several times in the past, this PR introduces the changes in several stages (so that you can decide to merge just the first commit, or first two, without merging the last).
but if you do decide to merge the last commit, then the difference between these two functions disappears, since now any sds will take responsibility of it's internal fragmentation.

the reason i didn't use sdsZmallocSize in the first commit, and instead introduced a new function that reports the "logical" size without fragmentation, was that i didn't want the performance overhead of calling sdsZmallocSize. but the 3rd commit changes sds.c in a way that every sds is now aware of it's internal fragmentation.
The two benefits of doing that are:

memory reporting becomes implicitly more accurate.
less cases were the sds needs to be regrown / realloced.

p.s. in the light of the recent #5397, i've reviewed my second commit and i see two things that need fixing:

object.c line 783 (OBJ_ENCODING_EMBSTR) we need to call getStringObjectSdsUsedMemory
object.c line 1318 (when MEMORY USAGE adds the keyname), we need to either call getStringObjectSdsUsedMemory, or preferably, do that on the key in the dict rather than the one in argv (anyway, we can use sdsZmallocSize on obj->ptr).

so anyway, please finish your review and decide what parts you want to merge.
then either fix / refactor what you wish during merge. or if you prefer, tell me what you want to fix / merge and i'll update the PR.

antirez · 2018-10-02T11:27:50Z

Thank you @oranagra. A few remarks:

I've the feeling that the argv accounting change may be better like that to avoid a cache miss:

      c->argv_bytes += bulklen;

I think that this preserves completely the goal of your original commit, that is to understand if some client is doing crazy stuff. However I've the feeling that calling the function and potentially having to cache-miss, may have an impact in certain work loads.

Otherwise I'm taking the commit exactly like you proposed it, with the changes needed to apply it (since we accumulated other differences).

MEMORY USAGE -> I'm positive about this change, even if it's an odd API change. In some way one could argue that MEMORY USAGE should report the logical memory used by the system, so that different versions of Redis running with different allocators will always report the same numbers for objects allocated in a similar way. However the reality is that such a tool is especially useful for provisioning and debugging, so IMHO such "purity" is pointless, and is better to go for your approach.
The SDS change looks good as well to me, since this way we may use the full potential of the allocation. However I wonder why we don't juse use the API provided by malloc_size() & co, so that we have to export a single additional function to SDS. I'm not sure there is much gain in doing it in a single function, since anyway such function is doing another function call regardless. Moreover it would be important to be able to avoid setting such API in sdsalloc.h, so that SDS can work without that. And this is how we should ship SDS from upstream (https://github.com/antirez/sds) for max compatibility by default.

antirez · 2018-10-02T11:49:54Z

Ok that's the idea about the point "1" above: https://github.com/antirez/redis/compare/argv-accounting

oranagra · 2018-10-02T12:55:52Z

@antirez i'm not 100% sure i follow you, i'll try to respond, but if you feel that i lost you, please help me get back on the track.

your 1,2,3 are talking on each of the 3 commits, right?
in that case:

indeed the main purpose of this PR is to be able to detect that crazy client(s), so in that sense, the sdslen may be enough, but if possible i would like to get a little bit closer to the trough, since sdslen is not only missing the fragmentation, but also the "free" portion of the sds, which may be very big.
this is why my fist commit used sdsAllocSize, rather than sdslen... i.e. i wanted to avoid the unknown cost of zmalloc_size, and use only data that is present in the sds header anyway.
I'm not sure why you consider sdsAllocSize another cache miss more than sdslen?
both variables are part of the sds header.
Do you mean code cache? maybe you prefer to use sdsalloc instead of sdsAllocSize?
since sdsalloc is in in sds.h and is most likely inlined by the compiler? (we can probably move sdsAllocSize and sdsHdrSizeto sds.h too)

besides that, IIUC your code from the above mentioned branch, it looks like c->argv_bytes is mixing values from various sources, in which case there may be a creeping leak. i.e. if one place adds sdslen, and the other adds or deducts getStringObjectSdsUsedMemory. we must make sure we always use the same source.

i agree, the only sensible use for MEMORY USAGE is to try and understand used_memory (from INFO MEMORY), and both should be of the same units (with internal fragmentation).
what i wanted to avoid in that commit, in not the extra call to a function in zmalloc.c, but the extra call to the allocator library. i don't want two consecutive calls to malloc_size and it's unknown cost. especially if we have it for free.
so i'm willing to slightly "pollute" the cleanness of the standalone sds repository, for a slightly better redis.

oranagra · 2018-10-08T17:27:29Z

@antirez you may have missed my response above.
where do we stand about this? do you want to add this info to redis one way or another?

antirez · 2018-10-09T16:03:54Z

@oranagra thanks, yes checking those as well, started from the bug fixes to release a new RC ASAP.

oranagra · 2020-10-01T08:37:21Z

Closing this old PR, changes and above feedback where ported to the above 3 mentioned PRs

This was referenced Jul 22, 2018

Make INFO faster again!11one #5145

Closed

fix recursion typo in zmalloc_usable #5160

Merged

antirez closed this Jul 23, 2018

antirez reopened this Jul 23, 2018

oranagra added 3 commits July 31, 2018 18:41

memory reporting of clients argv

c255620

track and report memory used by clients argv. this is very usaful in case clients started sending a command and didn't complete it. in which case the first args of the command are already trimmed from the query buffer.

Include internal sds fragmentation in MEMORY reporting

de2be0f

oranagra mentioned this pull request Sep 30, 2018

Preparing for Redis 5 GA: What issues / PRs to address? #5399

Closed

oranagra mentioned this pull request Aug 18, 2020

[NEW] Client "eviction" - drop clients when their total buffer overhead is over a limit #7676

Closed

oranagra mentioned this pull request Sep 14, 2020

Redis high memory problem #7787

Closed

This was referenced Sep 30, 2020

Include internal sds fragmentation in MEMORY reporting #7864

Merged

memory reporting of clients argv #7874

Merged

performance and memory reporting improvement - sds take control of it's internal frag #7875

Merged

oranagra closed this Oct 1, 2020

memory reporting of clients argv and scripts #5159

memory reporting of clients argv and scripts #5159

Uh oh!

Conversation

oranagra commented Jul 22, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

itamarhaber commented Jul 22, 2018

Uh oh!

oranagra commented Jul 22, 2018

Uh oh!

antirez commented Jul 23, 2018

Uh oh!

antirez commented Jul 23, 2018

Uh oh!

oranagra commented Jul 23, 2018

Uh oh!

oranagra commented Jul 31, 2018

Uh oh!

antirez commented Jul 31, 2018

Uh oh!

antirez commented Jul 31, 2018

Uh oh!

oranagra commented Jul 31, 2018

Uh oh!

antirez commented Oct 2, 2018

Uh oh!

oranagra commented Oct 2, 2018

Uh oh!

antirez commented Oct 2, 2018

Uh oh!

antirez commented Oct 2, 2018

Uh oh!

oranagra commented Oct 2, 2018

Uh oh!

oranagra commented Oct 8, 2018

Uh oh!

antirez commented Oct 9, 2018

Uh oh!

oranagra commented Oct 1, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

oranagra commented Jul 22, 2018 •

edited

Loading