Skip to content

Conversation

@enjoy-binbin
Copy link
Contributor

@enjoy-binbin enjoy-binbin commented Jul 26, 2022

The kill above is sometimes successful and sometimes already too late.
The PING in pysnc wrong offset test got rejected by bgsaveerr because
lastbgsave_status is C_ERR.

In theory, using diskless can avoid PING being affected, because when
the replica is dropped, we will kill the child with SIGUSR1, and this
will not affect lastbgsave_status.

Anyway, this kill is not particularly needed here, dropping the kill
is the best one, since we do have the waitForBgsave, so just let it
take care of the bgsave. No need for fast termination.

@enjoy-binbin
Copy link
Contributor Author

reported in daily: https://github.com/redis/redis/runs/7511033997?check_suite_focus=true#step:7:4183

[exception]: Executing test client: MISCONF Redis is configured to save RDB snapshots, but it's currently unable to persist to disk. Commands that may modify the data set are disabled, because this instance is configured to report errors during writes if RDB snapshotting fails (stop-writes-on-bgsave-error option). Please check the Redis logs for details about the RDB error..
MISCONF Redis is configured to save RDB snapshots, but it's currently unable to persist to disk. Commands that may modify the data set are disabled, because this instance is configured to report errors during writes if RDB snapshotting fails (stop-writes-on-bgsave-error option). Please check the Redis logs for details about the RDB error.
    while executing
"[srv $level "client"] {*}$args"
    (procedure "r" line 7)
    invoked from within
"r ping"

@enjoy-binbin enjoy-binbin changed the title Use INFO instead of PING to check the server is alive in psync offset test Fix bgsaveerr issue in psync wrong offset test Jul 26, 2022
The kill above is sometimes successful and sometimes already too late.
The PING in pysnc wrong offset test got rejected by bgsaveerr because
lastbgsave_status is C_ERR.

In theory, using diskless can avoid PING being affected, because when
the replica is dropped, we will kill the child with SIGUSR1, and this
will not affect lastbgsave_status.

Anyway, this kill is not particularly needed here, dropping the kill
is the best one, since we do have the waitForBgsave, so just let it
take care of the bgsave. No need for fast termination.
Copy link
Member

@oranagra oranagra left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

for the record, i verified that with and without these kills, each of these tests takes roughly 100ms to complete.

@oranagra oranagra merged commit e714469 into redis:unstable Jul 27, 2022
@enjoy-binbin enjoy-binbin deleted the fix_psync_test branch July 27, 2022 14:26
arkamar added a commit to arkamar/gentoo that referenced this pull request Sep 23, 2022
This change backports patch from upstream PR gentoo#11043 in order to properly
solve bug #872278 reported for version 7.0.4 which affects version 7.0.5
as well. In upstream, the fix is not part of 7.0 branch, it is only
present in unstable branch.

Upstream-PR: redis/redis#11043
Bug: https://bugs.gentoo.org/860372
Bug: https://bugs.gentoo.org/872278
Signed-off-by: Petr Vaněk <arkamar@atlas.cz>
gentoo-bot pushed a commit to gentoo/gentoo that referenced this pull request Sep 25, 2022
This change backports patch from upstream PR #11043 in order to properly
solve bug #872278 reported for version 7.0.4 which affects version 7.0.5
as well. In upstream, the fix is not part of 7.0 branch, it is only
present in unstable branch.

Upstream-PR: redis/redis#11043
Bug: https://bugs.gentoo.org/860372
Bug: https://bugs.gentoo.org/872278
Signed-off-by: Petr Vaněk <arkamar@atlas.cz>
Signed-off-by: Sam James <sam@gentoo.org>
expeditioneer pushed a commit to expeditioneer/gentoo-portage that referenced this pull request Sep 27, 2022
This change backports patch from upstream PR gentoo#11043 in order to properly
solve bug #872278 reported for version 7.0.4 which affects version 7.0.5
as well. In upstream, the fix is not part of 7.0 branch, it is only
present in unstable branch.

Upstream-PR: redis/redis#11043
Bug: https://bugs.gentoo.org/860372
Bug: https://bugs.gentoo.org/872278
Signed-off-by: Petr Vaněk <arkamar@atlas.cz>
Signed-off-by: Sam James <sam@gentoo.org>
oranagra pushed a commit that referenced this pull request Dec 12, 2022
The kill above is sometimes successful and sometimes already too late.
The PING in pysnc wrong offset test got rejected by bgsaveerr because
lastbgsave_status is C_ERR.

In theory, using diskless can avoid PING being affected, because when
the replica is dropped, we will kill the child with SIGUSR1, and this
will not affect lastbgsave_status.

Anyway, this kill is not particularly needed here, dropping the kill
is the best one, since we do have the waitForBgsave, so just let it
take care of the bgsave. No need for fast termination.

(cherry picked from commit e714469)
enjoy-binbin added a commit to enjoy-binbin/redis that referenced this pull request Jul 31, 2023
The kill above is sometimes successful and sometimes already too late.
The PING in pysnc wrong offset test got rejected by bgsaveerr because
lastbgsave_status is C_ERR.

In theory, using diskless can avoid PING being affected, because when
the replica is dropped, we will kill the child with SIGUSR1, and this
will not affect lastbgsave_status.

Anyway, this kill is not particularly needed here, dropping the kill
is the best one, since we do have the waitForBgsave, so just let it
take care of the bgsave. No need for fast termination.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants