memcached: Make config non-HA-aware (bsc#1038223) by cmurphy · Pull Request #1341 · crowbar/crowbar-openstack

cmurphy · 2017-10-02T12:50:51Z

Without this patch, when deployed in an HA configuration, all of the
barclamps set their cache servers to all of the memcached servers in the
cluster in lexicographical order. This is not an optimal way to
configure memcached servers, since if part of the cluster is down, the
memcached servers living on it will be inaccessible. The OpenStack
service is not tied to pacemaker and has no way of knowing that, so it
passes the entire list to python-memcached, which attempts to connect to
each server serially, not attempting the next one until the previous one
times out. The effect is that any query to the OpenStack service will
take a very long time if the first memcached server in the list is down.
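
To make the cost concrete, here is a minimal sketch that models the
serial-connect behavior described above. It is not python-memcached's
actual code: first_reachable, make_fake_connect, the addresses and the
short timeout are all hypothetical, and the "connection" is simulated so
the example runs without a real memcached.

```python
import time


def first_reachable(servers, connect, timeout):
    """Try each server in list order; return (server, seconds_spent).

    `connect` is any callable that raises OSError when a server cannot
    be reached. This is an illustrative model, not a library API.
    """
    start = time.monotonic()
    for server in servers:
        try:
            connect(server, timeout)
            return server, time.monotonic() - start
        except OSError:
            # Only move on to the next server after this one has failed.
            continue
    return None, time.monotonic() - start


def make_fake_connect(down):
    """Build a fake connect(): servers in `down` block for the full timeout."""
    def connect(server, timeout):
        if server in down:
            time.sleep(timeout)
            raise TimeoutError(f"{server} timed out")
    return connect


if __name__ == "__main__":
    # Hypothetical cluster, sorted lexicographically; the first node is down.
    cluster = ["10.0.0.1:11211", "10.0.0.2:11211", "10.0.0.3:11211"]
    connect = make_fake_connect(down={"10.0.0.1:11211"})

    server, spent = first_reachable(cluster, connect, timeout=0.05)
    print(f"full list:  reached {server} after {spent:.3f}s")

    server, spent = first_reachable(["10.0.0.2:11211"], connect, timeout=0.05)
    print(f"local only: reached {server} after {spent:.3f}s")
```

With the full list, every lookup pays the whole timeout for the dead
first server before reaching a live one. With only the local server
there is nothing to wait for.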

This patch fixes the issue by only using the local memcached server
instead of using all in the cluster. This also adjusts the
get_memcached_servers helper method to only accept one node as input
since, knowing what we now know, we're unlikely to need more than one.
The get_memcached_servers method was implemented while updating the
barclamps to prevent deprecation warnings emitted by keystonemiddleware
in Ocata[1] and was mimicking old behavior used to set the cache servers
for keystone and nova.

[1] https://docs.openstack.org/releasenotes/keystonemiddleware/ocata.html
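
The difference between the old and new server selection can be sketched
roughly as follows. This is only an illustration in Python: the real
get_memcached_servers helper lives in the barclamp code and is not shown
in this thread, so the function name, its parameters and the addresses
below are assumptions. Port 11211 is memcached's default port.

```python
def memcached_servers(local_address, cluster_addresses=(), port=11211,
                      ha_aware=False):
    """Return the "host:port" list a service would be configured with.

    Hypothetical stand-in for the helper discussed above, not its
    actual implementation.
    """
    if ha_aware:
        # Old behavior: every memcached in the cluster, sorted
        # lexicographically.
        addresses = sorted(set(cluster_addresses) | {local_address})
    else:
        # New behavior: only the memcached running on the local node.
        addresses = [local_address]
    return [f"{address}:{port}" for address in addresses]


if __name__ == "__main__":
    cluster = ["192.168.124.83", "192.168.124.81", "192.168.124.82"]
    print(memcached_servers("192.168.124.82", cluster, ha_aware=True))
    print(memcached_servers("192.168.124.82", cluster))
```

The non-HA-aware variant takes a single node as input, which matches the
reasoning above that more than one is unlikely to be needed.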

cmurphy · 2017-10-02T12:51:42Z

Cloud7 version is here: #1340 (not cherry-picked)

dirkmueller · 2017-10-04T09:02:43Z

NoMethodError: undefined method `get_memcached_servers' for MemcachedHelper:Module

we need some other patch elsewhere?

where is the code that sets up memcached replication? basically when starting memcached we need to tell it which one is the replication master and how to reach it.

cmurphy · 2017-10-04T14:44:28Z

@dirkmueller I had forgotten that the swift barclamp was still using that method name.

I decided that since swift is special I would rather leave it alone, so I changed the method name back to plural and reverted the variable name changes, so this patch is a lot smaller now.

Without this patch, when deployed in an HA configuration, all of the barclamps set their cache servers to all of the memcached servers in the cluster in lexicographical order. This is not an optimal way to configure memcached servers, since if part of the cluster is down, the memcached servers living on it will be inaccessible. The OpenStack service is not tied to pacemaker and has no way of knowing that, so it passes the entire list to python-memcached, which attempts to connect to each server serially, not attempting the next one until the previous one times out. The effect is that any query to the OpenStack service will take a very long time if the first memcached server in the list is down.

This patch fixes the issue by only using the local memcached server instead of using all in the cluster. This is done for all barclamps using memcached except for swift, since swift has its own way of doing HA without pacemaker and also implements its own memcached client, so we might as well leave it alone.

cmurphy · 2017-10-12T15:10:27Z

Closing for the reasons given here: #1340 (comment)

cmurphy requested review from dirkmueller and stefannica October 2, 2017 12:51

cmurphy mentioned this pull request Oct 2, 2017

[4.0] memcached: Make config non-HA-aware (bsc#1038223) #1340

Closed

stefannica previously approved these changes Oct 2, 2017

nicolasbock previously approved these changes Oct 2, 2017

dirkmueller added the needs backport to SOC7 (stable/4.0) label Oct 4, 2017

cmurphy dismissed stale reviews from nicolasbock and stefannica via a16aa9a October 4, 2017 14:40

cmurphy force-pushed the fix-memcached branch from 310a318 to a16aa9a Compare October 4, 2017 14:40

dirkmueller added needs backport to SOC7 and removed needs backport to SOC7 (stable/4.0) labels Oct 5, 2017

cmurphy force-pushed the fix-memcached branch from a16aa9a to 91f3ab3 Compare October 5, 2017 11:34

cmurphy added the wip label Oct 6, 2017

cmurphy closed this Oct 12, 2017

vuntz mentioned this pull request Nov 14, 2017

Set socket_timeout for memcached connections to 1s #1429

Merged

cmurphy wants to merge 1 commit into crowbar:master from cmurphy:fix-memcached

4 participants
