Skip to content

Commit e5a56f9

Browse files
lmicciniopenshift-merge-bot[bot]
authored andcommitted
Configure dogpile.cache to deal with memcached pods failures
Whenever one of the mecached pods disappears, because of a rolling restart during a minor update or as result of a failure, APIs can take a long time to detect that the pod went away and keep trying to reconnect. From a quick round of tests we saw downtimes up to ~150s. By enabling the retry_client and limiting the number of retries the behavior seems much more acceptable. Similarly, when TLS is not in use, we may want to set a lower value for memcache_dead_retry so to eventually reconnect to a new pod (having the same dns name but different ip) much faster. Jira: https://issues.redhat.com/browse/OSPRH-11935
1 parent d7e33fc commit e5a56f9

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

templates/nova.conf

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -173,9 +173,13 @@ enabled = True
173173
{{if .MemcachedTLS}}
174174
backend = dogpile.cache.pymemcache
175175
memcache_servers={{ .MemcachedServers }}
176+
enable_retry_client = true
177+
retry_attempts = 2
178+
retry_delay = 0
176179
{{else}}
177180
backend = dogpile.cache.memcached
178181
memcache_servers={{ .MemcachedServersWithInet }}
182+
memcache_dead_retry = 10
179183
{{end}}
180184
tls_enabled={{ .MemcachedTLS }}
181185
{{else}}

0 commit comments

Comments
 (0)