-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
K8SPG-708 replace ready/live probe http check with custom command, change pg entrypoint #1099
base: main
Are you sure you want to change the base?
Conversation
internal/patroni/reconcile.go
Outdated
container.LivenessProbe.Exec = &corev1.ExecAction{ | ||
Command: []string{"/usr/local/bin/postgres-liveness-check.sh"}, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe we want to do this change only for >2.7.0 🤔
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done here: c377050
Path: "/readiness", | ||
Port: intstr.FromInt(int(*cluster.Spec.Patroni.Port)), | ||
Scheme: corev1.URISchemeHTTPS, | ||
container.ReadinessProbe.ProbeHandler = readinessProbe(cluster) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please add comments for your changes in upstream code
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
e2e-tests/vars.sh
Outdated
@@ -14,7 +14,7 @@ export IMAGE_BASE=${IMAGE_BASE:-"perconalab/percona-postgresql-operator"} | |||
export IMAGE=${IMAGE:-"${IMAGE_BASE}:${VERSION}"} | |||
export PG_VER="${PG_VER:-16}" | |||
export IMAGE_PGBOUNCER=${IMAGE_PGBOUNCER:-"${IMAGE_BASE}:main-ppg$PG_VER-pgbouncer"} | |||
export IMAGE_POSTGRESQL=${IMAGE_POSTGRESQL:-"${IMAGE_BASE}:main-ppg$PG_VER-postgres"} | |||
export IMAGE_POSTGRESQL=${IMAGE_POSTGRESQL:-"${IMAGE_BASE}:K8SPG-708-ppg$PG_VER-postgres"} | |||
export IMAGE_BACKREST=${IMAGE_BACKREST:-"${IMAGE_BASE}:main-ppg$PG_VER-pgbackrest"} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
probably you will return this value to export IMAGE_POSTGRESQL=${IMAGE_POSTGRESQL:-"${IMAGE_BASE}:main-ppg$PG_VER-postgres"} .
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that once we merge and generate new ppg images with the updated scripts, we will open a new PR to revert this. But for now I think it should stay as is, otherwise the tests will fail.
// the database container is considered "alive" beyond basic checks. | ||
// Introduced with K8SPG-708. | ||
func readinessProbe(cluster *v1beta1.PostgresCluster) corev1.ProbeHandler { | ||
if cluster.CompareVersion("2.7.0") >= 0 { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My favorite comment. I like more to use like this:
Motivation:
Easy to refactor, depricated value should be under condition since it doesn't use in version 2.7.0 and above (and when we should change something we should change new behaviour, not old one)
func livenessProbe(cluster *v1beta1.PostgresCluster) corev1.ProbeHandler {
if cluster.CompareVersion("2.7.0") < 0 {
return corev1.ProbeHandler{
HTTPGet: &corev1.HTTPGetAction{
Path: "/liveness",
Port: intstr.FromInt(int(*cluster.Spec.Patroni.Port)),
Scheme: corev1.URISchemeHTTPS,
},
}
}
return corev1.ProbeHandler{
Exec: &corev1.ExecAction{
Command: []string{"bash", "-c", "/usr/local/bin/postgres-liveness-check.sh"},
},
}
return corev1.ProbeHandler{
HTTPGet: &corev1.HTTPGetAction{
Path: "/liveness",
Port: intstr.FromInt(int(*cluster.Spec.Patroni.Port)),
Scheme: corev1.URISchemeHTTPS,
},
}
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The main reason I prefer the existing approach is because it has as "default" non-conditional behaviour the old implementation of the probes, while the new approach is conditioned to only the latest CR version. Since it is already a very small function that is also unit tested, it is very compact and refactoring it internally in any way we want in the future will not cause us any trouble. Usually the case you are mentioning could cause problems in larger functions with more complicated logic.
cluster := new(v1beta1.PostgresCluster) | ||
err := cluster.Default(context.Background(), nil) | ||
assert.NilError(t, err) | ||
cluster.Name = "some-such" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we have any agreement on how to name variables in tests, or does it not matter in this case?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is upstream code that was just moved to a different place. if you notice the diff you will see where it was before.
CHANGE DESCRIPTION
Problem:
We are changing the probes from direct requests to patroni's restful api to a custom command so that we can get more flexibility on what can be executed for the live/ready checks of the pg container. For now this is needed so that we can introduce a sleep forever feature. Related PR: percona/percona-docker#1148
We are also introducing a new entrypoint for the database container which is defined in the percona-docker repo for now.
Testing sleep forever
PPG image used:
perconalab/percona-postgresql-operator:K8SPG-708-ppg17-postgres
Cause:
Short explanation of the root cause of the issue if applicable.
Solution:
Short explanation of the solution we are providing with this PR.
CHECKLIST
Jira
Needs Doc
) and QA (Needs QA
)?Tests
Config/Logging/Testability