Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

K8SPG-708 replace ready/live probe http check with custom command, change pg entrypoint #1099

Open
wants to merge 15 commits into
base: main
Choose a base branch
from

Conversation

gkech
Copy link
Contributor

@gkech gkech commented Mar 21, 2025

K8SPG-708 Powered by Pull Request Badge

CHANGE DESCRIPTION

Problem:
We are changing the probes from direct requests to patroni's restful api to a custom command so that we can get more flexibility on what can be executed for the live/ready checks of the pg container. For now this is needed so that we can introduce a sleep forever feature. Related PR: percona/percona-docker#1148

We are also introducing a new entrypoint for the database container which is defined in the percona-docker repo for now.

Testing sleep forever

Screenshot 2025-03-28 at 5 52 53 PM

PPG image used: perconalab/percona-postgresql-operator:K8SPG-708-ppg17-postgres

Cause:
Short explanation of the root cause of the issue if applicable.

Solution:
Short explanation of the solution we are providing with this PR.

CHECKLIST

Jira

  • Is the Jira ticket created and referenced properly?
  • Does the Jira ticket have the proper statuses for documentation (Needs Doc) and QA (Needs QA)?
  • Does the Jira ticket link to the proper milestone (Fix Version field)?

Tests

  • Is an E2E test/test case added for the new feature/change?
  • Are unit tests added where appropriate?

Config/Logging/Testability

  • Are all needed new/changed options added to default YAML files?
  • Are all needed new/changed options added to the Helm Chart?
  • Did we add proper logging messages for operator actions?
  • Did we ensure compatibility with the previous version or cluster upgrade process?
  • Does the change support oldest and newest supported PG version?
  • Does the change support oldest and newest supported Kubernetes version?

Comment on lines 151 to 152
container.LivenessProbe.Exec = &corev1.ExecAction{
Command: []string{"/usr/local/bin/postgres-liveness-check.sh"},
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe we want to do this change only for >2.7.0 🤔

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done here: c377050

@gkech gkech marked this pull request as ready for review March 24, 2025 11:25
Path: "/readiness",
Port: intstr.FromInt(int(*cluster.Spec.Patroni.Port)),
Scheme: corev1.URISchemeHTTPS,
container.ReadinessProbe.ProbeHandler = readinessProbe(cluster)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please add comments for your changes in upstream code

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@egegunes have a look on this: 2088d85, and let me know if we are looking for something different

@gkech gkech requested a review from egegunes March 26, 2025 08:02
egegunes
egegunes previously approved these changes Mar 26, 2025
@@ -14,7 +14,7 @@ export IMAGE_BASE=${IMAGE_BASE:-"perconalab/percona-postgresql-operator"}
export IMAGE=${IMAGE:-"${IMAGE_BASE}:${VERSION}"}
export PG_VER="${PG_VER:-16}"
export IMAGE_PGBOUNCER=${IMAGE_PGBOUNCER:-"${IMAGE_BASE}:main-ppg$PG_VER-pgbouncer"}
export IMAGE_POSTGRESQL=${IMAGE_POSTGRESQL:-"${IMAGE_BASE}:main-ppg$PG_VER-postgres"}
export IMAGE_POSTGRESQL=${IMAGE_POSTGRESQL:-"${IMAGE_BASE}:K8SPG-708-ppg$PG_VER-postgres"}
export IMAGE_BACKREST=${IMAGE_BACKREST:-"${IMAGE_BASE}:main-ppg$PG_VER-pgbackrest"}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

probably you will return this value to export IMAGE_POSTGRESQL=${IMAGE_POSTGRESQL:-"${IMAGE_BASE}:main-ppg$PG_VER-postgres"} .

Copy link
Contributor Author

@gkech gkech Mar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think that once we merge and generate new ppg images with the updated scripts, we will open a new PR to revert this. But for now I think it should stay as is, otherwise the tests will fail.

// the database container is considered "alive" beyond basic checks.
// Introduced with K8SPG-708.
func readinessProbe(cluster *v1beta1.PostgresCluster) corev1.ProbeHandler {
if cluster.CompareVersion("2.7.0") >= 0 {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My favorite comment. I like more to use like this:
Motivation:
Easy to refactor, depricated value should be under condition since it doesn't use in version 2.7.0 and above (and when we should change something we should change new behaviour, not old one)

func livenessProbe(cluster *v1beta1.PostgresCluster) corev1.ProbeHandler {
	if cluster.CompareVersion("2.7.0") < 0 {
		return corev1.ProbeHandler{
		HTTPGet: &corev1.HTTPGetAction{
			Path:   "/liveness",
			Port:   intstr.FromInt(int(*cluster.Spec.Patroni.Port)),
			Scheme: corev1.URISchemeHTTPS,
		},
	}
	}
        return corev1.ProbeHandler{
	      Exec: &corev1.ExecAction{
		      Command: []string{"bash", "-c", "/usr/local/bin/postgres-liveness-check.sh"},
	      },
      }
	return corev1.ProbeHandler{
		HTTPGet: &corev1.HTTPGetAction{
			Path:   "/liveness",
			Port:   intstr.FromInt(int(*cluster.Spec.Patroni.Port)),
			Scheme: corev1.URISchemeHTTPS,
		},
	}
}

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The main reason I prefer the existing approach is because it has as "default" non-conditional behaviour the old implementation of the probes, while the new approach is conditioned to only the latest CR version. Since it is already a very small function that is also unit tested, it is very compact and refactoring it internally in any way we want in the future will not cause us any trouble. Usually the case you are mentioning could cause problems in larger functions with more complicated logic.

cluster := new(v1beta1.PostgresCluster)
err := cluster.Default(context.Background(), nil)
assert.NilError(t, err)
cluster.Name = "some-such"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we have any agreement on how to name variables in tests, or does it not matter in this case?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is upstream code that was just moved to a different place. if you notice the diff you will see where it was before.

@gkech gkech requested a review from nmarukovich March 26, 2025 18:04
@gkech gkech requested a review from egegunes March 28, 2025 15:58
@egegunes egegunes added this to the 2.7.0 milestone Mar 31, 2025
@gkech gkech changed the title K8SPG-708 replace ready/live probe http check with custom command K8SPG-708 replace ready/live probe http check with custom command, change pg entrypoint Apr 1, 2025
egegunes
egegunes previously approved these changes Apr 1, 2025
@gkech gkech requested a review from egegunes April 3, 2025 08:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants