This repository has been archived by the owner on Feb 26, 2025. It is now read-only.

Kubectl drain errors on test #38

Closed
paulgmiller opened this issue Dec 12, 2024 · 4 comments

Comments

@paulgmiller
Owner

We're using GracefulShutdownTimeout: -1, so our webhook should drain, but it still fails.
It could be either konnectivity or kube-proxy not updating fast enough. I think it's not konnectivity, because I see that being evicted well before the error.

So I think I'll try holding SIGTERM for a while so kube-proxy can update? But it's still a mystery.

error when evicting pods/"store-front-bfb8f484c-mwxd4" -n "pets": Internal error occurred: failed calling webhook "eviction.mydomain.com": failed to call webhook: Post "https://eviction-webhook.default.svc:443/validate-eviction?timeout=10s": EOF
error when evicting pods/"makeline-service-5f6dcd9d4d-rpv8t" -n "pets": Internal error occurred: failed calling webhook "eviction.mydomain.com": failed to call webhook: Post "https://eviction-webhook.default.svc:443/validate-eviction?timeout=10s": EOF
error when evicting pods/"controller-manager-779cf5b64b-nc6kz" -n "default": Internal error occurred: failed calling webhook "eviction.mydomain.com": failed to call webhook: Post "https://eviction-webhook.default.svc:443/validate-eviction?timeout=10s": EOF
error when evicting pods/"store-admin-8f6db4878-4ftjb" -n "pets": Internal error occurred: failed calling webhook "eviction.mydomain.com": failed to call webhook: Post "https://eviction-webhook.default.svc:443/validate-eviction?timeout=10s": EOF
error when evicting pods/"rabbitmq-0" -n "pets": Internal error occurred: failed calling webhook "eviction.mydomain.com": failed to call webhook: Post "https://eviction-webhook.default.svc:443/validate-eviction?timeout=10s": EOF
error when evicting pods/"mongodb-0" -n "pets": Internal error occurred: failed calling webhook "eviction.mydomain.com": failed to call webhook: Post "https://eviction-webhook.default.svc:443/validate-eviction?timeout=10s": EOF

@paulgmiller changed the title from "Kubectl" to "Kubectl drain errors on test" on Dec 12, 2024
@paulgmiller
Owner Author

It turns out that evicting the konnectivity agent seems to cause this. If I exclude it from a node drain, the errors don't reproduce (I used #39 to prove this).

@paulgmiller
Owner Author

Readiness now checks the API server connection:
kubernetes-sigs/apiserver-network-proxy#491
kubernetes-sigs/apiserver-network-proxy#485

@paulgmiller
Owner Author

Ah maybe here kubernetes-sigs/apiserver-network-proxy#566

@paulgmiller
Owner Author

Probably can't fix this. Deprioritized, as we're moving away from the webhook. Still, it's a little sad that any eviction webhook would cause these failures in kubectl drain, though the evictions do work on retry.
