Workspace pod may stall during rollout #801
Labels
kind/bug
Some behavior is incorrect or out of spec
needs-triage
Needs attention from the triage team
Comments
A k3d cluster configuration for testing the behavior of
apiVersion: k3d.io/v1alpha5
kind: Simple
metadata:
  name: issue-801
options:
  k3s:
    extraArgs:
      - arg: "--kube-apiserver-arg=feature-gates=MaxUnavailableStatefulSet=true"
        nodeFilters:
          - server:*
      - arg: "--kube-scheduler-arg=feature-gates=MaxUnavailableStatefulSet=true"
        nodeFilters:
          - server:*
      - arg: "--kubelet-arg=feature-gates=MaxUnavailableStatefulSet=true"
        nodeFilters:
          - agent:*
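For reference, the MaxUnavailableStatefulSet feature gate enabled in this configuration controls whether the .spec.updateStrategy.rollingUpdate.maxUnavailable field of a StatefulSet is honored. A minimal fragment that exercises the field might look like the following; the value is arbitrary and chosen only for illustration, and this is not taken from the operator's actual manifest.

```yaml
# Fragment of a StatefulSet spec; only the update strategy is shown.
# Assumption: the value 2 is illustrative. The Kubernetes default is 1.
spec:
  updateStrategy:
    type: RollingUpdate
    rollingUpdate:
      # Honored only when the MaxUnavailableStatefulSet feature gate is on.
      maxUnavailable: 2
```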
I experimented with the Parallel strategy, and it does seem effective even when the
I also experimented with the OnDelete strategy and found it difficult to implement for the following reasons:
What happened?
When the workspace spec is undeployable, e.g. due to an invalid Docker image, the rollout fails. Unfortunately, updating the spec afterwards does not unblock the rollout, due to a limitation of StatefulSet:
https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/#forced-rollback
One must manually delete the workspace pod to unblock the system.
See also:
Example
Lots of ways to trigger this:
Output of pulumi about
PKO v2.0.0-beta.3
Additional context
Let's keep in mind some requirements.
A possible solution may be to use the "parallel" pod management strategy:
https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/#parallel-pod-management
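As a rough sketch of what that would look like on the StatefulSet itself, the manifest below sets podManagementPolicy to Parallel. Every name, label, and image here is a hypothetical placeholder, not the manifest the operator actually generates.

```yaml
# Hypothetical StatefulSet sketch; names, labels, and image are placeholders.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: example-workspace
spec:
  serviceName: example-workspace
  replicas: 1
  # The default is OrderedReady, which is the policy the forced-rollback
  # limitation in the Kubernetes docs is described for.
  podManagementPolicy: Parallel
  selector:
    matchLabels:
      app: example-workspace
  template:
    metadata:
      labels:
        app: example-workspace
    spec:
      containers:
        - name: workspace
          image: example.invalid/workspace:latest
```

Note that podManagementPolicy cannot be changed on an existing StatefulSet, so adopting this approach would probably mean recreating the StatefulSet for workspaces that are already deployed.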
Contributing
Vote on this issue by adding a 👍 reaction.
To contribute a fix for this issue, leave a comment (and link to your pull request, if you've opened one already).