Some runners pods are never terminated #3903
Comments
Hi @julien-michaud , I experience the same behaviour (I'm using the containerMode kubernetes). I checked the processes of one of these instances and the steps were done (no more processes from the workflow) but the container stays there doing... nothing. I encounter this issue especially when I have long running commands that don't write to the output ( |
Hi @julien-michaud 👋 We reached out to GCP support, and they confirmed a regression introduced with the Container-Optimized OS (COS) versions between Here is what they suggested:
Based on the configuration you shared, I assume you're also running the runners on GKE. I hope this helps! |
Thanks a lot for the info @prizov! We just upgraded to |
Checks
Controller Version
0.10.1
Deployment Method
Helm
To Reproduce
Describe the bug
Sometimes, the runner pods continue running in zombie mode after completing their jobs.
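One way to spot such pods is to look for runner pods that are still in the Running phase long after they started. The following is only a minimal sketch, not part of the controller: the function name `find_long_running_pods` and the age threshold are hypothetical, and it assumes the input is the JSON produced by `kubectl get pods -o json`. A pod that has been running for a long time is not necessarily stuck, since a job may legitimately run that long, so the result is a list of candidates to inspect rather than a diagnosis.

```python
from datetime import datetime, timedelta, timezone


def find_long_running_pods(pod_list, max_age, now=None):
    """Return names of pods in phase Running that started more than max_age ago.

    pod_list is the parsed JSON of a Kubernetes pod list.
    max_age is a hypothetical threshold chosen by the caller.
    """
    now = now or datetime.now(timezone.utc)
    candidates = []
    for pod in pod_list.get("items", []):
        status = pod.get("status", {})
        start = status.get("startTime")
        # Skip pods that are not running or have not been started yet.
        if status.get("phase") != "Running" or start is None:
            continue
        # startTime is an RFC 3339 UTC timestamp such as 2025-01-01T12:00:00Z.
        started = datetime.strptime(start, "%Y-%m-%dT%H:%M:%SZ").replace(
            tzinfo=timezone.utc
        )
        if now - started > max_age:
            candidates.append(pod["metadata"]["name"])
    return candidates
```

To use it, save the pod list to a file with `kubectl get pods -o json`, load it with `json.load`, and call the function with a threshold longer than the longest expected job, for example `timedelta(hours=6)`.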
Describe the expected behavior
Runner pods should be terminated after job completion.
Additional Context
Controller Logs
Runner Pod Logs
https://gist.github.com/julien-michaud/ce2a1e5c5d494d89e09453f0b270a26f