[JENKINS-75945] Jenkins Kubernetes Plugin Retains Stale Node Directories After Failed Pod Creation

Problem: When using the Jenkins Kubernetes Cloud Plugin in a namespace with limited resources, jobs frequently attempt to create multiple pods. If initial pod creation fails due to resource quota limits, the plugin retries with new pod names. Each failed pod attempt results in a new node directory (<tt>${JENKINS_HOME}/nodes/<pod_name></tt>) being created. However, these directories are never cleaned up if the pods are not successfully created.

Impact:
<ul>
	<li>Thousands of stale node directories accumulate over time.</li>
</ul>


<ul>
	<li>Jenkins startup becomes extremely slow or crashes due to the volume of entries in the nodes directory.</li>
</ul>


<ul>
	<li>Manual cleanup becomes a recurring necessity to ensure Jenkins remains operational.</li>
</ul>


Expected Behavior: The plugin should automatically remove node directories for pods that were never successfully created.

 

A single job that waits for resources in the mentioned namespace can generate up to 144 stale directories that are not being deleted. After a while there are thousands of such directories.

 

Examples:
<ol>
	<li>Error that is being logged when the resources are missing: see <a href="https://issues.jenkins.io/secure/attachment/64722/64722_out-of-resources.txt" title="out-of-resources.txt attached to JENKINS-75945">out-of-resources.txt<img class="rendericon" src="https://issues.jenkins.io/images/icons/link_attachment_7.gif" height="7" width="7" align="absmiddle" alt="" border="0"/></a></li>
	<li>Error that is being logged when trying to start Jenkins up (and failing because of the volume of files present in ${JENKINS_HOME}/nodes: <a href="https://issues.jenkins.io/secure/attachment/64727/64727_failed-to-start.txt" title="failed-to-start.txt attached to JENKINS-75945">failed-to-start.txt<img class="rendericon" src="https://issues.jenkins.io/images/icons/link_attachment_7.gif" height="7" width="7" align="absmiddle" alt="" border="0"/></a></li>
</ol>


 

---
<details><summary>Originally reported by <a href="https://issues.jenkins.io/secure/ViewProfile.jspa?name=brotholomew">brotholomew</a>, imported from: <a class="no-jira-link-rewrite" href="https://issues.jenkins.io/browse/JENKINS-75945" target="_blank">Jenkins Kubernetes Plugin Retains Stale Node Directories After Failed Pod Creation</a></summary>
<ul>
<li>status: Open
<li>priority: Major
<li>component(s): kubernetes-plugin
<li>resolution: Unresolved
<li>votes: 2
<li>watchers: 2
<li>imported: 2025-12-02
</ul>
<details><summary>Raw content of original issue</summary>

<pre>
Problem: When using the Jenkins Kubernetes Cloud Plugin in a namespace with limited resources, jobs frequently attempt to create multiple pods. If initial pod creation fails due to resource quota limits, the plugin retries with new pod names. Each failed pod attempt results in a new node directory (<tt>${JENKINS_HOME}/nodes/&lt;pod_name&gt;</tt>) being created. However, these directories are never cleaned up if the pods are not successfully created.

Impact:
<ul>
	<li>Thousands of stale node directories accumulate over time.</li>
</ul>


<ul>
	<li>Jenkins startup becomes extremely slow or crashes due to the volume of entries in the nodes directory.</li>
</ul>


<ul>
	<li>Manual cleanup becomes a recurring necessity to ensure Jenkins remains operational.</li>
</ul>


Expected Behavior: The plugin should automatically remove node directories for pods that were never successfully created.

 

A single job that waits for resources in the mentioned namespace can generate up to 144 stale directories that are not being deleted. After a while there are thousands of such directories.

 

Examples:
<ol>
	<li>Error that is being logged when the resources are missing: see <a href="https://issues.jenkins.io/secure/attachment/64722/64722_out-of-resources.txt" title="out-of-resources.txt attached to JENKINS-75945">out-of-resources.txt<img class="rendericon" src="https://issues.jenkins.io/images/icons/link_attachment_7.gif" height="7" width="7" align="absmiddle" alt="" border="0"/></a></li>
	<li>Error that is being logged when trying to start Jenkins up (and failing because of the volume of files present in ${JENKINS_HOME}/nodes: <a href="https://issues.jenkins.io/secure/attachment/64727/64727_failed-to-start.txt" title="failed-to-start.txt attached to JENKINS-75945">failed-to-start.txt<img class="rendericon" src="https://issues.jenkins.io/images/icons/link_attachment_7.gif" height="7" width="7" align="absmiddle" alt="" border="0"/></a></li>
</ol>


 </pre>
</details>
</details>
<details><summary>environment</summary>

```
Jenkins: 2.504.2 LTS 
kubernetes-client-api: 6.10.0-251.v556f5f100500 
kubernetes-credentials: 192.v4d5b_1c429d17 
kubernetes: 4353.vb_47977da_9417 
Jenkins is running on a pod in a kube cluster in a container based on jenkins/jenkins:2.504.2-lts-jdk21
```
</details>
<details><summary>2 attachments</summary>

- [failed-to-start.txt](https://raw.githubusercontent.com/jenkinsci/attachments-from-jira-issues-last/refs/heads/main/attachments/64727/failed-to-start.txt)
- [out-of-resources.txt](https://raw.githubusercontent.com/jenkinsci/attachments-from-jira-issues-last/refs/heads/main/attachments/64722/out-of-resources.txt)
</details>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[JENKINS-75945] Jenkins Kubernetes Plugin Retains Stale Node Directories After Failed Pod Creation #2758

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[JENKINS-75945] Jenkins Kubernetes Plugin Retains Stale Node Directories After Failed Pod Creation #2758

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions