Feat/retries on conflict error #74

samuel-esp · 2024-12-14T17:30:12Z

Motivation

Give the user the possibility to choose how to handle HTTP 409 conflict errors. Such conflicts typically occur when another entity (such as an HPA, CI/CD pipeline, or manual intervention) modifies a resource just before KubeDownscaler processes it

See #68 or caas-team/py-kube-downscaler#111

Changes

Introduced --max-retries-on-conflict argument like in Py-Kube-Downscaler
Introduced GetWorkload() function to handle the use case when the downscaler needs to retrieve a single Kubernetes resource (before it was only possible to get a list of resources, i.e kubectl get deploy -n default). the old GetWorkload() was renamed to GetWorkloads() to reflect the changes
Introduced a new function GetResourceType() that returns the resource type (string)
Refactored the main loop to be able to use --max-retries-on-conflict

Tests done

Unit Tests

TODO

I've assigned myself to this PR
Refactored docs
Added more unit tests on this specific use case

jonathan-mayer

Also just so you know, the workflows were broken for forks (thats why they were failing). We've fixed it but it will now run every workflow twice in this and the other pr. If you want you can rebase the branches with main and the errors will go away.

cmd/kubedownscaler/main.go

internal/pkg/scalable/cronjobs.go

samuel-esp · 2025-02-19T23:34:08Z

@jonathan-mayer rebased and included the previous suggestions. Just a couple of things to note:

Pre-Commit failed because the specific getResourceFunc for each resource has duplicate code" detected (which is true because the logic is shared across them). So, eventually, we should try to think about a strategy to tackle that
Pre-Commit also suggested to lower a bit startScanning complexity. What would you refactor inside that?

jonathan-mayer · 2025-02-20T06:16:11Z

Ill have a look over it.

Pre-Commit failed because the specific getResourceFunc for each resource has duplicate code" detected (which is true because the logic is shared across them). So, eventually, we should try to think about a strategy to tackle that

I do get that it detects it as duplicate, but im not entirely sure how to avoid it. I think in theory it is avoidable by just having list and get in separate functions. Another way would also be to refactor to use the dynamic client again, although i would still like to avoid that.

Pre-Commit also suggested to lower a bit startScanning complexity. What would you refactor inside that?

I think we could refactor the function called in the go routine out.

samuel-esp · 2025-02-20T18:43:10Z

I perfectly agree with you, I don't see anything wrong with having them duplicate. I would nolint them
I'll extract the go routine from the function

samuel-esp · 2025-02-20T19:56:33Z

@jonathan-mayer I refactored the go func out of the startScanning function however I'm not sure what could be the best name for the that. attemptScan was the best I came up with

jonathan-mayer · 2025-02-21T05:58:49Z

@jonathan-mayer I refactored the go func out of the startScanning function however I'm not sure what could be the best name for the that. attemptScan was the best I came up with

i think the current function name should be good for now. Although i would not put a anonymous function in the attemptScan function, but instead just run the attemptScan function in a goroutine so go attemptScan(). Also just a heads up, it might take some time until i get around to reviewing this.

samuel-esp · 2025-02-21T07:23:12Z

Don't worry Jonathan, absolutely no rush take your time

jonathan-mayer · 2025-02-25T06:56:39Z

I think we should change up the GetResource funcs again. They should only handle listing and then every scalableResouce should have a re-get function which regets itself from kubernetes. it could like something like this:

func (c *cronJob) Reget(clientsets *Clientsets, ctx context.Context) {
	*c, err = clientsets.Kubernetes.BatchV1().CronJobs(c.Namespace).Get(c.Name, ctx,, metav1.GetOptions{})
	if err != nil {
		// TODO handle error
	}
}

cmd/kubedownscaler/main.go

internal/pkg/util/config.go

samuel-esp · 2025-02-25T21:15:57Z

I think we should change up the GetResource funcs again. They should only handle listing and then every scalableResouce should have a re-get function which regets itself from kubernetes. it could like something like this:
func (c *cronJob) Reget(clientsets *Clientsets, ctx context.Context) {
	*c, err = clientsets.Kubernetes.BatchV1().CronJobs(c.Namespace).Get(c.Name, ctx,, metav1.GetOptions{})
	if err != nil {
		// TODO handle error
	}
}

Implemented, the linter still complains about 2 things:

deployment/statefulset class duplication
RegetWorkload and its specific methods returns an interface type

I mean the first one could be true, for the second one to me it seems legit to return an interface type in that case

jonathan-mayer

looked at main and clientgo

cmd/kubedownscaler/main.go

internal/api/kubernetes/client.go

samuel-esp · 2025-02-26T21:03:51Z

refactored the way you suggested, still having "duplicate" suggestion on linting. I think we can't do much about that

cmd/kubedownscaler/main.go

internal/pkg/scalable/cronjobs.go

internal/pkg/scalable/workload.go

cmd/kubedownscaler/main.go

jonathan-mayer

I think this should be good to go

jonathan-mayer · 2025-03-04T11:52:04Z

Oh actually add the nolints for the dupl linter suggestion, then we can merge

jonathan-mayer assigned samuel-esp Jan 7, 2025

jonathan-mayer added the enhancement New feature or request label Jan 7, 2025

jonathan-mayer linked an issue Jan 7, 2025 that may be closed by this pull request

Allow for synchronous operation #68

Open

jonathan-mayer reviewed Jan 7, 2025

View reviewed changes

jonathan-mayer requested changes Jan 7, 2025

View reviewed changes

jonathan-mayer mentioned this pull request Feb 19, 2025

Allow for synchronous operation #68

Open

samuel-esp added 4 commits February 19, 2025 21:28

feat: added max-retries-on-conflict support for conflict errors

2333232

feat: added docs for --max-retries-on-conflict arg

0db75e8

feat: refactored troubleshooting.md

353aa67

refactor: rebased, refactored getResourceFunc logic

05b88bf

samuel-esp force-pushed the feat/retries-409-error branch from 27c4a65 to 05b88bf Compare February 19, 2025 22:48

samuel-esp marked this pull request as ready for review February 19, 2025 22:53

samuel-esp requested review from JTaeuber and jon4skl as code owners February 19, 2025 22:53

refactor: linter suggestions

f3d8fe6

refactor: moved scanning gofunc to scanAttempt function

2579f24

samuel-esp requested a review from jonathan-mayer February 20, 2025 19:53

refactor: renamed scanAttempt function

e6ca737

refactor: attemptScan go func

0ecb0fb

jonathan-mayer requested changes Feb 25, 2025

View reviewed changes

samuel-esp added 3 commits February 25, 2025 20:34

fix: max-retries-on-conflict renamed

16a31de

feat: introduced reget function

f2795e8

refactor: linter suggestions

4a2b206

samuel-esp requested a review from jonathan-mayer February 26, 2025 06:35

jonathan-mayer requested changes Feb 26, 2025

View reviewed changes

cmd/kubedownscaler/main.go Outdated Show resolved Hide resolved

internal/api/kubernetes/client.go Outdated Show resolved Hide resolved

internal/api/kubernetes/client.go Outdated Show resolved Hide resolved

refactor: reget functions assigned to workload

0acc7bd

samuel-esp requested a review from jonathan-mayer February 26, 2025 20:53

refactor: reget function

43b9e00

jonathan-mayer requested changes Feb 28, 2025

View reviewed changes

cmd/kubedownscaler/main.go Outdated Show resolved Hide resolved

cmd/kubedownscaler/main.go Outdated Show resolved Hide resolved

internal/pkg/scalable/cronjobs.go Outdated Show resolved Hide resolved

internal/pkg/scalable/workload.go Outdated Show resolved Hide resolved

samuel-esp added 2 commits February 28, 2025 09:46

refactor: resources, log messages

925a8cd

refactor: exported error messages from attemptScan

4b51d8a

samuel-esp requested a review from jonathan-mayer February 28, 2025 11:31

jonathan-mayer requested changes Mar 3, 2025

View reviewed changes

cmd/kubedownscaler/main.go Outdated Show resolved Hide resolved

cmd/kubedownscaler/main.go Outdated Show resolved Hide resolved

refactor: go func for attemptScan

46cea9d

samuel-esp requested a review from jonathan-mayer March 3, 2025 17:36

jonathan-mayer approved these changes Mar 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/retries on conflict error #74

Feat/retries on conflict error #74

samuel-esp commented Dec 14, 2024 •

edited

Loading

jonathan-mayer left a comment

samuel-esp commented Feb 19, 2025

jonathan-mayer commented Feb 20, 2025

samuel-esp commented Feb 20, 2025 •

edited

Loading

samuel-esp commented Feb 20, 2025 •

edited

Loading

jonathan-mayer commented Feb 21, 2025 •

edited

Loading

samuel-esp commented Feb 21, 2025

jonathan-mayer commented Feb 25, 2025 •

edited

Loading

samuel-esp commented Feb 25, 2025 •

edited

Loading

jonathan-mayer left a comment

samuel-esp commented Feb 26, 2025

jonathan-mayer left a comment

jonathan-mayer commented Mar 4, 2025

Feat/retries on conflict error #74

Are you sure you want to change the base?

Feat/retries on conflict error #74

Conversation

samuel-esp commented Dec 14, 2024 • edited Loading

Motivation

Changes

Tests done

TODO

jonathan-mayer left a comment

Choose a reason for hiding this comment

samuel-esp commented Feb 19, 2025

jonathan-mayer commented Feb 20, 2025

samuel-esp commented Feb 20, 2025 • edited Loading

samuel-esp commented Feb 20, 2025 • edited Loading

jonathan-mayer commented Feb 21, 2025 • edited Loading

samuel-esp commented Feb 21, 2025

jonathan-mayer commented Feb 25, 2025 • edited Loading

samuel-esp commented Feb 25, 2025 • edited Loading

jonathan-mayer left a comment

Choose a reason for hiding this comment

samuel-esp commented Feb 26, 2025

jonathan-mayer left a comment

Choose a reason for hiding this comment

jonathan-mayer commented Mar 4, 2025

samuel-esp commented Dec 14, 2024 •

edited

Loading

samuel-esp commented Feb 20, 2025 •

edited

Loading

samuel-esp commented Feb 20, 2025 •

edited

Loading

jonathan-mayer commented Feb 21, 2025 •

edited

Loading

jonathan-mayer commented Feb 25, 2025 •

edited

Loading

samuel-esp commented Feb 25, 2025 •

edited

Loading