Add resource-thresholds in ingesters and store gateways to throttle query requests when the pods are under resource pressure. #6674

Open · wants to merge 10 commits into master from resource-based-throttling

Conversation

@justinjung04 (Contributor) commented Mar 26, 2025

What this PR does:

This PR introduces the ability to throttle incoming query requests in ingesters and store gateways when their CPU or heap is under pressure.

Data stores (ingesters and store gateways) currently don't have a good way to limit and control resource allocation per query request. Resource consumption varies hugely from one query request to the next, so it's hard to define static limits that protect ingesters or store gateways from using more than 100% CPU or being OOM-killed.

I'm introducing a new component called the resource monitor, which ingesters and store gateways can reference to block incoming query requests when utilization rises above a defined threshold.

Here is a test where a high TPS of queries that was exhausting ingester CPU got throttled by the new feature, stabilizing ingester CPU at around the configured threshold of 40%.

[Screenshot: ingester CPU utilization stabilizing around the configured 40% threshold]

I'm applying this to ingesters and store gateways for now just to keep the PR size small, but it can easily be applied to the query frontend and queriers later as well.
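
As a rough sketch of the idea (type and field names below are illustrative, not this PR's actual API):

```go
package resourcemonitor

import "sync"

// ResourceMonitor periodically samples CPU and heap utilization and
// exposes a cheap check that request paths can call before doing work.
type ResourceMonitor struct {
	mtx           sync.RWMutex
	cpuUtil       float64 // 0.0-1.0, averaged over recent samples
	heapUtil      float64 // 0.0-1.0, heap in use relative to a limit
	cpuThreshold  float64
	heapThreshold float64
}

// Overloaded reports whether either resource is above its threshold.
// Ingesters and store gateways call this at the start of a query and
// reject the request while it returns true.
func (m *ResourceMonitor) Overloaded() bool {
	m.mtx.RLock()
	defer m.mtx.RUnlock()
	return m.cpuUtil > m.cpuThreshold || m.heapUtil > m.heapThreshold
}
```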

Which issue(s) this PR fixes:
n/a

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@justinjung04 (Contributor, Author) commented:

When choosing how to retrieve correct CPU and heap data, I basically tested different metrics from https://pkg.go.dev/runtime/metrics and https://github.com/prometheus/procfs and compared them with Kubernetes metrics to find the closest match. I didn't think it was necessary to document all the different metrics I tried, but let me know if you think I should mention it somewhere.
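
For reference, this is roughly the kind of sampling that was compared (a self-contained sketch; the exact metric names the PR settled on may differ):

```go
package main

import (
	"fmt"
	"runtime/metrics"

	"github.com/prometheus/procfs"
)

func main() {
	// CPU: cumulative user+system seconds for this process, from /proc.
	// Sampling this twice and dividing the delta by wall time gives a
	// utilization rate. Only works on Linux.
	proc, err := procfs.Self()
	if err != nil {
		panic(err)
	}
	stat, err := proc.Stat()
	if err != nil {
		panic(err)
	}
	fmt.Printf("cpu seconds: %f\n", stat.CPUTime())

	// Heap: live heap bytes from the Go runtime's metrics interface.
	// Note this is the Go heap, not the pod's RSS.
	sample := []metrics.Sample{{Name: "/memory/classes/heap/objects:bytes"}}
	metrics.Read(sample)
	fmt.Printf("heap bytes: %d\n", sample[0].Value.Uint64())
}
```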

@@ -5,6 +5,8 @@
* [FEATURE] Query Frontend: Add dynamic interval size for query splitting. This is enabled by configuring experimental flags `querier.max-shards-per-query` and/or `querier.max-fetched-data-duration-per-query`. The split interval size is dynamically increased to maintain a number of shards and total duration fetched below the configured values. #6458
* [FEATURE] Querier/Ruler: Add `query_partial_data` and `rules_partial_data` limits to allow queries/rules to be evaluated with data from a single zone, if other zones are not available. #6526
* [FEATURE] Update prometheus alertmanager version to v0.28.0 and add new integration msteamsv2, jira, and rocketchat. #6590
* [FEATURE] Ingester: Add a `-ingester.enable-ooo-native-histograms` flag to enable out-of-order native histogram ingestion per tenant. It only takes effect when `-blocks-storage.tsdb.enable-native-histograms=true` and `-ingester.out-of-order-time-window` > 0. It is applied after the restart if it is changed at runtime through the runtime config. #6626
Contributor:

Seems like an unrelated change?

}

func NewScanner() (*Scanner, error) {
proc, err := procfs.Self()
Contributor:

I don't know if the Cortex docs say somewhere that anything other than Linux is supported, but maybe it makes sense to error out if the OS is not Linux? Or return a scanner that does nothing.
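
Something along these lines would do the fallback (a sketch; it assumes Scanner is, or becomes, an interface, and noopScanner is a hypothetical implementation whose methods report zero utilization):

```go
// NewScanner returns a procfs-backed scanner on Linux, and a scanner that
// never reports resource pressure on other platforms, instead of failing
// at runtime when /proc is missing.
func NewScanner() (Scanner, error) {
	if runtime.GOOS != "linux" {
		return &noopScanner{}, nil
	}
	proc, err := procfs.Self()
	if err != nil {
		return nil, err
	}
	return &procfsScanner{proc: proc}, nil
}
```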

@@ -271,6 +271,15 @@ query_scheduler:
# CLI flag: -query-scheduler.grpc-client-config.connect-timeout
[connect_timeout: <duration> | default = 5s]

resource_thresholds:
Contributor:

IMHO, it's worth mentioning what metrics are being used to estimate resource usage, and it's probably worth mentioning that they might not exactly translate to memory usage, for example.
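
For example, the config reference could spell that out along these lines (the field and flag names below are illustrative, not necessarily what this PR defines):

```yaml
resource_thresholds:
  # Reject incoming query requests when average process CPU utilization,
  # sampled from procfs (which may differ from container-level metrics),
  # exceeds this fraction. 0 disables the check.
  # CLI flag: -resource-thresholds.cpu
  [cpu: <float> | default = 0]

  # Reject incoming query requests when Go heap usage, read from
  # runtime/metrics (which is not the same as the pod's RSS),
  # exceeds this fraction. 0 disables the check.
  # CLI flag: -resource-thresholds.heap
  [heap: <float> | default = 0]
```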

@@ -5,6 +5,8 @@
* [FEATURE] Query Frontend: Add dynamic interval size for query splitting. This is enabled by configuring experimental flags `querier.max-shards-per-query` and/or `querier.max-fetched-data-duration-per-query`. The split interval size is dynamically increased to maintain a number of shards and total duration fetched below the configured values. #6458
* [FEATURE] Querier/Ruler: Add `query_partial_data` and `rules_partial_data` limits to allow queries/rules to be evaluated with data from a single zone, if other zones are not available. #6526
* [FEATURE] Update prometheus alertmanager version to v0.28.0 and add new integration msteamsv2, jira, and rocketchat. #6590
* [FEATURE] Ingester: Add a `-ingester.enable-ooo-native-histograms` flag to enable out-of-order native histogram ingestion per tenant. It only takes effect when `-blocks-storage.tsdb.enable-native-histograms=true` and `-ingester.out-of-order-time-window` > 0. It is applied after the restart if it is changed at runtime through the runtime config. #6626
* [FEATURE] Ingester/StoreGateway: Add `resource-thresholds` in ingesters and store gateways to throttle query requests when the pods are under resource pressure. #6674
Contributor:

Maybe it would be interesting to know/document the exact use case? Is it so that the user gets a nice error message instead of waiting for the timeout to hit when there's resource pressure?
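
If that's the intent, the check in the query path would presumably fail fast with a retriable error, along these lines (a sketch; checkResourcePressure and the monitor API are illustrative, and status/codes are from google.golang.org/grpc):

```go
// Hypothetical check at the start of each query RPC in the ingester.
func (i *Ingester) checkResourcePressure() error {
	if i.resourceMonitor != nil && i.resourceMonitor.Overloaded() {
		// Return ResourceExhausted immediately rather than letting the
		// query run until the client times out under pressure.
		return status.Error(codes.ResourceExhausted,
			"ingester is under resource pressure, please retry later")
	}
	return nil
}
```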


// Variables to calculate average CPU utilization
index int
cpuRates []float64
Contributor:

Maybe it makes sense to use an array instead of a slice here? dataPointsToAvg is a constant either way.
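
That is, something like this, where the fixed size is baked into the type (dataPointsToAvg as in the PR, the rest illustrative):

```go
const dataPointsToAvg = 60 // the PR's actual constant may differ

type cpuAverager struct {
	index    int
	filled   int
	cpuRates [dataPointsToAvg]float64 // array, not slice: size is a compile-time constant
}

func (a *cpuAverager) add(rate float64) {
	a.cpuRates[a.index] = rate
	a.index = (a.index + 1) % dataPointsToAvg
	if a.filled < dataPointsToAvg {
		a.filled++
	}
}

func (a *cpuAverager) avg() float64 {
	if a.filled == 0 {
		return 0
	}
	var sum float64
	for _, r := range a.cpuRates[:a.filled] {
		sum += r
	}
	return sum / float64(a.filled)
}
```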
