Skip to content

Commit

Permalink
Merge pull request #7756 from aojea/dns_sli
Browse files Browse the repository at this point in the history
update DNS programming latency SLI
  • Loading branch information
k8s-ci-robot authored Mar 14, 2024
2 parents 31399e5 + 9e43c29 commit e9f78a3
Showing 1 changed file with 9 additions and 1 deletion.
10 changes: 9 additions & 1 deletion sig-scalability/slos/dns_programming_latency.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,8 +15,10 @@ from all of them.
DNS will start resolving service name to its newly started backends.
- As a user of vanilla Kubernetes, I want some guarantee how quickly in-cluster
DNS will stop resolving service name to its removed (or unhealthy) backends.
- As a user of vanilla Kubernetes, I wasn some guarantee how quickly newly
- As a user of vanilla Kubernetes, I want some guarantee how quickly newly
create services will be resolvable via in-cluster DNS.
- As a user of vanilla Kubernetes, I want some guarantee how quickly in-cluster
DNS will start resolving headless service hostnames to its newly started backends.

### Other notes
- We are consciously focusing on in-cluster DNS for the purpose of this SLI,
Expand All @@ -37,6 +39,12 @@ The reason for doing it this way is feasibility for efficiently computing that:
in 99% of programmers (e.g. iptables). That requires tracking metrics on
per-change base (which we can't do efficiently).

- The SLI for DNS publishing should remain constant independent of the number of records.
For example, in a headless service with thousands of pods the time between the pod being
assigned an IP and the time DNS makes that IP available in the service's A/AAAA record(s)
should be statisitically consistent for the first Pod and the last Pod.


### How to measure the SLI.
There [network programming latency](./network_programming_latency.md) is
formulated in almost exactly the same way. As a result, the methodology for
Expand Down

0 comments on commit e9f78a3

Please sign in to comment.