Merge pull request #7756 from aojea/dns_sli

update DNS programming latency SLI
kubernetes · Mar 14, 2024 · e9f78a3 · e9f78a3
2 parents 31399e5 + 9e43c29
commit e9f78a3
Showing 1 changed file with 9 additions and 1 deletion.
diff --git a/sig-scalability/slos/dns_programming_latency.md b/sig-scalability/slos/dns_programming_latency.md
@@ -15,8 +15,10 @@ from all of them.
 DNS will start resolving service name to its newly started backends.
 - As a user of vanilla Kubernetes, I want some guarantee how quickly in-cluster
 DNS will stop resolving service name to its removed (or unhealthy) backends.
-- As a user of vanilla Kubernetes, I wasn some guarantee how quickly newly
+- As a user of vanilla Kubernetes, I want some guarantee how quickly newly
 create services will be resolvable via in-cluster DNS.
+- As a user of vanilla Kubernetes, I want some guarantee how quickly in-cluster
+DNS will start resolving headless service hostnames to its newly started backends.
 
 ### Other notes
 - We are consciously focusing on in-cluster DNS for the purpose of this SLI,
@@ -37,6 +39,12 @@ The reason for doing it this way is feasibility for efficiently computing that:
     in 99% of programmers (e.g. iptables). That requires tracking metrics on
     per-change base (which we can't do efficiently).
 
+- The SLI for DNS publishing should remain constant independent of the number of records.
+For example, in a headless service with thousands of pods the time between the pod being
+assigned an IP and the time DNS makes that IP available in the service's A/AAAA record(s)
+should be statisitically consistent for the first Pod and the last Pod.
+
+
 ### How to measure the SLI.
 There [network programming latency](./network_programming_latency.md) is
 formulated in almost exactly the same way. As a result, the methodology for