generated from kubernetes/kubernetes-template-project
-
Notifications
You must be signed in to change notification settings - Fork 222
Closed
Milestone
Description
Hermetic test is a fast way to verify functionality e2e without requiring an integration test env. In addition to the basic test we have today, we should add the following:
Test framework improvements:
- Run a fake k8s API server so we don't need to fake the reconcilers, done in Refactor ext-proc Main with Server Package Add Hermetic Test with k8s API Client for EPP #222
Test Cases
- Happy path: Configure one InferencePool and InferenceModel, and verify request success
- Test algorithm
- Verify min KV cache algo without LoRA - EPP should pick the pod with least kv-cache, refer to the filter_test.go for the conditions of the pod metrics.
- Verify LoRA affinity algo - EPP should pick the pod with LoRA already active.
- Sheddable request succeeds when resource is available; and dropped when resource is constrained.
### Tasks
- [ ] https://github.com/kubernetes-sigs/gateway-api-inference-extension/issues/153
Metadata
Metadata
Assignees
Labels
No labels