Add hermetic tests on the routing algorithm #80

@liu-cong

Description

Hermetic tests are a fast way to verify functionality end to end without requiring an integration test environment. In addition to the basic test we have today, we should add the following:

Test framework improvements:

Test Cases

  • Happy path: Configure one InferencePool and one InferenceModel, and verify that a request succeeds.
  • Test algorithm
    • Verify the min KV cache algorithm without LoRA: EPP should pick the pod with the lowest KV cache usage. Refer to filter_test.go for the pod metrics conditions.
    • Verify the LoRA affinity algorithm: EPP should pick the pod that already has the requested LoRA adapter active.
    • Verify sheddable requests: a sheddable request succeeds when resources are available and is dropped when resources are constrained.
### Tasks
- [ ] https://github.com/kubernetes-sigs/gateway-api-inference-extension/issues/153
