Skip to content

Commit 2030338

Browse files
authored
Apply suggestions from code review
1 parent 761c992 commit 2030338

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -8,9 +8,9 @@ This project offers tools for AI Inference, enabling developers to build [Infere
88

99
AI/ML is changing rapidly, and [Inference] goes beyond basic networking to include complex traffic routing and optimizations. Below are key terms used within this project:
1010

11-
- **Scheduler**: Makes decisions about which endpoint is optimal (best cost / best performance) for an inference request based on `Metrics and Capabilities` from Model Serving Platforms.
11+
- **Scheduler**: Makes decisions about which endpoint is optimal (best cost / best performance) for an inference request based on `Metrics and Capabilities` from [Model Serving](/docs/proposals/003-model-server-protocol/README.md).
1212
- **Metrics and Capabilities**: Data provided by model serving platforms about performance, availability and capabilities to optimize routing. Includes things like [Prefix Cache] status or [LoRA Adapters] availability.
13-
- **Endpoint Picker**: A `Scheduler` combined with `Metrics and Capabilities` systems is often referred to together as an `Endpoint Picker (EPP)`.
13+
- **Endpoint Selector**: A `Scheduler` combined with `Metrics and Capabilities` systems is often referred to together as an [Endpoint Selection Extension](https://gateway-api-inference-extension.sigs.k8s.io/#endpoint-selection-extension) (this is also sometimes referred to as an "endpoint picker").
1414
- **Inference Gateway**: A proxy/load-balancer which has been coupled with a `Endpoint Picker`. It provides optimized routing and load balancing for serving generative Artificial Intelligence (AI) workloads. It simplifies the deployment, management, and observability of AI inference workloads.
1515

1616
For deeper insights and more advanced concepts, refer to our [proposals](/docs/proposals).

0 commit comments

Comments
 (0)