Feature request: provider-agnostic Model Capabilities registry (context window, token limits, feature support)

**Expected Behavior**

Provide a provider-agnostic, runtime-accessible model capabilities registry so applications can discover per-model capability and limits (for example: context window, max output tokens, and support for tool calling, streaming, multimodality, reasoning). The API should be additive and opt-in for providers that can supply this data.

Example API sketch (consumer):

```java
Optional<ModelCapabilities> caps = modelCapabilitiesProvider.getCapabilities("gpt-5-mini");
if (caps.isPresent() && caps.get().contextWindowTokens() != null) {
    validatePromptBudget(caps.get().contextWindowTokens());
}
if (caps.isPresent() && !caps.get().supportsToolCalling()) {
    // Disable tool execution for this model at runtime
}
```

Providers or model implementations may register capabilities beans or the framework may ship a central lookup that aggregates provider-supplied metadata.

**Current Behavior**

Spring AI exposes model operations and options but does not provide a unified runtime capability descriptor for models. As a result, application teams must:

- hardcode per-model limits in their apps, or
- maintain and synchronize an external map/registry of model -> capabilities, or
- consult provider documentation at development time and rely on runtime inference.

None of these approaches are ideal for production systems that must validate prompt sizes, set safe defaults, or degrade features dynamically.

**Context**

How has this affected you?

- It increases application complexity and risk: apps either risk runtime failures (oversized prompts) or conservative limits that reduce utility.
- It forces duplicate maintenance across projects and teams.

What are you trying to accomplish?

- Add a small, provider-neutral contract to Spring AI so client code can query model capabilities at runtime and adapt behavior safely.

What other alternatives have you considered?

- Keep capability registries in application code (current workaround) — error-prone and hard to keep current.
- Provider-specific metadata APIs only — workable but fragments application logic and increases branching.
- Documentation-only guidance — insufficient for runtime validation.

Additional notes

- Fields should be nullable/optional when a provider cannot return them.
- Providers may expose extra, provider-specific keys via a map for advanced use cases.
- I can follow up with a PR that implements the minimal contract and a provider-side stub for one or two model integrations to demonstrate the pattern.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature request: provider-agnostic Model Capabilities registry (context window, token limits, feature support) #6486

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Feature request: provider-agnostic Model Capabilities registry (context window, token limits, feature support) #6486

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions