Do not make any extra API call when endpoint URL is directly specified #1322
Conversation
Pull Request Overview
This PR updates the inference API request logic to avoid making unnecessary API calls when a direct endpoint URL is provided. Key changes include:
- Making the `model` property optional in `BodyParams`.
- Adding a fast-path in `makeRequestOptions` to bypass model resolution when an endpoint URL is supplied (sketched after this list).
- Adjusting the signature and usage of `makeRequestOptionsFromResolvedModel` to accommodate the optional model input.
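The fast-path mentioned above might look roughly like the following. This is a minimal sketch with simplified types: `resolveModelId` is a hypothetical stand-in for the model-resolution step, not the package's actual helper, and the real signatures differ.

```ts
// Minimal sketch of the fast-path; types are simplified assumptions.
interface RequestArgs {
  endpointUrl?: string;
  model?: string; // now optional, per the BodyParams change above
}

interface ResolvedRequest {
  url: string;
  info: RequestInit;
}

// Stubs for illustration only; the real implementations live in the package.
declare function resolveModelId(model?: string): Promise<string>;
declare function makeRequestOptionsFromResolvedModel(
  model: string | undefined,
  args: RequestArgs
): Promise<ResolvedRequest>;

async function makeRequestOptions(args: RequestArgs): Promise<ResolvedRequest> {
  if (args.endpointUrl) {
    // Fast path: the URL is already known, so skip model resolution
    // entirely; this is the extra API call the PR avoids.
    return makeRequestOptionsFromResolvedModel(args.model, args);
  }
  // Regular path: resolve the model first (may hit the Hub), then build.
  const resolvedModel = await resolveModelId(args.model);
  return makeRequestOptionsFromResolvedModel(resolvedModel, args);
}
```

The key design point is that the branch happens before any network-touching resolution logic runs, so a caller who already knows the URL pays no extra round-trip.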
Reviewed Changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| packages/inference/src/types.ts | Changed `model` from required to optional in `BodyParams`. |
| packages/inference/src/lib/makeRequestOptions.ts | Added branch logic for the endpoint URL and updated request options construction accordingly. |
Comments suppressed due to low confidence (2)
packages/inference/src/types.ts:130
- Making the `model` field optional may lead to cases where a required model is not provided when an endpoint URL is absent. Ensure that downstream logic either supplies a valid model ID or handles the undefined case appropriately.
`model?: string;`
packages/inference/src/lib/makeRequestOptions.ts:210
- When using the `endpointUrl` branch, `input.resolvedModel` might be undefined. Consider adding validation or a fallback to guarantee that a model value is provided when needed by `providerConfig.makeBody`.
`model: input.resolvedModel,`
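One possible way to address both suppressed comments is a small up-front guard. The following is a minimal sketch with an assumed input shape, not the PR's actual code:

```ts
// Fail fast when neither a model nor an endpoint URL is available,
// before providerConfig.makeBody runs with an undefined model.
function assertModelOrEndpoint(input: { resolvedModel?: string; endpointUrl?: string }): void {
  if (!input.endpointUrl && !input.resolvedModel) {
    throw new Error("Either a model ID or an endpointUrl must be provided");
  }
}
```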
ty
Thanks! Just a nit regarding the potentially undefined `model`.
A greater refactoring would need to take place (e.g. to remove the whole concept of `providerConfig` when `endpointUrl` is provided, and perhaps use a separate default provider config that doesn't rely on `model` being defined).

What we do on the Python side is to use the `"hf-inference"` provider when an endpoint is passed, and in the hf-inference logic we take care of handling things correctly. If a user passes `provider="<any-provider-except-hf-inference>"` together with `endpoint="https://..."`, we raise an exception.
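For reference, that Python-side rule could translate to TypeScript roughly as follows. The parameter names here are assumptions for illustration, not this PR's API:

```ts
// Hypothetical translation of the Python-side rule described above.
function resolveProvider(provider: string | undefined, endpointUrl?: string): string {
  if (endpointUrl) {
    if (provider && provider !== "hf-inference") {
      // A custom endpoint only makes sense with the default provider,
      // so any other combination raises an error.
      throw new Error(`Cannot use provider "${provider}" with a custom endpoint URL`);
    }
    return "hf-inference";
  }
  return provider ?? "hf-inference";
}
```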
I'm a bit afraid this PR and #1315 will conflict heavily 😕 cc @hanouticelina
Force-pushed from 0c9cbe6 to 712ed1f
Fix #1321 (cc @Kakulukian for viz)
The fix is not perfect; a greater refactoring would need to take place (e.g. to remove the whole concept of `providerConfig` when `endpointUrl` is provided, and perhaps use a separate default provider config that doesn't rely on `model` being defined).
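As a rough illustration of that last idea, a separate default provider config for direct endpoints might look like this. It is an assumption about a possible refactor, not code from this PR:

```ts
// Sketch of a default provider config that does not rely on `model`.
interface ProviderConfig {
  makeUrl(params: { endpointUrl?: string; model?: string }): string;
  makeBody(params: { args: Record<string, unknown>; model?: string }): unknown;
}

const directEndpointConfig: ProviderConfig = {
  // When an endpoint URL is given, it is already final: no model lookup.
  makeUrl: ({ endpointUrl }) => {
    if (!endpointUrl) {
      throw new Error("endpointUrl is required for this config");
    }
    return endpointUrl;
  },
  // Omit `model` from the body when it is undefined instead of requiring it.
  makeBody: ({ args, model }) => (model !== undefined ? { ...args, model } : { ...args }),
};
```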