[inference provider] Add wavespeed.ai as an inference provider #1424

arabot777 · 2025-05-05T02:28:49Z

What’s in this PR
WaveSpeedAI is a high-performance AI image and video generation service platform, offering industry-leading generation speeds. Now, want to be listed as an Inference Provider on the Hugging Face Hub

The JS Client Integration was completed based on the inference-providers help documentation and passed the test. I am submitting the pr now and look forward to further communication with you

Test

pnpm --filter @huggingface/inference test "test/InferenceClient.spec.ts" -t "^Wavespeed AI"

> @huggingface/[email protected] test /Users/shanliu/work/huggingface.js/packages/inference
> vitest run --config vitest.config.mts "test/InferenceClient.spec.ts"


 RUN  v0.34.6 /Users/shanliu/work/huggingface.js/packages/inference

 ✓ test/InferenceClient.spec.ts (104) 198160ms
   ✓ InferenceClient (104) 198160ms
     ✓ backward compatibility (1)
       ✓ works with old HfInference name
     ↓ HF Inference (49) [skipped]
       ↓ throws error if model does not exist [skipped]
       ↓ fillMask [skipped]
       ↓ works without model [skipped]
       ↓ summarization [skipped]
       ↓ questionAnswering [skipped]
       ↓ tableQuestionAnswering [skipped]
       ↓ documentQuestionAnswering [skipped]
       ↓ documentQuestionAnswering with non-array output [skipped]
       ↓ visualQuestionAnswering [skipped]
       ↓ textClassification [skipped]
       ↓ textGeneration - gpt2 [skipped]
       ↓ textGeneration - openai-community/gpt2 [skipped]
       ↓ textGenerationStream - meta-llama/Llama-3.2-3B [skipped]
       ↓ textGenerationStream - catch error [skipped]
       ↓ textGenerationStream - Abort [skipped]
       ↓ tokenClassification [skipped]
       ↓ translation [skipped]
       ↓ zeroShotClassification [skipped]
       ↓ sentenceSimilarity [skipped]
       ↓ FeatureExtraction [skipped]
       ↓ FeatureExtraction - auto-compatibility sentence similarity [skipped]
       ↓ FeatureExtraction - facebook/bart-base [skipped]
       ↓ FeatureExtraction - facebook/bart-base, list input [skipped]
       ↓ automaticSpeechRecognition [skipped]
       ↓ audioClassification [skipped]
       ↓ audioToAudio [skipped]
       ↓ textToSpeech [skipped]
       ↓ imageClassification [skipped]
       ↓ zeroShotImageClassification [skipped]
       ↓ objectDetection [skipped]
       ↓ imageSegmentation [skipped]
       ↓ imageToImage [skipped]
       ↓ imageToImage blob data [skipped]
       ↓ textToImage [skipped]
       ↓ textToImage with parameters [skipped]
       ↓ imageToText [skipped]
       ↓ request - openai-community/gpt2 [skipped]
       ↓ tabularRegression [skipped]
       ↓ tabularClassification [skipped]
       ↓ endpoint - makes request to specified endpoint [skipped]
       ↓ endpoint - makes request to specified endpoint - alternative syntax [skipped]
       ↓ chatCompletion modelId - OpenAI Specs [skipped]
       ↓ chatCompletionStream modelId - OpenAI Specs [skipped]
       ↓ chatCompletionStream modelId Fail - OpenAI Specs [skipped]
       ↓ chatCompletion - OpenAI Specs [skipped]
       ↓ chatCompletionStream - OpenAI Specs [skipped]
       ↓ custom mistral - OpenAI Specs [skipped]
       ↓ custom openai - OpenAI Specs [skipped]
       ↓ OpenAI client side routing - model should have provider as prefix [skipped]
     ↓ Fal AI (4) [skipped]
       ↓ textToImage - black-forest-labs/FLUX.1-schnell [skipped]
       ↓ textToImage - SD LoRAs [skipped]
       ↓ textToImage - Flux LoRAs [skipped]
       ↓ automaticSpeechRecognition - openai/whisper-large-v3 [skipped]
     ↓ Featherless (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textGeneration [skipped]
     ↓ Replicate (10) [skipped]
       ↓ textToImage canonical - black-forest-labs/FLUX.1-schnell [skipped]
       ↓ textToImage canonical - black-forest-labs/FLUX.1-dev [skipped]
       ↓ textToImage canonical - stabilityai/stable-diffusion-3.5-large-turbo [skipped]
       ↓ textToImage versioned - ByteDance/SDXL-Lightning [skipped]
       ↓ textToImage versioned - ByteDance/Hyper-SD [skipped]
       ↓ textToImage versioned - playgroundai/playground-v2.5-1024px-aesthetic [skipped]
       ↓ textToImage versioned - stabilityai/stable-diffusion-xl-base-1.0 [skipped]
       ↓ textToSpeech versioned [skipped]
       ↓ textToSpeech OuteTTS -  usually Cold [skipped]
       ↓ textToSpeech Kokoro [skipped]
     ↓ SambaNova (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ featureExtraction [skipped]
     ↓ Together (4) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
       ↓ textGeneration [skipped]
     ↓ Nebius (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
     ↓ 3rd party providers (1) [skipped]
       ↓ chatCompletion - fails with unsupported model [skipped]
     ↓ Fireworks (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Hyperbolic (4) [skipped]
       ↓ chatCompletion - hyperbolic [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
       ↓ textGeneration [skipped]
     ↓ Novita (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Black Forest Labs (2) [skipped]
       ↓ textToImage [skipped]
       ↓ textToImage URL [skipped]
     ↓ Cohere (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Cerebras (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Nscale (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
     ↓ Groq (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ OVHcloud (4) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textGeneration [skipped]
       ↓ textGeneration stream [skipped]
     ✓ Wavespeed AI (5) 89033ms
       ✓ textToImage - wavespeed-ai/flux-schnell 89032ms
       ✓ textToImage - wavespeed-ai/flux-dev-lora 12369ms
       ✓ textToImage - wavespeed-ai/flux-dev-lora-ultra-fast 17936ms
       ✓ textToVideo - wavespeed-ai/wan-2.1/t2v-480p 79507ms
       ✓ imageToImage - wavespeed-ai/hidream-e1-full 74481ms

 Test Files  1 passed (1)
      Tests  5 passed | 103 skipped (108)
   Start at  14:33:17
   Duration  89.62s (transform 315ms, setup 14ms, collect 368ms, tests 89.03s, environment 0ms, prepare 74ms)

SBrandeis

Hello, thank you for your contribution
The code is of great quality overall - I left a few comments regarding our code style.
Please make sure the client can be used to query your API for all supported tasks, and that the payload are matching your API.
Thanks again!

SBrandeis · 2025-05-19T14:36:21Z

packages/inference/README.md

+- [HF Inference API (serverless)](https://huggingface.co/models?inference=warm&sort=trending)



Suggested change

- [HF Inference API (serverless)](https://huggingface.co/models?inference=warm&sort=trending)

has deleted

SBrandeis · 2025-05-19T14:41:26Z

packages/inference/test/InferenceClient.spec.ts

+					hfModelId: "wavespeed-ai/wan-2.1/i2v-480p",
+					providerId: "wavespeed-ai/wan-2.1/i2v-480p",
+					status: "live",
+					task: "image-to-video",


this task is not supported in the client code - let's remove it for now

has deleted

SBrandeis · 2025-05-19T14:43:28Z

packages/inference/src/providers/wavespeed-ai.ts

+import { InferenceOutputError } from "../lib/InferenceOutputError";
+import { ImageToImageArgs } from "../tasks";
+import type { BodyParams, HeaderParams, RequestArgs, UrlParams } from "../types";
+import { delay } from "../utils/delay";
+import { omit } from "../utils/omit";
+import { base64FromBytes } from "../utils/base64FromBytes";
+import {
+	TaskProviderHelper,
+	TextToImageTaskHelper,
+	TextToVideoTaskHelper,
+	ImageToImageTaskHelper,
+} from "./providerHelper";
+


We use import type when the import is only used as a type

Suggested change

import { InferenceOutputError } from "../lib/InferenceOutputError";

import { ImageToImageArgs } from "../tasks";

import type { BodyParams, HeaderParams, RequestArgs, UrlParams } from "../types";

import { delay } from "../utils/delay";

import { omit } from "../utils/omit";

import { base64FromBytes } from "../utils/base64FromBytes";

import {

TaskProviderHelper,

TextToImageTaskHelper,

TextToVideoTaskHelper,

ImageToImageTaskHelper,

} from "./providerHelper";

import { InferenceOutputError } from "../lib/InferenceOutputError";

import type { ImageToImageArgs } from "../tasks";

import type { BodyParams, HeaderParams, RequestArgs, UrlParams } from "../types";

import { delay } from "../utils/delay";

import { omit } from "../utils/omit";

import { base64FromBytes } from "../utils/base64FromBytes";

import type {

TaskProviderHelper,

TextToImageTaskHelper,

TextToVideoTaskHelper,

ImageToImageTaskHelper,

} from "./providerHelper";

Modify as suggested

SBrandeis · 2025-05-19T15:08:19Z

packages/inference/src/providers/wavespeed-ai.ts

+	};
+}
+
+type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;


I'm not sure this type alias is needed, can we remove it?

Suggested change

type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;

WaveSpeedAICommonResponse can be renamed to WaveSpeedAIResponse

This type is needed and will be used in two places. It's uncertain whether it will be used again in the future.
It follows the DRY (Don't Repeat Yourself) principle
It provides better type safety (through default generic parameters)
It makes the code more readable and maintainable

SBrandeis · 2025-05-19T15:20:13Z

packages/inference/src/providers/wavespeed-ai.ts

+				case "completed": {
+					// Get the video data from the first output URL
+					if (!taskResult.outputs?.[0]) {
+						throw new InferenceOutputError("No video URL in completed response");
+					}
+					const videoResponse = await fetch(taskResult.outputs[0]);
+					if (!videoResponse.ok) {
+						throw new InferenceOutputError("Failed to fetch video data");
+					}
+					return await videoResponse.blob();


From what I understand, the payload can be something else than a video (eg an image)
Let's update the error message to reflect that

yes,
I revised it.

SBrandeis · 2025-05-19T15:24:51Z

packages/inference/src/providers/wavespeed-ai.ts

+		if (!args.parameters) {
+			return {
+				...args,
+				model: args.model,
+				data: args.inputs,
+			};
+		} else {
+			return {
+				...args,
+				inputs: base64FromBytes(
+					new Uint8Array(args.inputs instanceof ArrayBuffer ? args.inputs : await (args.inputs as Blob).arrayBuffer())
+				),
+			};
+		}
+	}
+
+	override preparePayload(params: BodyParams): Record<string, unknown> {
+		return {
+			...omit(params.args, ["inputs", "parameters"]),
+			...(params.args.parameters as Record<string, unknown>),
+			image: params.args.inputs,
+		};
+	}


I think only one of the two ( preparePayload or preparePayloadAsync) should be responsible for building the payload, meaning, I'd rather move the rename of inputs to image in preparePayloadAsync an have preparePayload as dumb as possible

cc @hanouticelina - would love your opinion on that specific point

I only kept preparePayloadAsync func

I think only one of the two ( preparePayload or preparePayloadAsync) should be responsible for building the payload, meaning, I'd rather move the rename of inputs to image in preparePayloadAsync an have preparePayload as dumb as possible

yes agree!

SBrandeis · 2025-05-19T15:27:55Z

packages/inference/src/providers/wavespeed-ai.ts

+				inputs: base64FromBytes(
+					new Uint8Array(args.inputs instanceof ArrayBuffer ? args.inputs : await (args.inputs as Blob).arrayBuffer())
+				),


Does the wavespeed API support base64-encoded images as inputs?

hanouticelina

thank you @arabot777 for the PR! I left some minor comments. I tested the 3 tasks supported by Wavespeed.ai and it works as expected with the changes I suggested.

packages/inference/src/lib/getProviderHelper.ts

packages/inference/src/providers/wavespeed-ai.ts

Co-authored-by: célina <[email protected]>

SBrandeis

Second round of code review, thank you! We're getting there

Note: make sure you run pnpm format and pnpm lint to conform our code style.

SBrandeis · 2025-05-22T14:17:44Z

packages/inference/src/providers/wavespeed-ai.ts

+/**
+ * Common response structure for all WaveSpeed AI API responses
+ */
+interface WaveSpeedAICommonResponse<T> {
+	code: number;
+	message: string;
+	data: T;
+}
+


This abstraction is not necessary IMO, let's remove it (see my other comment)

Suggested change

/**

* Common response structure for all WaveSpeed AI API responses

*/

interface WaveSpeedAICommonResponse<T> {

code: number;

message: string;

data: T;

}

It has been modified as suggested

SBrandeis · 2025-05-22T14:19:34Z

packages/inference/src/providers/wavespeed-ai.ts

+type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;
+


Following the previous comment - let's remove one level of abstraction

Suggested change

type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;

interface WaveSpeedAIResponse {

code: number;

message: string;

data: WaveSpeedAITaskResponse

}

It has been modified as suggested

SBrandeis · 2025-05-22T14:25:25Z

packages/inference/src/providers/wavespeed-ai.ts

+	preparePayload(params: BodyParams): Record<string, unknown> {
+		const payload: Record<string, unknown> = {
+			...omit(params.args, ["inputs", "parameters"]),
+			...(params.args.parameters as Record<string, unknown>),
+			prompt: params.args.inputs,
+		};
+		// Add LoRA support if adapter is specified in the mapping


We don't need to cast into Result<string, unknown> if the params have the proper type
ImageToImageArgs, TextToImageArgs, and TextToVideoArgs need to be improrted from "../tasks"

Suggested change

preparePayload(params: BodyParams): Record<string, unknown> {

const payload: Record<string, unknown> = {

...omit(params.args, ["inputs", "parameters"]),

...(params.args.parameters as Record<string, unknown>),

prompt: params.args.inputs,

};

// Add LoRA support if adapter is specified in the mapping

preparePayload(params: BodyParams<ImageToImageArgs | TextToImageArgs | TextToVideoArgs>): Record<string, unknown> {

const payload: Record<string, unknown> = {

...omit(params.args, ["inputs", "parameters"]),

...params.args.parameters,

prompt: params.args.inputs,

};

// Add LoRA support if adapter is specified in the mapping

It has been modified as suggested

SBrandeis · 2025-05-22T14:31:43Z

packages/inference/src/providers/wavespeed-ai.ts

+		if (params.mapping?.adapter === "lora" && params.mapping.adapterWeightsPath) {
+			payload.loras = [
+				{
+					path: params.mapping.adapterWeightsPath,


For reference, adapterWeightsPath is the path to the LoRA weights inside the associated HF repo
eg, for nerijs/pixel-art-xl, it will be

"pixel-art-xl.safetensors"

Let's make sure that is indeed what your API is expecting when running LoRAs

Here I see that fal is the endpoint that has been concatenated with hf.
Can I directly set the adapterWeightsPath to a lora http address? Or any other address.

In the test cases, I conducted the test in this way. The adapterWeightsPath was directly passed over as an input parameter of lora.

"wavespeed-ai/flux-dev-lora": { hfModelId: "wavespeed-ai/flux-dev-lora", providerId: "wavespeed-ai/flux-dev-lora", status: "live", task: "text-to-image", adapter: "lora", adapterWeightsPath: "https://d32s1zkpjdc4b1.cloudfront.net/predictions/599f3739f5354afc8a76a12042736bfd/1.safetensors", }, "wavespeed-ai/flux-dev-lora-ultra-fast": { hfModelId: "wavespeed-ai/flux-dev-lora-ultra-fast", providerId: "wavespeed-ai/flux-dev-lora-ultra-fast", status: "live", task: "text-to-image", adapter: "lora", adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA", },

in wavespeedai task is :

However, I'm not sure whether the input parameters submitted by hf to lora must be the abbreviation of the file path of the hf model and then concatenated with the hf address in the code. If it is this kind of specification, I can complete it in the format of fal

I think your API can just take the hf model id as the loras path, right?

Suggested change

path: params.mapping.adapterWeightsPath,

path: params.mapping.hfModelId,,

As mentioned by @SBrandeis, this part depends on what your API is expecting as inputs when using LoRAs weights.

Yes, you're correct.
In the example, linoyts/yarn_art_Flux_LoRA is the lora model address of hf. We will automatically match and download the hf model。

I completed the modification and ran the use case successfully

SBrandeis · 2025-05-22T14:33:14Z

packages/inference/src/providers/wavespeed-ai.ts

+	override prepareHeaders(params: HeaderParams, isBinary: boolean): Record<string, string> {
+		this.accessToken = params.accessToken;
+		const headers: Record<string, string> = { Authorization: `Bearer ${params.accessToken}` };
+		if (!isBinary) {
+			headers["Content-Type"] = "application/json";
+		}
+		return headers;
+	}


This is the same behavior as the blanket implementation here:
https://github.com/arabot777/huggingface.js/blob/f706e02d6128f559bd5551072344ff6e31b9c4be/packages/inference/src/providers/providerHelper.ts#L114-L124

No need for an override IMO

Suggested change

override prepareHeaders(params: HeaderParams, isBinary: boolean): Record<string, string> {

this.accessToken = params.accessToken;

const headers: Record<string, string> = { Authorization: `Bearer ${params.accessToken}` };

if (!isBinary) {

headers["Content-Type"] = "application/json";

}

return headers;

}

I removed this part of the logic at the beginning. However, the getresponse method of imageToimage.ts did not pass in header information.

I have to rewrite prepareHeaders here and by assignment
this.accessToken = params.accessToken; To ensure that the complete ak information of the header can be passed on when calling getresponse

I'd rather update ImageToImage to be able to pass headers to getResponse:

export async function imageToImage(args: ImageToImageArgs, options?: Options): Promise<Blob> { const provider = await resolveProvider(args.provider, args.model, args.endpointUrl); const providerHelper = getProviderHelper(provider, "image-to-image"); const payload = await providerHelper.preparePayloadAsync(args); const { data: res } = await innerRequest<Blob>(payload, providerHelper, { ...options, task: "image-to-image", }); const { url, info } = await makeRequestOptions(args, providerHelper, { ...options, task: "image-to-image" }); return providerHelper.getResponse(res, url, info.headers as Record<string, string>); }

rather than overriding prepareHeaders and doing this.accessToken = params.accessToken

Your suggestion makes sense. Initially, this was a common/public function, so I took a minimalistic approach and didn't modify it. Now, let me try making some changes here.

I completed the modification and ran the use case successfully

arabot777 · 2025-06-03T15:30:37Z

Hi @arabot777, we recently merged an improvement of error handling for inference (PR: #1504). i've added suggestions on how to incorporate it into the WaveSpeed AI inference provider implementation. Other than that, the PR looks good to me but let's wait for @SBrandeis final review!

Thank you for your reminder. I have completed the new error handling

arabot777 · 2025-06-09T15:02:23Z

Hi @SBrandeis ,

Just checking in—is there anything I can do to help move this PR forward? Let me know if you'd like any changes or have questions. Thanks for your time!

SBrandeis

Looks good - just a few minor comments to address
Let's merge soon and proceed with the next steps: https://huggingface.co/docs/inference-providers/register-as-a-provider

SBrandeis · 2025-06-18T08:55:30Z

packages/inference/src/providers/wavespeed-ai.ts

+			const result: WaveSpeedAIResponse = await resultResponse.json();
+			if (result.code !== 200) {
+				throw new InferenceClientProviderOutputError(
+					`API request to WaveSpeed AI API failed with code ${result.code}: ${result.message}`
+				);
+			}


Already covered by the previous check on resultResponse

ref: https://developer.mozilla.org/en-US/docs/Web/API/Response/ok

Suggested change

const result: WaveSpeedAIResponse = await resultResponse.json();

if (result.code !== 200) {

throw new InferenceClientProviderOutputError(

`API request to WaveSpeed AI API failed with code ${result.code}: ${result.message}`

);

}

SBrandeis · 2025-06-18T08:57:34Z

packages/inference/src/providers/wavespeed-ai.ts

+					const mediaResponse = await fetch(taskResult.outputs[0]);
+					if (!mediaResponse.ok) {
+						throw new InferenceClientProviderApiError(
+							"Failed to fetch response status from WaveSpeed AI API",


Suggested change

"Failed to fetch response status from WaveSpeed AI API",

"Failed to fetch generation output from WaveSpeed AI API",

SBrandeis · 2025-06-18T09:06:55Z

packages/inference/test/InferenceClient.spec.ts

+			HARDCODED_MODEL_INFERENCE_MAPPING["wavespeed-ai"] = {
+				"wavespeed-ai/flux-schnell": {
+					hfModelId: "wavespeed-ai/flux-schnell",
+					providerId: "wavespeed-ai/flux-schnell",
+					status: "live",
+					task: "text-to-image",
+				},
+				"wavespeed-ai/wan-2.1/t2v-480p": {
+					hfModelId: "wavespeed-ai/wan-2.1/t2v-480p",
+					providerId: "wavespeed-ai/wan-2.1/t2v-480p",
+					status: "live",
+					task: "text-to-video",
+				},
+				"wavespeed-ai/hidream-e1-full": {
+					hfModelId: "wavespeed-ai/hidream-e1-full",
+					providerId: "wavespeed-ai/hidream-e1-full",
+					status: "live",
+					task: "image-to-image",
+				},
+				"openfree/flux-chatgpt-ghibli-lora": {
+					hfModelId: "openfree/flux-chatgpt-ghibli-lora",
+					providerId: "wavespeed-ai/flux-dev-lora",
+					status: "live",
+					task: "text-to-image",
+					adapter: "lora",
+					adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",
+				},
+				"linoyts/yarn_art_Flux_LoRA": {
+					hfModelId: "linoyts/yarn_art_Flux_LoRA",
+					providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",
+					status: "live",
+					task: "text-to-image",
+					adapter: "lora",
+					adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",
+				},
+			};


In order to reflect how mappings will work when deployed live, you need to:

add a provider field to the mapping

use the HF model IDs as keys

Suggested change

HARDCODED_MODEL_INFERENCE_MAPPING["wavespeed-ai"] = {

"wavespeed-ai/flux-schnell": {

hfModelId: "wavespeed-ai/flux-schnell",

providerId: "wavespeed-ai/flux-schnell",

status: "live",

task: "text-to-image",

},

"wavespeed-ai/wan-2.1/t2v-480p": {

hfModelId: "wavespeed-ai/wan-2.1/t2v-480p",

providerId: "wavespeed-ai/wan-2.1/t2v-480p",

status: "live",

task: "text-to-video",

},

"wavespeed-ai/hidream-e1-full": {

hfModelId: "wavespeed-ai/hidream-e1-full",

providerId: "wavespeed-ai/hidream-e1-full",

status: "live",

task: "image-to-image",

},

"openfree/flux-chatgpt-ghibli-lora": {

hfModelId: "openfree/flux-chatgpt-ghibli-lora",

providerId: "wavespeed-ai/flux-dev-lora",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",

},

"linoyts/yarn_art_Flux_LoRA": {

hfModelId: "linoyts/yarn_art_Flux_LoRA",

providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",

},

};

HARDCODED_MODEL_INFERENCE_MAPPING["wavespeed-ai"] = {

"black-forest-labs/FLUX.1-schnell": {

provider: "wavespeed-ai",

hfModelId: "wavespeed-ai/flux-schnell",

providerId: "wavespeed-ai/flux-schnell",

status: "live",

task: "text-to-image",

},

"Wan-AI/Wan2.1-T2V-14B": {

provider: "wavespeed-ai",

hfModelId: "wavespeed-ai/wan-2.1/t2v-480p",

providerId: "wavespeed-ai/wan-2.1/t2v-480p",

status: "live",

task: "text-to-video",

},

"HiDream-ai/HiDream-E1-Full": {

provider: "wavespeed-ai",

hfModelId: "wavespeed-ai/hidream-e1-full",

providerId: "wavespeed-ai/hidream-e1-full",

status: "live",

task: "image-to-image",

},

"openfree/flux-chatgpt-ghibli-lora": {

provider: "wavespeed-ai",

hfModelId: "openfree/flux-chatgpt-ghibli-lora",

providerId: "wavespeed-ai/flux-dev-lora",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",

},

"linoyts/yarn_art_Flux_LoRA": {

provider: "wavespeed-ai",

hfModelId: "linoyts/yarn_art_Flux_LoRA",

providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",

},

};

SBrandeis · 2025-06-18T09:07:31Z

packages/inference/test/InferenceClient.spec.ts

+
+			it(`textToImage - wavespeed-ai/flux-schnell`, async () => {
+				const res = await client.textToImage({
+					model: "wavespeed-ai/flux-schnell",


Following my previous comment, model IDs used here need to match the keys in the HARDCODED_MODEL_INFERENCE_MAPPING (which are the HF model IDs)

Suggested change

model: "wavespeed-ai/flux-schnell",

model: "black-forest-labs/FLUX.1-schnell",

arabot777 · 2025-06-22T13:16:40Z

@SBrandeis Thanks for your review and help with the PR! I’ve addressed all the comments—could you help take a look and merge when you get a chance? Really appreciate your time and insights.

arabot777 · 2025-07-10T13:31:06Z

@SBrandeis @hanouticelina Hello, it's been a few weeks. Could you follow up on the progress

SBrandeis · 2025-07-31T08:49:56Z

Hey there!
Thank you for your interest in becoming an Inference Provider and for the excellent work you've put into this integration!
We really appreciate the effort.

However, we're currently in a consolidation phase focusing on growing usage of Inference Providers via new features and integrations rather than expanding to new partners. This means we've temporarily paused onboarding new providers while we work on these improvements.

We're not able to provide a specific timeline for when we'll resume new provider onboarding, but we'd love to revisit this integration in the future.

Thanks again for your contribution and understanding!

arabot777 · 2025-10-14T07:12:52Z

Hello, I have merged the latest main code and completed the test cases. Can we proceed to the next step as soon as possible?

test print:


✓ Wavespeed AI (5) 60757ms
       ✓ textToImage - black-forest-labs/FLUX.1-schnell 7366ms
       ✓ textToImage - openfree/flux-chatgpt-ghibli-lora 13480ms
       ✓ textToImage - linoyts/yarn_art_Flux_LoRA 13563ms
       ✓ textToVideo - Wan-AI/Wan2.1-T2V-14B 60718ms
       ✓ imageToImage - HiDream-ai/HiDream-E1-Full 10142ms
     ↓ PublicAI (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Baseten (2) [skipped]
       ↓ chatCompletion - Qwen3 235B Instruct [skipped]
       ↓ chatCompletion stream - Qwen3 235B [skipped]
     ↓ clarifai (2) [skipped]
       ↓ chatCompletion - DeepSeek-V3_1 [skipped]
       ↓ chatCompletion stream - DeepSeek-V3_1 [skipped]

 Test Files  1 passed (1)
      Tests  5 passed | 120 skipped (125)
   Start at  15:00:48
   Duration  61.40s (transform 352ms, setup 11ms, collect 405ms, tests 60.76s, environment 0ms, prepare 67ms)

packages/inference/test/InferenceClient.spec.ts

SBrandeis · 2025-10-14T09:09:07Z

packages/inference/test/InferenceClient.spec.ts

+				"openfree/flux-chatgpt-ghibli-lora": {
+					provider: "wavespeed-ai",
+					hfModelId: "openfree/flux-chatgpt-ghibli-lora",
+					providerId: "wavespeed-ai/flux-dev-lora",
+					status: "live",
+					task: "text-to-image",
+					adapter: "lora",
+					adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",
+				},
+				"linoyts/yarn_art_Flux_LoRA": {
+					provider: "wavespeed-ai",
+					hfModelId: "linoyts/yarn_art_Flux_LoRA",
+					providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",
+					status: "live",
+					task: "text-to-image",
+					adapter: "lora",
+					adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",
+				},


The value for adapterWeightsPath does not match what the API will return, see: https://huggingface.co/api/models/linoyts/yarn_art_Flux_LoRA?expand=inferenceProviderMapping

adapterWeightsPath is not configurable by the provider mapping API and has the same for all providers

Suggested change

"openfree/flux-chatgpt-ghibli-lora": {

provider: "wavespeed-ai",

hfModelId: "openfree/flux-chatgpt-ghibli-lora",

providerId: "wavespeed-ai/flux-dev-lora",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",

},

"linoyts/yarn_art_Flux_LoRA": {

provider: "wavespeed-ai",

hfModelId: "linoyts/yarn_art_Flux_LoRA",

providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",

},

"openfree/flux-chatgpt-ghibli-lora": {

provider: "wavespeed-ai",

hfModelId: "openfree/flux-chatgpt-ghibli-lora",

providerId: "wavespeed-ai/flux-dev-lora",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "flux-chatgpt-ghibli-lora.safetensors",

},

"linoyts/yarn_art_Flux_LoRA": {

provider: "wavespeed-ai",

hfModelId: "linoyts/yarn_art_Flux_LoRA",

providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",

status: "live",

task: "text-to-image",

adapter: "lora",

adapterWeightsPath: "pytorch_lora_weights.safetensors",

},

packages/inference/test/InferenceClient.spec.ts

SBrandeis

Looks good to me

hanouticelina

looks good! thank you @arabot777

- Fix adapterWeightsPath for LoRA models to match API response - Fix MIME type spacing in imageToImage test (image/png) - Replace hardcoded timeout with TIMEOUT constant

Co-authored-by: Simon Brandeis <[email protected]>

SBrandeis · 2025-10-14T15:18:54Z

packages/inference/src/types.ts

 	"sambanova",
 	"scaleway",
 	"together",
+	"wavespeed-ai",


As discussed offline, let's rename this to wavespeed?

sure. I'll make this change shortly and update the PR.

Done. Renamed "wavespeed-ai" to "wavespeed" and confirmed working through testing.

arabot777 added 2 commits May 5, 2025 09:45

add wavespeed.ai as an inference provider

a4d8504

delete debug log

686931e

arabot777 requested review from SBrandeis, hanouticelina and julien-c as code owners May 5, 2025 02:28

arabot777 and others added 6 commits May 5, 2025 17:31

Merge branch 'main' into feat/wavespeedai

0e71b88

Merge branch 'main' into feat/wavespeedai

4461225

Merge branch 'main' into feat/wavespeedai

e0bf580

Merge branch 'main' into feat/wavespeedai

07af35f

Merge branch 'main' into feat/wavespeedai

fa3afa4

support lora

214ff99

SBrandeis reviewed May 19, 2025

View reviewed changes

hanouticelina added the inference-providers integration of a new or existing Inference Provider label May 20, 2025

arabot777 and others added 4 commits May 20, 2025 21:31

code review

47c64c6

Merge branch 'main' into feat/wavespeedai

7270c5c

code review

ba35791

Merge branch 'main' into feat/wavespeedai

ca35eab

arabot777 requested a review from SBrandeis May 20, 2025 13:44

arabot777 and others added 2 commits May 20, 2025 21:48

delete unused import

80d4640

Merge branch 'main' into feat/wavespeedai

77be0c6

hanouticelina reviewed May 21, 2025

View reviewed changes

arabot777 and others added 4 commits May 22, 2025 00:22

Update packages/inference/src/lib/getProviderHelper.ts

0c77b3b

Co-authored-by: célina <[email protected]>

Update packages/inference/src/lib/getProviderHelper.ts

3ab254e

Co-authored-by: célina <[email protected]>

Merge branch 'main' into feat/wavespeedai

a8fe74c

Merge branch 'main' into feat/wavespeedai

f706e02

SBrandeis reviewed May 22, 2025

View reviewed changes

code review modification

47f41f0

arabot777 requested review from SBrandeis and hanouticelina May 22, 2025 15:32

Merge branch 'main' into feat/wavespeedai

0cfefe8

arabot777 added 2 commits June 7, 2025 11:55

Merge branch 'main' into feat/wavespeedai

64a991d

Merge branch 'main' into feat/wavespeedai

fd20f75

arabot777 added 2 commits June 11, 2025 11:56

Merge branch 'main' into feat/wavespeedai

98465e2

Merge branch 'main' into feat/wavespeedai

4e4ca9c

SBrandeis approved these changes Jun 18, 2025

View reviewed changes

arabot777 and others added 3 commits June 22, 2025 20:36

Merge branch 'main' into feat/wavespeedai

76344db

recode

f145274

Fix the demo

dc66fd4

Merge branch 'main' into feat/wavespeedai

da95bc3

Merge branch 'main' into feat/wavespeedai

781d302

SBrandeis reviewed Oct 14, 2025

View reviewed changes

packages/inference/test/InferenceClient.spec.ts Outdated Show resolved Hide resolved

SBrandeis reviewed Oct 14, 2025

View reviewed changes

packages/inference/test/InferenceClient.spec.ts Outdated Show resolved Hide resolved

SBrandeis reviewed Oct 14, 2025

View reviewed changes

packages/inference/test/InferenceClient.spec.ts Outdated Show resolved Hide resolved

SBrandeis approved these changes Oct 14, 2025

View reviewed changes

SBrandeis requested review from Wauplin and hanouticelina October 14, 2025 09:41

hanouticelina approved these changes Oct 14, 2025

View reviewed changes

arabot777 and others added 2 commits October 14, 2025 21:56

fix: apply code review suggestions for Wavespeed AI provider

8ae96f2

- Fix adapterWeightsPath for LoRA models to match API response - Fix MIME type spacing in imageToImage test (image/png) - Replace hardcoded timeout with TIMEOUT constant

Apply suggestions from code review

8e35b64

Co-authored-by: Simon Brandeis <[email protected]>

SBrandeis reviewed Oct 14, 2025

View reviewed changes

change wavespeed-ai to wavespeed

5efd3dc

SBrandeis merged commit 17fb3a7 into huggingface:main Oct 14, 2025
5 checks passed

arabot777 mentioned this pull request Oct 24, 2025

[inference provider] Add wavespeed.ai as an inference provider huggingface/huggingface_hub#3474

Merged

		- [HF Inference API (serverless)](https://huggingface.co/models?inference=warm&sort=trending)

		type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;

	path: params.mapping.adapterWeightsPath,
	path: params.mapping.hfModelId,,

	"Failed to fetch response status from WaveSpeed AI API",
	"Failed to fetch generation output from WaveSpeed AI API",

	model: "wavespeed-ai/flux-schnell",
	model: "black-forest-labs/FLUX.1-schnell",

[inference provider] Add wavespeed.ai as an inference provider #1424

[inference provider] Add wavespeed.ai as an inference provider #1424

Uh oh!

Conversation

arabot777 commented May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SBrandeis left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hanouticelina left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SBrandeis left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

arabot777 May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

arabot777 commented May 5, 2025 •

edited

Loading

arabot777 May 22, 2025 •

edited

Loading