
feat: add ondeviceml.space — browser-based on-device AI inference#2141

Open
abhid1234 wants to merge 1 commit into huggingface:main from abhid1234:add-ondeviceml-local-app

Conversation


@abhid1234 abhid1234 commented May 5, 2026

What this adds

Registers ondeviceml.space as a local app for Gemma and Qwen2 transformer models.

ondeviceml.space is a client-side-only AI gallery: models run entirely in the browser via WebGPU / WASM (MediaPipe Tasks GenAI runtime). No server, no install, no account — the model weights download once to the browser cache and all inference stays on-device.

Deep-link behavior

When a user clicks "Open in ondeviceml.space" from a model page, the URL

https://ondeviceml.space/?hf_model=google/gemma-3n-E2B-it&task=image-text-to-text

lands on the Gallery page, which:

  1. Matches the hf_model param against the local catalog
  2. Shows a confirmation banner: [ Gemma 3n E2B → Chat ] [ Load & Open ] [ Dismiss ]
  3. Downloads the model weights (first visit only; cached via browser storage after)
  4. Navigates to the matched feature

The deep-link intake is live on production — you can test it now:

https://ondeviceml.space/?hf_model=google/gemma-3n-E2B-it&task=image-text-to-text
https://ondeviceml.space/?hf_model=Qwen/Qwen2.5-1.5B-Instruct&task=text-generation
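Step 1 of the intake flow above can be sketched roughly as follows. This is an illustrative TypeScript sketch only: the `CatalogEntry` shape, the `CATALOG` contents, and the `matchDeepLink` helper are assumptions for this example, not the actual ondeviceml.space source.

```typescript
// Hypothetical catalog entry shape for the gallery's local catalog.
interface CatalogEntry {
	hfModel: string;
	task: string;
	feature: string;
}

// Illustrative catalog contents (two of the models listed in this PR).
const CATALOG: CatalogEntry[] = [
	{ hfModel: "google/gemma-3n-E2B-it", task: "image-text-to-text", feature: "Chat" },
	{ hfModel: "Qwen/Qwen2.5-1.5B-Instruct", task: "text-generation", feature: "Chat" },
];

// Parse the deep-link URL and match the hf_model param against the catalog.
function matchDeepLink(link: string): CatalogEntry | undefined {
	const hfModel = new URL(link).searchParams.get("hf_model");
	return CATALOG.find((entry) => entry.hfModel === hfModel);
}
```

A matched entry drives the confirmation banner and feature navigation in steps 2–4; an unmatched `hf_model` simply yields `undefined`, so the gallery can fall back to its normal landing page.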

displayOnModelPage logic

Shows only for transformers-library models whose pipeline is text-generation or image-text-to-text and whose model_type is one of gemma3n, gemma4, gemma3, or qwen2 — the architectures validated to run in-browser via MediaPipe. Happy to expand or tighten the list based on your feedback.
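The gating described above amounts to a predicate along these lines. This is a simplified sketch: the `ModelData` interface here is a stand-in for the real huggingface.js type, and only the fields the check reads are modeled.

```typescript
// Simplified stand-in for the model metadata the gate inspects.
interface ModelData {
	library_name?: string;
	pipeline_tag?: string;
	config?: { model_type?: string };
}

// Architectures this PR lists as validated for the in-browser runtime.
const supportedModelTypes = ["gemma3n", "gemma4", "gemma3", "qwen2"];

// Show the "Open in ondeviceml.space" option only for transformers models
// with a supported pipeline and a supported model_type.
function displayOnModelPage(model: ModelData): boolean {
	return (
		model.library_name === "transformers" &&
		(model.pipeline_tag === "text-generation" ||
			model.pipeline_tag === "image-text-to-text") &&
		supportedModelTypes.some((t) => model.config?.model_type?.startsWith(t) ?? false)
	);
}
```

Note that the `startsWith` comparison is the part Bugbot flags below as broader than an exact allow-list.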

Catalog today

| Model | HF `pipeline_tag` | Browser runtime |
| --- | --- | --- |
| Qwen/Qwen2.5-1.5B-Instruct | text-generation | WebGPU/WASM |
| google/gemma-3n-E2B-it | image-text-to-text | WebGPU/WASM |
| google/gemma-4-E2B-it | image-text-to-text | WebGPU/WASM |
| google/gemma-3n-E4B-it | image-text-to-text | WebGPU/WASM |

Checklist

  • Deep-link is live and tested on production
  • displayOnModelPage targets only supported model types
  • No snippet needed — the deep-link opens a web URL directly
  • No macOS-only flag — works on any browser with WebGPU support (Chrome 113+, Edge)
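The WebGPU condition in the last point can be probed client-side with a one-line feature check. `navigator.gpu` is the standard WebGPU entry point; the `hasWebGPU` helper name is ours for this sketch, not part of the PR.

```typescript
// Feature-detect WebGPU: navigator.gpu is only defined in browsers that
// implement the WebGPU API (e.g. Chrome 113+, Edge). When absent, a site
// like this would fall back to the WASM path mentioned above.
function hasWebGPU(): boolean {
	const nav = (globalThis as { navigator?: unknown }).navigator;
	return typeof nav === "object" && nav !== null && "gpu" in nav;
}
```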

Note

Low risk: adds a new LOCAL_APPS entry and display gating logic plus an external deep-link URL; no existing execution paths or core inference logic are modified.

Overview
Adds ondeviceml.space to the LOCAL_APPS registry so supported Hugging Face model pages can show an “Open in ondeviceml.space” option.

The new entry is only shown for transformers models with text-generation or image-text-to-text pipelines whose config.model_type starts with one of gemma3n, gemma4, gemma3, or qwen2, and it deep-links to https://ondeviceml.space/?hf_model=...&task=....
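On the Hub side, the deep-link described above amounts to formatting two query parameters. A minimal sketch, assuming a helper of our own naming (`ondeviceMlDeepLink`), not the actual LOCAL_APPS entry:

```typescript
// Build the ondeviceml.space deep-link from a model id and its pipeline tag,
// matching the ?hf_model=...&task=... format shown in this PR.
// (Illustrative helper; the merged LOCAL_APPS code may differ.)
function ondeviceMlDeepLink(modelId: string, task: string): string {
	return `https://ondeviceml.space/?hf_model=${modelId}&task=${task}`;
}
```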

Reviewed by Cursor Bugbot for commit fcba75d. Bugbot is set up for automated code reviews on this repo. Configure here.


@cursor cursor Bot left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.



return (
	model.library_name === "transformers" &&
	(model.pipeline_tag === "text-generation" || model.pipeline_tag === "image-text-to-text") &&
	supportedModelTypes.some((t) => model.config?.model_type?.startsWith(t))
);


Prefix allows unsupported architectures

Medium Severity

startsWith broadens the allow-list, so distinct architectures like qwen2_5_vl pass the qwen2 check. The button can appear for models outside the ondeviceml catalog or validated runtime, producing deeplinks the site cannot load.
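One way to address the over-match is to compare `model_type` exactly instead of by prefix. A sketch (the `isSupportedModelType` helper is our name for it, not code from this PR):

```typescript
// Exact-match variant of the allow-list check: "qwen2_5_vl" no longer
// slips through the "qwen2" entry, unlike the startsWith version.
const supportedModelTypes = new Set(["gemma3n", "gemma4", "gemma3", "qwen2"]);

function isSupportedModelType(modelType: string | undefined): boolean {
	return modelType !== undefined && supportedModelTypes.has(modelType);
}
```

The trade-off is that each newly validated architecture must be added to the set explicitly, which matches the PR author's offer to "expand or tighten the list" on request.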



