Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Startup latency, power and performance preference hints #95

Open
anssiko opened this issue Sep 3, 2020 · 1 comment
Open

Startup latency, power and performance preference hints #95

anssiko opened this issue Sep 3, 2020 · 1 comment
Labels
Developer's Perspective Machine Learning Experiences on the Web: A Developer's Perspective Discussion topic Topic discussed at the workshop

Comments

@anssiko
Copy link
Member

anssiko commented Sep 3, 2020

The Mobile-first web-based Machine Learning talk by @JRMeyer and @lrianu explains how Artie's game engine uses TensorFlow Lite for Microcontrollers to run the computer vision models on the client, in the browser. IIUC TF Lite Micro was chosen in part to minimize the startup latency that was crucial for the experience. (Another key design consideration was avoiding dynamic memory allocations.)

This feedback suggests related Web APIs for ML inference should consider catering for various use cases, some of which are latency sensitive during inference time, some want to minimize startup latency, some care about battery performance (e.g. long running tasks on mobile), to give some example.

There has been some work in this area in Web APIs:

WebGL defines WebGLPowerPreference. Similarly, WebNN API for hardware accelerated inference defines PowerPreference, both currently offering "default", "low-power", or "high-performance" options.

Loop in @huningxin and @wchao1115 for WebNN and @RafaelCintron for WebGL expertise to discuss whether we have captured a good set of preferences, and whether there's room for more granular controls from an implementation feasibility point of view. From use cases point of view, the more knobs the better.

@JRMeyer and @lrianu, just curious, did you use Unity's tooling to cross-compile TF Lite Micro into WebAssembly?

@jasonmayes FYI for this interesting usage of TF Lite Micro.

@anssiko anssiko added Developer's Perspective Machine Learning Experiences on the Web: A Developer's Perspective User's Perspective Machine Learning Experiences on the Web: A User's Perspective labels Sep 3, 2020
@anssiko anssiko removed the User's Perspective Machine Learning Experiences on the Web: A User's Perspective label Sep 24, 2020
@anssiko
Copy link
Member Author

anssiko commented Sep 24, 2020

(This topic was discussed in the context of #68 during the Developer's Perspective Live Session. Minutes of that live session will be published at https://www.w3.org/2020/06/machine-learning-workshop/minutes/20200923.html soon.)

@anssiko anssiko added this to the 2020-09-23 Live Session #3 milestone Sep 24, 2020
@dontcallmedom dontcallmedom added the Discussion topic Topic discussed at the workshop label Oct 9, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Developer's Perspective Machine Learning Experiences on the Web: A Developer's Perspective Discussion topic Topic discussed at the workshop
Projects
None yet
Development

No branches or pull requests

2 participants