- Kanpur, Uttar Pradesh
- https://www.linkedin.com/in/krtarunsingh/
Popular repositories
- on-device-npu-rag (Public): On-device, NPU-first RAG app for Copilot+ PCs & Android. ONNX Runtime (QNN/DirectML/CPU) + FAISS/BM25, optional Ollama. Private, offline notes search.
- webgpu-webllm-app (Public): Run a large language model fully in the browser, with no servers and no API keys. The app uses WebLLM + WebGPU for acceleration, and automatically falls back to a WASM (wllama) runtime when WebGPU isn’t a…
- voice-agent-realtime-mcp-sip (Public): Production-ready starter kit for building low-latency AI voice agents with the OpenAI Realtime API. Includes FastAPI server, WebRTC client, and MCP-style tools. Test order lookups, create tickets, and …
- OnDeviceLLM-Android (Public): Starter repo for building an offline Android Chat + Translate app with multi-path LLM backends: llama.cpp (Adreno OpenCL), MLC-LLM (TVM), and WebLLM/WebGPU fallback. Includes JNI stubs, Kotlin UI, …
Python 2