krtarunsingh

Popular repositories

  1. on-device-npu-rag Public

    On-device, NPU-first RAG app for Copilot+ PCs & Android. ONNX Runtime (QNN/DirectML/CPU) + FAISS/BM25, optional Ollama. Private, offline notes search.

    Python 8 2

  2. webgpu-webllm-app Public

    Run a large language model fully in the browser — no servers, no API keys. The app uses WebLLM + WebGPU for acceleration, and automatically falls back to a WASM (wllama) runtime when WebGPU isn’t a…

    JavaScript 5 1

  3. voice-agent-realtime-mcp-sip Public

    Production-ready starter kit for building low-latency AI voice agents with OpenAI Realtime API. Includes FastAPI server, WebRTC client, and MCP-style tools. Test order lookups, create tickets, and …

    JavaScript 4 5

  4. gen-ai-chess-app Public

    JavaScript 3 2

  5. llamaextract-document-workflows Public

    Python 2

  6. OnDeviceLLM-Android Public

    Starter repo for building an offline Android Chat + Translate app with multi-path LLM backends: llama.cpp (Adreno OpenCL), MLC-LLM (TVM), and WebLLM/WebGPU fallback. Includes JNI stubs, Kotlin UI, …

    Python 2