Mihir4921/README.md

Hi there, I'm Mihir Prajapati! 👋

Backend & distributed systems engineer with a master's in CS from NYU. I build reliable services, data pipelines, and the infrastructure they run on, with a soft spot for performance and systems that hold up under load.

🔭 Currently building event-driven backends and AI-powered features as a founding engineer
🌱 Interested in distributed systems, GPU/performance work, and applied AI


💡 Projects

  • StockMarket.ai — RAG app that recommends and summarizes stock-prediction research, grounding an LLM in a vector-searched corpus.
  • CUDA Parallelization Prediction — predicts optimal CUDA launch configs with XGBoost + SHAP across seven GPU workloads.
  • RepCRec — distributed database with replicated concurrency control, deadlock detection, and site failure/recovery.
  • OS-Nexus — from-scratch C++ simulators of an OS scheduler, virtual-memory manager, I/O scheduler, and linker.

📚 Publications

  • M. Prajapati et al., "Automatic Question Tagging using Machine Learning and Deep Learning Algorithms," 2022 6th Intl. Conference on Electronics, Communication and Aerospace Technology (ICECA), 2022, pp. 932–938. DOI

🛠️ Skills

  • Languages: Python, Go, C/C++, C#, TypeScript, JavaScript, SQL, Java, Scala, Bash/Shell
  • Backend & APIs: FastAPI, Spring Boot, Node.js, Flask, REST APIs, GraphQL, Microservices, System Design, Distributed Systems
  • Data & messaging: PostgreSQL, MySQL, MongoDB, Redis, Kafka, Apache Spark, Airflow, ChromaDB, ETL
  • Cloud & DevOps: AWS, GCP, Docker, Kubernetes, Terraform, Ansible, Jenkins, GitHub Actions, OpenShift, CI/CD
  • ML & AI: PyTorch, Hugging Face, RAG, LLM Evaluation, LangChain, XGBoost, CUDA, GPU Programming, HPC
  • Testing & observability: Pytest, JUnit, Selenium, Postman, Prometheus, Splunk, Grafana

📫 Contact

Pinned

  1. Stockmarket.ai

    RAG application that recommends and summarizes stock-prediction research papers, grounding an LLM in a vector-searched corpus.

    TypeScript

  2. CUDA-Parallelization-Prediction

    Predicts the best-performing CUDA launch configuration for a kernel.

    Jupyter Notebook

  3. OS-Nexus

    From-scratch C++ simulators of core OS internals: CPU scheduler, paging virtual-memory manager, I/O scheduler, and a two-pass linker.

    C++

  4. RepCRec

    Distributed database simulator with replicated concurrency control, deadlock detection, and site failure/recovery across 10 sites.

    C++

  5. Roofline-Modeling-CML-Sp2024

    Roofline performance analysis of CNN training (ResNet-18, AlexNet) on A100 vs V100 GPUs using NVIDIA Nsight Compute.

    Python