Photo of Parth Kale

that's me! πŸ‘‹

✎ this is me, sketched out β€”

Parth Kale

AI Engineer

Building what comes after us - the age of artificial intelligence.

Mumbai, India

Hey β€” I'm Parth. My work lives at the intersection of machine learning, distributed systems, and human interaction. I enjoy building things from first principles, pushing models beyond demos, and crafting AI that feels less like software and more like intelligence

Let's cook !

The Career Path

Pencils down β€” here's the line I've been drawing, from the very first dot to where I'm standing today.

  1. 2019 β€” 2020

    Where it started

    Navodaya English High School

    SSC β€” 92.80%. The first dots on the page.

  2. 2021 β€” 2022

    HSC Β· Science

    KJ Somaiya College of Science & Commerce

    HSC Science β€” 80%. Started leaning hard into computers.

  3. 2022 β€” 2026

    B.Tech Β· IT + Honors in AI/ML

    Vidyalankar Institute of Technology

    CGPA 8.81. Where the AI obsession really took shape.

  4. Apr β€” Jun 2025

    AI Developer Intern

    AlgoRoots Pvt Ltd

    First production system, straight into the deep end.

    • Built a production-scale call agent with RAG, function calling, prompt engineering & SIP + LiveKit β€” handling 60,000+ automated calls/day.
    • Implemented fault-tolerant, scalable backend services for high uptime.
    • Built logging & monitoring pipelines for real-time analytics.
  5. Jul 2025 β€” May 2026

    AI Researcher Intern

    AlgoRoots Pvt Ltd

    Pushed into research β€” multilingual speech synthesis.

    • Designed & preprocessed a custom multilingual dataset for fine-tuning open-source TTS models.
    • Improved Hindi–English code-switching via fine-tuning an open-source model.
    • Enhanced speech quality & NLP performance for multilingual voice systems.
  6. Jun 2026 β€” Present

    AI Engineer

    AlgoRoots Pvt Ltd

    Where the line reaches today.

    • Build & deploy production-grade AI agents for voice interviewing, proctoring & candidate assessment.
    • Develop full-stack AI systems across LLMs, speech recognition, speech synthesis & real-time comms.
    • Design automated evaluation frameworks that generate structured interview feedback.
    • Own the full lifecycle β€” prototyping, model integration, cloud deployment & monitoring.

Stuff I've Built

Sticky notes from the lab β€” voice agents, language models & multimodal experiments.

SLM β€” Marathi Language Model

βŒ₯ GitHub β†—

Dec 2024

Pretrained an 84M-parameter GPT-2-inspired Marathi LM from scratch (6 layers, 6 heads, 384 dim). Built a custom 32K Marathi tokenizer and a 66K+ story dataset by translating TinyStories.

PyTorchGPT-2TokenizerPretraining

OpenBee β€” Offline Voice Assistant

βŒ₯ GitHub β†—

Mar 2026

Fully offline voice AI (Speech β†’ LLM β†’ Voice) with zero cloud dependency. Whisper + Gemma 1B + Kokoro on LiveKit. TTFT < 80ms, TTS < 500ms on consumer hardware, with a React control dashboard.

WhisperGemmaKokoroLiveKitReact

MemorySearch β€” Semantic Image Search

βŒ₯ GitHub β†—

Jan 2026

Modular image retrieval using BLIP captioning + configurable embeddings (Qwen0.6B, GTE, Gemma). Config-driven, incremental indexing and multi-model similarity ranking without full re-indexing.

BLIPEmbeddingsRetrievalMultimodal

LocalMind β€” Local Agentic RAG

βŒ₯ GitHub β†—

Jul 2025

Open-source, fully local conversational AI with LlamaIndex + LangGraph. Agentic RAG via dual tools (docs + web) and persistent memory. Q5-M quantized Jan-nano LLM (~60% VRAM savings).

LlamaIndexLangGraphRAGFastAPI

Alice β€” Home Surveillance System

βŒ₯ GitHub β†—

Feb β€” May 2025

Low-latency AI security using QwenVL-2.5 4B for real-time threat detection, quantized for ~43% VRAM reduction. Evaluation loop cut false positives ~35%; LiveKit + Twilio SIP calls homeowners with context-aware alerts.

QwenVLQuantizationLiveKitTwilio

Draupadi β€” AI Safety App

βŒ₯ GitHub β†—

Nov 2024

Real-time distress detection app (React Native + TensorFlow + Twilio) that detects voice cues like "help" or screams. GPS tracking + auto-SMS alerts and an "I'm Safe" mode for user control.

React NativeTensorFlowTwilioGPS

the toolkit

Skills

Programming

PythonJavaScript / TypeScriptC++DSA

AI & Machine Learning

LLMsGenerative AIAI AgentsRAGFine-Tuning (LoRA, PEFT)QuantizationComputer VisionMultimodal AI

Frameworks & Tools

Hugging FaceLangChainLlamaIndexLangGraphPyTorchTensorFlowvLLM

Web & Backend

React.jsNext.jsNode.jsFastAPIREST APIsRealtime (Twilio, SIP)

Cloud & Deployment

AWSAzureDockerVercelCI/CD (GitHub Actions, Jenkins)

Databases

PostgreSQL / NeonDBMongoDB

from the notebook

Writing