SLM - Marathi Language Model
Dec 2024
Pretrained an 84M-parameter GPT-2-inspired Marathi LM from scratch (6 layers, 6 heads, 384 dim). Built a custom 32K Marathi tokenizer and a 66K+ story dataset by translating TinyStories.

This is me, sketched out.
Building what comes after us - the age of artificial intelligence.
Hey, I'm Parth. My work lives at the intersection of machine learning, distributed systems, and human interaction. I enjoy building things from first principles, pushing models beyond demos, and crafting AI that feels less like software and more like intelligence.
Pencils down: here's the line I've been drawing, from the very first dot to where I'm standing today.
Navodaya English High School
SSC: 92.80%. The first dots on the page.
KJ Somaiya College of Science & Commerce
HSC Science: 80%. Started leaning hard into computers.
Vidyalankar Institute of Technology
CGPA 8.81. Where the AI obsession really took shape.
AlgoRoots Pvt Ltd
First production system, straight into the deep end.
AlgoRoots Pvt Ltd
Pushed into research: multilingual speech synthesis.
AlgoRoots Pvt Ltd
Where the line reaches today.
Sticky notes from the lab: voice agents, language models & multimodal experiments.
Mar 2026
Fully offline voice AI (Speech → LLM → Voice) with zero cloud dependency. Whisper + Gemma 1B + Kokoro on LiveKit. TTFT < 80 ms, TTS < 500 ms on consumer hardware, with a React control dashboard.
Jan 2026
Modular image retrieval using BLIP captioning + configurable embeddings (Qwen0.6B, GTE, Gemma). Config-driven, incremental indexing and multi-model similarity ranking without full re-indexing.
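A minimal sketch of the incremental-indexing and multi-model ranking idea, not the project's actual code. It assumes captions (for example from BLIP) are already available as strings, and it swaps in a hypothetical hashed bag-of-words function, `toy_embedder`, where a real embedding model would go. The class name `CaptionIndex` and the choice to average cosine similarity across models are likewise assumptions made for illustration.

```python
import hashlib
import math
from typing import Callable, Dict, List, Tuple

Embedder = Callable[[str], List[float]]


def toy_embedder(seed: str, dim: int = 64) -> Embedder:
    """Hypothetical stand-in for a real embedding model (hashed bag-of-words)."""
    def embed(text: str) -> List[float]:
        vec = [0.0] * dim
        for word in text.lower().split():
            digest = hashlib.sha256((seed + word).encode()).digest()
            vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
        return vec
    return embed


def cosine(a: List[float], b: List[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class CaptionIndex:
    """Stores one vector per (image, model) and re-embeds only changed captions."""

    def __init__(self, embedders: Dict[str, Embedder]):
        self.embedders = embedders
        # image_id -> (caption hash, {model name -> vector})
        self.entries: Dict[str, Tuple[str, Dict[str, List[float]]]] = {}

    def update(self, captions: Dict[str, str]) -> List[str]:
        """Embed new or changed captions; return the ids that were (re)embedded."""
        changed = []
        for image_id, caption in captions.items():
            digest = hashlib.sha256(caption.encode()).hexdigest()
            cached = self.entries.get(image_id)
            if cached is not None and cached[0] == digest:
                continue  # caption unchanged, keep the existing vectors
            vectors = {name: fn(caption) for name, fn in self.embedders.items()}
            self.entries[image_id] = (digest, vectors)
            changed.append(image_id)
        return changed

    def search(self, query: str, top_k: int = 3) -> List[Tuple[str, float]]:
        """Rank images by cosine similarity averaged over all configured models."""
        query_vecs = {name: fn(query) for name, fn in self.embedders.items()}
        scored = []
        for image_id, (_, vectors) in self.entries.items():
            total = sum(cosine(query_vecs[n], vectors[n]) for n in self.embedders)
            scored.append((image_id, total / len(self.embedders)))
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:top_k]
```

Calling `update` a second time with one extra caption embeds only that new entry, which is the "without full re-indexing" behaviour in miniature. A real setup would replace `toy_embedder` with the configured models and persist `entries` to disk.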
Jul 2025
Open-source, fully local conversational AI with LlamaIndex + LangGraph. Agentic RAG via dual tools (docs + web) and persistent memory. Q5_K_M-quantized Jan-nano LLM (~60% VRAM savings).
Feb - May 2025
Low-latency AI security using QwenVL-2.5 4B for real-time threat detection, quantized for ~43% VRAM reduction. Evaluation loop cut false positives ~35%; LiveKit + Twilio SIP calls homeowners with context-aware alerts.
Nov 2024
Real-time distress detection app (React Native + TensorFlow + Twilio) that detects voice cues like "help" or screams. GPS tracking + auto-SMS alerts and an "I'm Safe" mode for user control.
the toolkit
Programming
AI & Machine Learning
Frameworks & Tools
Web & Backend
Cloud & Deployment
Databases
from the notebook