Open for opportunities

AI Engineer who ships products people actually use

3.5+ years building AI for production

Snehit Vaddi

Currently building Agentic AI at ModMed, shipping products that touch millions of patients. From self-driving cars to medical AI to side projects helping thousands land jobs — I build things that work.

Stuff I've Built

From self-driving cars to medical AI and agentic applications.
Here's what I've been building.

AI Influencer Bot
🤖 Agentic AI | 2026
Python, LLM, +2

RAG-Anything
🧠 GenAI & LLMs | 2024
Python, LangChain, +2

MEDHALT
🧠 GenAI & LLMs | 2025
Python, NER, +2

FineTune Resume
🌐 Web Apps | 2025
Next.js, AI, +2

Resume2Portfolio
🌐 Web Apps | 2025
Next.js, AI, +2

WhatsApp R2Park
🌐 Web Apps | 2026
Node.js, WhatsApp API, +2

HackSwipe
🌐 Web Apps | 2026
JavaScript, Web App, +2

H1B Wage Finder
🌐 Web Apps | 2026
Python, Data Analysis, +2

WhatsApp R2Park Bot
🌐 Web Apps | 2026
JavaScript, WhatsApp API, +2

Finetune Resume
🛠️ Tools | 2026
TypeScript, AI, +2

Research & Publications

Published and under-review work across LLM reasoning, interpretability, and applied computer vision.

arXiv | 2026 | Under Review

Can Small Models Reason About Legal Documents? A Comparative Study

LLMs, Reasoning, Legal AI, Small Models

arXiv | 2026 | Under Review

Do Hallucination Neurons Generalize? Evidence from Cross-Domain Transfer in LLMs

LLMs, Hallucination, Interpretability

Plants | 2025 | Published | First Author

Detecting Escherichia coli Contamination on Plant Leaf Surfaces Using UV-C Fluorescence Imaging and Deep Learning

Computer Vision, YOLOv11, Agriculture, Food Safety

IEEE ICCMST | 2023 | Published

An Effective Model for Smartphone-Based Pothole Classification and Admin Alerting System

Computer Vision, Edge AI, Mobile

IEEE | 2022 | Published

ECG-Based Early Heart Attack Prediction Using Neural Networks

Healthcare AI, Neural Networks, Time Series, Cardiology

My Toolkit

The technologies I use to turn caffeine into code

🤖 GenAI & LLMs

LangChain
OpenAI API
Claude
RAG Systems
Hugging Face
Fine-tuning
Prompt Engineering
Vector DBs

🧠 Machine Learning

TensorFlow
PyTorch
Keras
Scikit-learn
Computer Vision
NLP
Deep Learning
MLOps

Data Engineering

Python
SQL
PySpark
Kafka
Airflow
AWS (S3, Redshift)
Snowflake
dbt

💻 Development

Python
TypeScript
React/Next.js
FastAPI
Node.js
Swift (iOS)
Git
Docker

☁️ Cloud & Tools

AWS
GCP
Vercel
GitHub Actions
Jupyter
VS Code
Linux
Streamlit

Where I've Worked

From building data pipelines at scale to pushing the boundaries of AI research

Full-time | Feb 2025 - Apr 2026

AI Engineer — GenAI Applications & LLM Systems

ModMed | Boca Raton, FL (Remote)
  • Shipped Clinical Ambient AI Scribe serving 15,000+ providers across 11 specialties (400K+ daily encounters), automating 70% of documentation via real-time transcription + LLM SOAP generation
  • Built agentic document pipeline (OpenAI Agents SDK + fine-tuned Qwen2-VL VLM) routing 10M+ clinical pages/month, replacing a $400K/month vendor with a $20K in-house system (95% cost reduction)
  • Built Text2SQL + clinical knowledge graph over ModMed’s EHR warehouse (200+ tables, pgvector embeddings), cutting analyst request volume by 60%
  • Architected production multi-agent RAG with LoRA-finetuned SLMs, hybrid pgvector + BM25 retrieval, and cross-encoder reranking — 94% retrieval precision across 50K+ clinical documents
  • Open-sourced MEDHALT — clinical hallucination detection (DeBERTa NER + LLM-as-judge) achieving 92% accuracy vs. GPT-4, with MLflow tracking and golden-set regression testing
  • Shipped LangChain/LangGraph/Claude monitoring framework for Scribe quality, cutting incident response to under 5 minutes with PHI-safe pipelines and prompt-injection filtering
Python, LangChain, LangGraph, VLMs, RAG, pgvector, Databricks, Kubernetes, MLflow

Internship | May 2024 - Jul 2024

AI Software Developer Intern

GeoSpider AI | USA (Remote)
  • Built LangGraph multi-agent RAG that autonomously resolved 65% of customer tickets across a 50K-doc knowledge base, with dynamic routing and LLM-as-judge scoring
  • Built LLM-as-judge routing layer with dynamic few-shot prompting, improving helpfulness from 43% → 76% and relevance by 30%
  • Designed FAISS + keyword hybrid search with semantic reranking — 92% recall@10, serving 150+ concurrent users via vLLM with 40% p95 latency reduction
  • Implemented Redis-backed multi-turn agent memory and FastAPI inference gateway with fallback logic and structured logging
LangGraph, FAISS, vLLM, Redis, FastAPI, LLM-as-Judge

Research | Feb 2023 - Dec 2024

Graduate Researcher, AI/ML

University of Florida | Gainesville, FL
  • Developed hybrid YOLOv8-ViT model improving small-object detection by 15% with Grad-CAM / EigenCAM explainability. Published at SPIE 2025 and IEEE 2023; presented at both
  • Built React dashboard with Grad-CAM visualizations replacing static PDF reports (adoption 15% → 85%). Prototyped CLIP-based multi-modal retrieval between lab images and research reports
  • Automated model retraining via MLflow + GitHub Actions CI/CD, cutting deployment from 4 hours to 15 minutes
Python, YOLOv8, ViT, CLIP, MLflow, Grad-CAM

Full-time | Jun 2021 - Dec 2022

Software Data Engineer

AT&T (via Accenture) | Bangalore, India
  • Engineered BERT + XGBoost intent classifier (88% F1) predicting technician dispatch necessity — eliminated 12K unnecessary dispatches/year, saving $2M annually
  • Built Elasticsearch + Word2Vec anomaly detection for network telemetry, cutting diagnosis time by 40% and Tier-2 escalations by 25%
  • Optimized PySpark / Delta Lake pipelines processing 1M+ logs/day (30% latency reduction); designed Azure Synapse warehouse with dbt
  • Built NLP ticket-categorization pipeline with fine-tuned BERT — 91% accuracy, 500K+ monthly interactions
Python, PySpark, BERT, Azure Synapse, Delta Lake, Elasticsearch

Education

Master of Science in Computer & Information Science

University of Florida

Gainesville, FL | 2023 - 2024

Focus: Machine Learning, Computer Vision, Data Engineering

Bachelor of Technology in Computer Science

GPA: 3.9/4.0

GITAM University

Visakhapatnam, India | 2017 - 2021

Focus: Software Engineering, Data Science

Let's grab a virtual coffee

Got an interesting project?
Let's talk!

I'm always excited to discuss AI, data engineering, or just tech in general. Whether you have a project idea, job opportunity, or just want to say hi - my inbox is open!

Drop me an email