Now: Building at Sunus AI
Based: Skillman · New Jersey · Remote
Practice: GenAI / RAG / Full-stack

Grishman Paruchuru.

A Full Stack AI Engineer designing, shipping, and scaling production GenAI systems — RAG, document intelligence, and the products that wrap them.

4+ years shipping software
30% inference cost reduction with SLM routing
15% cloud spend reduction via usage analytics
1 published research paper

An engineer
who ships AI that actually works.

§ 01 — Index

I spend my days at the intersection of large language models, retrieval systems, and the product surfaces people actually use. I've built RAG pipelines that retrieve the right context the first time, intelligent document processors that turn messy PDFs into clean structured data, and the SaaS platforms that put those capabilities into customers' hands.

The work I'm proudest of doesn't sit in a Jupyter notebook — it ships, it stays up, and it earns trust. That means thinking about evals, latency, cost, retries, schema validation, observability, and UX with the same seriousness as the model itself.

Don't take my word for it
— ask my portfolio.

§ 03 — Playground

The Playground is a small, runnable retrieval system whose corpus is my own résumé, projects, and notes. Type a question and a deterministic local simulation shows you which documents it pulled, their similarity scores, and a grounded answer. It is the same loop I build at work, shrunk down for you to poke at.
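
For readers who want to see the shape of that loop in code, here is a minimal sketch in Python. It is not the Playground's actual implementation: the three-entry `CORPUS` is a placeholder paraphrased from this page, the function names (`tokenize`, `cosine`, `retrieve`, `answer`) are made up for illustration, and plain bag-of-words cosine similarity stands in for whatever scoring the real Playground uses. The "grounded answer" here is simply the text of the top-scoring document.

```python
import math
import re
from collections import Counter

# Placeholder corpus: snippets paraphrased from this page,
# NOT the real Playground data.
CORPUS = {
    "rag": "RAG pipelines that retrieve the right context the first time",
    "docs": "Intelligent document processors that turn messy PDFs into clean structured data",
    "routing": "SLM routing that reduced inference cost by 30 percent",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, k=2):
    """Return the top-k (doc_id, score) pairs for a question.

    Ties are broken by doc_id so the ranking is deterministic.
    """
    query = Counter(tokenize(question))
    scored = [
        (doc_id, cosine(query, Counter(tokenize(text))))
        for doc_id, text in CORPUS.items()
    ]
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:k]


def answer(question, k=2):
    """Return the retrieved sources and an answer drawn only from them."""
    hits = [(doc_id, score) for doc_id, score in retrieve(question, k) if score > 0]
    if not hits:
        return {"sources": [], "answer": "No matching document."}
    top_id = hits[0][0]
    return {"sources": hits, "answer": CORPUS[top_id]}


if __name__ == "__main__":
    result = answer("How do you handle messy PDFs?")
    for doc_id, score in result["sources"]:
        print(f"{doc_id}: {score:.3f}")
    print(result["answer"])
```

Because there is no model call and no randomness, the same question always yields the same documents and scores, which is the property that makes a local simulation easy to inspect.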