Full Stack AI Engineer
Lead end-to-end development and deployment of GenAI SaaS applications, leveraging frontier LLMs and integrating them into robust user-facing web platforms with a full-stack AI operations mindset.
- Built an Intelligent Document Processor combining GenAI + computer vision to automate extraction from complex unstructured documents.
- Built advanced and lightweight RAG pipelines (LangChain / LlamaIndex) with PgVector and Mongo Vector, reducing hallucinations and improving answer grounding.
- Reduced inference costs by 30% by routing targeted tasks to SLMs and reserving larger LLMs for higher-complexity reasoning.
- Implemented asynchronous workload handling with Celery worker pools and queue-based orchestration for long-running AI tasks.
- Strengthened platform security using OAuth 2.0 enterprise auth, prompt-injection guardrails, PII redaction, and dependency vulnerability scanning.
- Built observability and cost analytics workflows (Prometheus/Grafana + usage reporting), contributing to a 15% cloud compute spend reduction.
- Resolved persistent
429 RESOURCE_EXHAUSTEDerrors in streaming generation with custom retries and exponential backoff.