R&D Lead, DevOps and RAG Backend developer, Pendium Health
Jan, 2023 - Present
Authored and deployed Python-based infrastructure code across core AWS services serving as foundation for RAG systems for healthcare startups and auto-scaled FastAPI streaming architecture. Led cloud migration from monolithic scripts to Dockerized microservices on AWS.
Developed RAG systems for healthcare startups, boosting retrieval accuracy by 40% via custom retrieval agents (Chain of Abstraction, llamaindex).
Built auto-scaled FastAPI streaming architecture supporting 2k+ daily requests with <200ms latency and 98% uptime (AWS ELB).
Led cloud migration from monolithic scripts to Dockerized microservices on AWS (EC2, ECR, ELB, R53), integrating TLS termination and GitHub Actions CI/CD pipelines.
Achieved 90% reduction in deployment downtime via container orchestration.
Accelerated release cycles (12 hours → 10 mins) through automated build-test-deploy workflows.
Implemented zero-touch certificate renewal using AWS Certificate Manager (ACM).