Staff Site Reliability Engineer, House Rx
Mar 2023 - Oct 2025 (2 years 6 months)
Managed a geo-distributed, multidisciplinary team
Owned infrastructure, reliability, and release management for a PHI-regulated product suite, raising Apdex from below 90 to 97
Maintained and extended self-hosted LGTM observability stack (Loki, Grafana, Tempo, Mimir)
Automated CI/CD and dependency provisioning with Terraform & Actions, cutting deploy time by 89% and quadrupling redundancy
Led incident management process redesign, improving MTTR and postmortem quality