Senior Infrastructure Software engineer, Zendesk
Oct, 2024 - Present
Serve as a key member of the DevSecOps team at Zendesk, driving operational excellence and security across cloud-native infrastructure. Lead initiatives spanning site reliability engineering (SRE) best practices, infrastructure security hardening, and enterprise-scale Kubernetes operations on Google Cloud Platform (GCP).
Architect and maintain production-grade Google Kubernetes Engine (GKE) clusters, ensuring high availability, scalability, and security compliance for mission-critical services
Design and implement comprehensive observability solutions using Datadog and OpenTelemetry, establishing monitoring, tracing, and alerting frameworks that reduce MTTR and improve system visibility
Develop automation tooling and infrastructure solutions using Bash, Python, and Golang, streamlining operational workflows and reducing manual intervention
Implement Infrastructure as Code (IaC) practices using Pulumi and Terraform, managing cloud resources with version control, testing, and automated deployment pipelines
Enforce security best practices across the infrastructure stack, conducting security assessments, implementing compliance controls, and collaborating with security teams on vulnerability remediation
Champion SRE principles including error budgets, SLO/SLI definitions, and blameless post-mortems to foster a culture of reliability and continuous improvement
Collaborate cross-functionally with development teams to optimize application performance, troubleshoot production issues, and implement deployment strategies including blue-green and canary deployments