Staff Site Reliability Engineer, Doctolib
Jan, 2024 - Present
Leading infrastructure modernization and reliability initiatives for Europe's largest health tech platform serving 100,000+ concurrent users across France, Germany, and Italy.
Engineered Argo Workflows ecosystem as complete replacement for legacy kubetoolbox with production-grade automation (Azure AD SSO, GitHub notifications, ECR integration, event sensors, CLI tooling) — enabling teams to automate complex operational tasks at scale
Designed Datadog Deployment Gates configurable validation ecosystem with 10+ dedicated monitors replacing static queries, improving canary rollout reliability and deployment confidence
Co-authored 'Paradigm Shift' strategy shifting from centralized to decentralized, library-based configuration — enabling team autonomy and reducing SRE review bottlenecks
Implemented EKS workload optimization strategy (cost reduction), redesigned monolith pod sizing, and optimized preproduction environments
Advanced production-grade canary deployment strategy and ArgoCD integration for strengthened release confidence and rollback capabilities