Senior Site Reliability Engineer, Bloomberg LP
Mar, 2021 - Present
- Reduced ticket volume by 1,400+ compared to 2018 through continuous process improvements and PowerShell automation.
- Migrating SQL Server 2008 / 2008 R2 database instances to SQL Server 2014 / 2017 / 2019.
- Designed a migration strategy using Peer-to-Peer (P2P) replication, giving business-critical applications both a roll-forward and a rollback path.
- Tuned P2P replication to sustain more than 5M transactions per hour.
- Created and implemented a SQL Server monitoring solution using the open-source tools Grafana, Telegraf, and InfluxDB (a time-series database). This enabled us to retire third-party monitoring software, saving a significant amount in license and support costs.
- Created and implemented blocking and deadlock detection, and built self-service Humio real-time charts so application teams can troubleshoot issues themselves.
- Created an automated failover process for Always On availability groups that auto-rebalances the topology to ensure there are at least 2 synchronous replicas. This ensures that during a machine crash, SQL Server can fail over automatically without data loss. It also allows machine maintenance on a given node, since the process switches that node to asynchronous commit, which minimizes impact on the primary replica.
- Automated all pre- and post-migration steps using dbatools (https://dbatools.io/), PowerShell, and custom T-SQL scripts.
- Successfully completed a proof of concept (POC) running SQL Server on Kubernetes (K8s).
- Implemented Query Store on SQL Server 2016 and later instances, providing query performance statistics to application teams.
- Mentoring other team members on the SQL Server platform, along with supporting the open-source MySQL RDBMS.