Bryce Bonvillain

Senior Site Reliability Engineer


About

As an SRE, I have a deep understanding of cloud computing and have implemented automation solutions that ensure the availability, performance, and scalability of systems.

Work Experience

  • Senior Site Reliability Engineer, GfK

    May, 2021 - Present

    • Built and managed a resilient Vault cluster on GKE, using the Vault Secrets Operator for seamless secrets management across GKE clusters.

    • Implemented GCP secrets engine for IAM role-based token authentication, orchestrated with Terraform.

    • Provided mentorship and guidance to junior team members, enhancing team performance.

    • Implemented Infracost in GKE with a GitLab-CI pipeline, offering real-time cost insights in merge requests for efficient cloud usage.

    • Standardized and refactored Terraform codebase, improving maintainability.

    • Migrated the Terraform stack from Stash to GitLab, enabling collaborative development via GitLab-CI.

    • Reduced cloud spend by implementing cost optimization techniques, including eliminating redundant resources and rightsizing infrastructure.

  • DevOps Engineer, Quandoo

    Oct, 2020 - May, 2021 (7 months)

    • Assisted development teams in migrating applications from AWS EC2 instances to GCP Kubernetes Engine, providing support throughout the transition.

    • Developed a Terraform module utilizing Python, GCP Cloud Functions, Pub/Sub, and Slack webhooks to notify teams of upgrades to GCP Kubernetes clusters.

  • Site Reliability Engineer, Deposit Solutions GmbH

    Feb, 2019 - Jun, 2020 (1 year 5 months)

    • Provided guidance and support for complex issues to tenant development teams working on Java-based environments, ensuring that they had the resources they needed to be successful.

    • Developed a sustainable integration environment defined as code, empowering the team to work independently.

    • Automated DNS management by creating a Python service that scraped application tags, eliminating manual management and potential errors.

    • Assisted in the design and implementation of a new, dockerized environment aimed at introducing microservice culture to the company.

    • Led the design and implementation of an access management project in response to company restructuring, leveraging Active Directory, SaltStack, and Python to automate onboarding and offboarding of users for direct SSH, database, Jenkins, and GitLab access.

  • Site Reliability Engineer, Kreditech Holding SSL GmbH

    Mar, 2017 - Dec, 2018 (1 year 9 months)

    • Stabilized a monolithic build management and CI server by gathering and implementing necessary build runner dependencies as code, reducing toil for the team and allowing them to focus on improving infrastructure.

    • Developed the infrastructure that enabled the company's first A/B release of an application by utilizing Nginx configuration and sticky cookies to route user traffic.

    • Collaborated with developers to design a new deployment infrastructure using internal Debian packages, SaltStack, and Jenkins, empowering development teams to be self-sufficient when deploying, testing, and developing their applications and dependencies.

    • Created a microservice in Python that utilized custom Debian packages to gather package information, allowing QA engineers to easily track application versions across environments.

    • Planned, structured, and implemented new AWS users and roles as code, consolidating user information across multiple accounts.

  • Database Engineer, iTalk Global Communications, Inc

    Oct, 2015 - Nov, 2017 (2 years 2 months)

    • Automated routing and pricing database changes, eliminating human error and ensuring data was always up-to-date.

    • Designed and restructured new database models, resulting in improved reliability and performance.

    • Implemented and managed a datacenter inventory system that was previously undocumented and difficult to manage. This new system provided an accurate overview of datacenter assets, identified redundancy, and reduced unnecessary server costs.

    • Physically installed, managed, and serviced assets in company datacenters.

    • Wrote scripts that generated reports that previously took multiple days to compile manually, ensuring accuracy and punctuality of reporting.

  • Network Operations Analyst, One Source Networks

    Jan, 2015 - Aug, 2015 (7 months)

    • Analyzed and interpreted large data sets for audits, forecasting, and reports, enabling accurate budgeting and adjustments.

    • Developed database jobs, stored procedures, and scripts for automatic report generation, eliminating manual report compilation.

    • Automated customer pricing in the database, ensuring data accuracy for billing.

    • Optimized LCR server routes to improve call traffic flow.

    • Provided support and resolved customer routing and connectivity issues.

  • Information Systems Intern, McIlhenny Company

    Sep, 2013 - Jan, 2014 (4 months)

    • Developed scripts that automated server management tasks, making servers more reliable and sustainable.

    • Researched new software and technologies that could be implemented in the IT infrastructure.

    • Helped design a new database layout that allowed for better data management and database performance.

    • Redesigned internal employment website to be more accurate and reliable, which increased internal traffic substantially.

Skills

  • Python

  • SQL, PL/SQL, and T-SQL

  • Terraform

  • AWS/GCP/Azure

  • CI/CD Tooling

  • GitLab-CI

  • Unix-Based Systems

  • Server Management and Provisioning

  • System Monitoring

  • Docker

  • Kubernetes

  • Helm

  • Flux

Education

  • Informatics, Bachelor, University of Louisiana at Lafayette

    - Dec, 2014