Lead Platform Reliability Engineer,
Jul, 2021 - Jan, 209977 years 6 months
Once the PRE team started hiring new members, I was promoted to Lead Platform Reliability Engineer.
Unlike my previous roles at Gymshark, this one heavily involved enhancing and enabling the rest of the engineering teams to do the work, as well as parachuting in to do any glue work where necessary.
I got to work a lot on understanding the business objectives and translating them into technical ideas, as well as presenting these to the team. There are times where a technical decision may not make sense without the business objectives, such as the decision to refocus work. I was able to hone my ability to understand these decisions, and work with my team to ensure everyone understood what the business wanted and its goals. I also spent a lot of time working on identifying business value and honing my ability to discuss highly technical topics at a business level, providing a clear risk/reward analysis along with structured objectives and outcomes, and detailing value and how/when it will be realised.
Another part of this role was providing technical architecture guidance to all of the engineering teams in Gymshark, including my own. I have been involved in multiple cross-team projects spanning from simple builds to entire ecomerce site rebuilds and have been able to provide multiple aveneues of investigation for the teams to persue and to give them the knowledge and tools necessary to do so. All of this work leads to individual engineering teams to be able to make highly informed decisions on the best tech/approach for the task at hand, and to give steakholders the confidence that we have thought about as much as we can and identified any potential blockers or issues that might come up.
I also worked heavily with out engineering leads team to help influence change and encourage a DevOps culture and midset across engineering. One of my big wins in this was identify the ownership of technical areas and how these ares are influenced by business decisions. To help with this, I came up with a Policies, Standards, & Guidelines structure that helped convert business Policies (such as GDPR, data retention, logging & alerting, and infrastructure identification/tagging), to more engineer focused Standards and Guidelines. In this approach, Policies are the 'What We Have To Do', Standards are the 'How We Will Do It', and guidelines are the 'How Can We Make It Better'.
Worked with our Shop team to deploy and optimise a rebuild of the entire Gymshark ecom website into both Kuberntes and Serverless, providing guidance and solutions to various challenges along the way, with both infrastructures able to handle well over 100,000 requests per second, with almost no warm-up period
Restructured our AWS authentication setup to use the company-wide SSO through Okta in a fully automated way
Reorganised our AWS infrastructure and accounts under a single organisation with full auditing and logging capabilities
Created an automated sandbox provisioning setup to enable engineers to spin up their own sandbox accounts for proof of concepts and reserach
Hired more engineers for my team and supported them at various levels of progression
Worked with hiring juniors and ensuring they get the mentoring and training they need
Heavily refined my own ability to translate highly-technical highly complex concepts and ideas into business compatible values and objectives
Worked on business level strategy for the PRE team and projects we take on
Came up with our Policies/Standards/Guidelines document structure to provide both technical and non-technical overarching goals and general guidance that teams need to and can follow to meet business needs