Senior Site Reliability Engineer

Gousto,
London, Greater London

Overview

Job Description

Gousto are on a mission to become the UK's favourite way to eat dinner. Delicious recipes, precise ingredients & a dollop of adventure, delivered to your door. Tech, data and artificial intelligence are at the heart of everything we do; every aspect of our business is powered by code and the tech we build. But what else is in our heart? Sustainability. We care about people and we care about the planet, so we reduce food waste and plastic.

Proud moments in 2019:

* We delivered so many million meals that we could feed all of Australia!
* We saved the equivalent of 238 million plastic straws
* We increased choice from 30 to 50 recipes and offered better convenience via next day delivery
* We welcomed 150 new members to the #GoustoGang, 50 of those being in Tech and Data

2019 was big, but together we can make 2020 so much bigger!

Our Tech:

Our engineers have written code to reduce our transport and carbon footprint by optimising box contents to fit in smaller boxes (really, check out our blog!). We are a scale-up business on a steep growth trajectory, and with that comes the need to build a platform that can scale effectively. We use Machine Learning algorithms for everything from the recipe recommendations we show to customers to efficiently routing boxes around our warehouse. Our platform is entirely cloud-based, and many of our backend services are now delivered via AWS Lambda. Our continuous delivery pipeline is fully automated and we release code to production multiple times a day. We're in the process of moving our microservices architecture to containers, and we make heavy use of asynchronous event-driven architecture with data bulkheading. We know we've got lots of things right, but we also have many things still to improve. Right now, we're working on creating robust seed data for testing and flexible developer environments for coding.

What's it like to work here?
* We are customer-centric, data-driven and outcome-focused, using lean principles to work iteratively, experiment and learn fast.
* We work in cross-functional squads, and our squads have a high level of autonomy in deciding both what outcomes they should pursue and how they will achieve them.
* Learning and innovation are core to our working principles. We hold regular brown bag sessions, and 10% of our time is dedicated to innovation and learning.
* We are committed to being involved in the wider tech community, hosting meet-ups, speaking at events and sharing learnings via our blog.

The opportunity:

To support Gousto's growth, we're looking for an experienced and passionate Senior Site Reliability Engineer to help us move our Platform to the next level. You would be joining as the SRE team is being formed, so you will be able to have a massive impact on the shape of the team and how it delivers value to Gousto Tech.

The SRE squad is the latest team in the Foundations tribe at Gousto. The Foundations tribe exists to make sure Gousto sets up its engineers for success. Our SRE team will be an enablement squad focused on improving Platform stability and reliability. It is also worth noting that Gousto has a Platform Engineering team sitting in the Foundations tribe; that team focuses on developer productivity and team scalability.

The SRE team will improve stability and reliability by providing Platform tooling such as monitoring, alerting and testing capabilities, as well as by improving operational knowledge and enabling operational best practices across the development teams. The team will own processes around incident management and encourage a learning culture.

Gousto Tech has a DevOps mentality. Our development squads are responsible for the complete lifecycle of the software they build, including operational responsibilities in Production. At Gousto the SRE team helps enable this.
We are making huge investments in technology, and plan to triple the size of our Tech team to over 200 people in the next few years. As well as providing lots of career growth for our team, there will be plenty of opportunities to lead, mentor, influence and learn.

Core Responsibilities:

* A depth of knowledge of Site Reliability Engineering. You are experienced in enabling platform stability through tooling, operational processes and sharing operational knowledge.
* Ability to influence and introduce change into development teams. This might be introducing new tooling, new processes or ways of working to improve operational stability.
* Excellent communication and presentation skills, whether to the engineering team, to business stakeholders or to our leadership team.
* You are curious and are always looking to learn. You encourage a culture of learning, emphasising the importance of breadth as well as depth of knowledge.
* You deliver rapidly in small batches, reducing risk and creating a fast feedback loop. You have a continuous improvement mindset, constantly seeking to reduce waste and avoid re-work.

Key Stakeholders:

You will work closely with your team as well as having key stake