Lead Site Reliability Engineer

Sorted
Manchester, Greater Manchester

Overview

Job Description

At Sorted, we create game-changing delivery management software that sits in online checkouts, warehouses, retailer supply chains and on your smartphone. Our data-driven tech is used by some of the biggest retailers in the UK; we help them serve their global customers, making sure they have delightful delivery experiences. Now is an exciting time for us, as we expand into new markets and take on the world.

Here, you can carve your own career path. Whether you want to stay technical and remain in the tactical detail or move into leadership, we have a path to suit you and allow you to develop constantly.

At Sorted, the Site Reliability Engineering Team is responsible for the design, implementation, and management of Sorted's cloud platform. Sorted primarily host our highly-available online platforms via Microsoft Azure, with a small number of ancillary services hosted in AWS. SRE teams work collaboratively with our architecture and software engineering teams to deliver first-class cloud native software.

Responsibilities

* Identifying the best hosting solutions for new and existing software
* Applying the principles of software development and engineering to infrastructure and operations
* Leading the strategy and direction of hosting and infrastructure engineering, including automation
* Leading the Site Reliability Engineering Team
* Owning Sorted's cloud stacks, including cost optimisation, capacity management, platform efficiencies, security, and maintenance
* Ensuring high platform availability
* Leading the transformation to 100% infrastructure as code
* Determining and implementing solutions for problems of scale, including logging, monitoring, deployments, speed, and fault-tolerance
* Supporting Sorted's transition to a DevOps culture
* Collectively managing Sorted's Technology Radar
* Participating in out-of-hours support of infrastructure and platforms

Requirements

A Lead SRE must have knowledge or experience of the following:

* Cloud platforms, ideally Microsoft Azure, to deliver highly-available platforms, including:
  * Monitoring
  * Provisioning and managing infrastructure
  * Infrastructure as code
  * Scalability
* Cloud and infrastructure security, including applying principles of best practice
* Previous experience designing or architecting complex infrastructure in a cloud-based environment
* Previous experience of leading a team
* Kubernetes
* Docker
* Terraform
* Windows
* Linux
* Software engineering experience
* Scripting such as:
  * PowerShell
  * Bash
  * Python

Benefits

* Very competitive salary
* Remote working
* Pension
* Life assurance
* Ongoing tech training and development plans
* 32 days holiday plus your birthday and bank holidays (combined with flexible working)
* Stunning city centre office space
* Free beer, coffee, breakfast and even massages

If you'd like to hear more about the business and the benefits, just get in touch. Whether it's over the phone or over a coffee, we'd love to speak with you.