Site Reliability Engineer

GDS ,
London, Greater London

Overview

Job Description

Salary: 52,500 - 70,000 (London) 47,000 - 61,040 (National) The base salary of this grade is 52,500 (London) / 47,000 (National). Any offer made above this will be made up with a specialist pay allowance For more information please visit our things you need to know page. Contract type: Permanent Grade: 7 Number of open roles: 3 Hours: 37 per week (excluding lunch) Working pattern: flexible working, full-time, part-time, job share Location: Bristol, London, Manchester Closing date for applications: Rolling Campaign, please apply as soon as possible Who we are The Government Digital Service (GDS) is part of the Cabinet Office. We lead the digital transformation of the UK government so that it works better for everyone. Our work continues to be user-focused, dynamic and forward-looking, making our organisation an exciting and innovative place to work. Find out more at the GDS Blog. Our reliability engineering team Helps GDS product teams deliver secure, reliable, and resilient products faster, in a consistent way and at a lower cost. As part of Reliability Engineering you will work in various mission teams providing operational tools, platforms, guidance, and support so product teams can easily integrate them into their services and build reliable services that are secure, don't break when they scale, are supported and able to recover quickly from disasters. If you're committed to improving user experience, and you want to help us make government better, please get in touch! What you'll do We're looking for people with good interpersonal skills who enjoy working in a delivery focused, agile environment. As a Site Reliability Engineer in GDS you'll: * have and apply broad knowledge of core web technologies * take responsibility for solving complex issues * automate tasks, deployments, and tests by creating infrastructure as code, taking responsibility for the quality of code you produce * implement resilient, highly available systems * share knowledge of tools and techniques with your wider team * act as a digital ambassador, supporting recruitment, identifying good practices for GDS to adopt and sharing experiences, eg through blog posts, tech talks at conferences etc * participate in our in-house (2nd line) support, and the out-of-hours support rota - you'll be paid an allowance, and a further hourly payment, for any duties you perform when on call * share knowledge among the GDS teams, ensuring that your team is understood by others and understanding the working of the wider organisation As a Senior Site Reliability Engineer you'll also: * provide technical leadership within the team, advising and working with Reliability Engineers and product teams to identify the best solutions Who you are We're interested in people who: * are experienced with UNIX-like operating systems and technologies used for web applications, e.g Linux, databases, backups, CDNs * can demonstrate a working familiarity with at least one programming language such as Ruby, Java, Python, Javascript, Go * are experienced with AWS and the use of orchestration tools such as Terraform, Cloud Foundry, Kubernetes * understand software design principles * take a systematic approach to solving problems * use testing to validate solutions * understand agile environments and version control * are familiar with web security * understand network protocols, eg HTTPS, TLS etc * have familiarity with working practices such as test driven development, continuous integration and continuous delivery Senior Site Reliability Engineers will also have experience of: * leading teams and projects, line management, helping colleagues with their career development and coaching more junior staff members What we value Respect, collaboration and trust are at the core of our culture . We trust each other to do our best work. We believe in our mission and work for the whole population. We can only do that by being an inclusive and diverse organisation. How you'll be assessed In the Civil Service, we use our Success Profiles. This gives us the best possible chance of finding the right person for the job, drives up performance and improves diversity and inclusivity. When reviewing your application we will be using the experience element of the framework to assess your career history and achievements. We're looking for examples of things you have previously achieved or your knowledge in a particular field which are relevant to the role. If you're shortlisted for an interview we'll use different elements of the success profiles framework. We'll be considering your ability, experience, technical/specialist skill and behaviours for this role. The following behaviours are the most relevant: * working together * changing and improving * making effective decisions Your application during Coronavirus (COVID-19) During these unprecedented times we remain committed to supporting citizens, enabling them to access government information and services they need. We're closely monitorin