Principal Site Reliability Engineer

Finastra ,
London, Greater London

Overview

Job Description

This role requires a 'can do' attitude; an individual with a passion for Site Reliability Engineering (SRE). Suitable candidates must thrive on the challenges of working in a fast-paced environment and who can help us to release outstanding software. PRIMARY RESPONSIBILITIES Lead Site Reliability Process and Technical Management of Finastra's Cloud Native Platforms Design authority for Finastra SRE Patterns & Practices Lead for Change Management of Finastra SRE transformation program Implementation of SRE Practices for Finastra Cloud based Financial Services Conducting reviews of achievable SLA of existing production deployments and identifying key areas of improvement Preparing and taking part in Production Weekly Operating Reviews Taking part in on-call rotation to understand the patterns of Production failures (capped at 25%) Conducting Blameless Post Mortems of high severity Production Incidents Implementing resiliency improvements (Software or System) Reviewing designs to improve resiliency and ensuring delivered code fulfills needs. QUALIFICATIONS & EXPIERIENCE A bachelor or master degree in IT (preferable computer science) 10+ years of experience in software development Experience with object-oriented programming (e.g. Java or equivalent) Solution design and deployment of resilient, HA, Highly Scalable & DR architecture Experience implementing SRE standards for Resiliency and Scalability of Java/Node.js based microservices in Cloud Experience implementing SLIs, SLOs and Error Budgets as part of development/delivery practices MANDATORY SKILLS Experience leading Failure Mode Analysis of Architectures Experience leading Root Cause analysis of Incidents using Incident Post Mortems Working knowledge of Cloud IaaS & PaaS Platforms - preferably Microsoft Azure Experience designing, deploying and managing container orchestration using Kubernetes Experience monitoring container based microservices & Cloud Platform services Use of Continuous Delivery tools - preferably Azure DevOps Knowing agile methodology and being able to work by its principals Fluent English skills ***** The above statements describe the general nature and level of work being performed by people assigned to this job. They are not intended to be an exhaustive list of all responsibilities, duties, and skills required.Reasonable accommodations may be made to enable qualified individuals with disabilities to perform the essential job functions. If you need assistance or an accommodation due to disability please contact your recruitment partner. *****