Senior Site Reliability Engineer

Cookpad Ltd,
Bristol, Bristol

Overview

Job Description

Cookpad is looking for engineers to join our Site Reliability Engineering team. Site Reliability Engineers are a hybrid between system engineers and software engineers who take ownership of reliability, automation, and scalability. You will focus on the systems and tools that enable our engineers to operate and scale the largest recipe sharing community in the world.

About Cookpad

Cookpad is a tech company building a community platform that enables people to share recipe ideas and cooking tips. It's a global platform used by around 100 million people every month on average. Over 4 million recipes have been created by people in almost 70 countries. And we're only just getting started.

Our mission is to make everyday cooking fun, because we believe that cooking is the key to a happier and healthier life for people, communities and the planet.

Our heritage is unique: Cookpad was founded in Japan in 1997 and is a listed company in Tokyo. We set up our international HQ in the UK and here we're a start-up, building the global platform and working with our colleagues around the world.

Cookpad is growing at speed and we're looking for exceptional people who make things happen and create solutions on the scale we're looking for. It's not for everyone. Because of our speed and growth, it can feel chaotic. It's an environment where creators and problem-solvers flourish. Expect talented, motivated, intense, and interesting colleagues. It feels like a start-up with global ambition. We work in small, collaborative teams and in a creative, fast-paced environment.

The role is based in Bristol in the UK - one of the most exciting emerging tech hubs in Europe, a city on National Geographic's Cool List 2018 and the best place to live in the UK.

www.cookpad.com

What's it like to work at Cookpad? Visit www.cookpadteam.com to find out more.
What you will do:

As a Site Reliability Engineer, you will build high-performance, scalable systems with AWS and software. You will also work closely with engineers to advocate sensible, scalable systems design and share responsibility with them in diagnosing, resolving, and preventing production issues. In the case of incidents, you will triage, mitigate and solve them with product team engineers.

* Build highly available, performant and scalable service infrastructure with AWS
* Design, develop and implement software that improves the stability, scalability, availability and latency of Cookpad
* Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again
* Participate in the operations on-call rotation, triaging and addressing production issues as they arise
* Contribute to internal tools that help us improve our operations processes, manage our infrastructure, and scale our systems
* Engage with product engineering teams to triage production outages and carry forward action items to improve ongoing reliability
* Undertake measured, methodical troubleshooting of complicated systems under pressure

Requirements

This is a senior level role and we are looking for the following skills and experience:

* Extensive Site Reliability Engineer/DevOps experience in a Linux-based AWS environment
* Understanding of software development and coding experience in any major language
* Willingness to learn Ruby on Rails
* Strong written communication skills in English and the ability to develop working relationships with coworkers in locations around the globe
* Understanding of the fundamentals of the TCP/IP (OSI) model and network architectures
* Strong coding skills in at least one programming language. Cookpad server-side engineers work primarily in Ruby, with smatterings of shell script, Go, and Python
* Familiarity with configuration management software such as Puppet and Chef
* A passion for solving problems using open source software

Preferred skills

* Solid foundation in deploying and managing Linux systems at large scale
* Understanding of large-scale complex systems from a reliability perspective
* Solid competency with SQL (ideally in a federated database environment; MySQL a plus)
* Contributions to open source
* Deep network analysis experience is a plus
* Strong Linux system-level analysis capabilities (Ubuntu a plus)
* Knowledge and experience of highly available, scalable architectures for services deployed across multiple regions is a big plus

Benefits

Ask everyone who works at Cookpad what we love about working here and the answer is: being part of the mission and our colleagues. Expect to work with people who are passionate about and committed to the mission: we challenge ourselves and each other to do the best work we can. The benefits we offer are based on how we can best support your personal and professional well-being. We offer competitive salaries and all the things you'd expect - from employer pension contribution to private medical insurance; from time off for voluntary wor