Senior Site Reliability Engineer

Vortexa ,
London, Greater London

Overview

Job Description

About Vortexa Vortexa was founded to solve the immense information gap that exists in the energy industry. By using massive amounts of new satellite data and pioneering work in artificial intelligence, Vortexa creates an unprecedented view on the global flows of oil and fuels in real-time, the energy markets and society as a whole, thus enabling the society to use the natural resources of our planet to the benefit of all. The Challenge Ingesting data from multiple external vastly different sources at hundreds of rich data points per second, moving terabytes of data while processing it in real time, running complex and complicated prediction and forecasting AI models while coupling their output into a hybrid human-machine data refinement process and presenting the result through a nimble low-latency SaaS solution used by customers around the globe is no small feat of science and engineering. This processing requires a highly reliable, stable, fault-tolerant infrastructure that can withstand multiple and varied uses and abuses by data analysts, data scientists, industry experts and the end users. Data Services Team is responsible for the fabric, the foundation, the impressive Amazon AWS estate Vortexa uses to achieve its mission. We use a wide variety of technologies like Java/Scala/Kotlin, Rust, Python, Jupyter, Airflow, Kafka and Spark, as well as AWS services like MSK, EKS, RDS, Elasticsearch, Athena and others. You will be a key member of a team which is instrumental in assessing current technologies, re-defining technology stacks and implementing the infrastructure and production pipeline roadmap, while ensuring 100% uptime, availability and fault-tolerance of every software and hardware component of the platform in a cost-effective way. You will also be helping other teams optimise their use of the key technologies to their full potential. You will look after multiple mature CI/CD pipelines and ensure they grow and adapt with the infrastructure. You will participate in the on-call rota and act as the first responder for critical production issues to coordinate and in some cases fix firsthand by modifying and deploying Vortexa software on a temporary or permanent basis. Your seniority will enable you to effortlessly represent the team at stakeholder decision making events. Your customers will be data analysts, data scientists and other engineers and industry experts from across the company. Your main key performance indicator will be the uptime of all Vortexa infrastructure and processes. Requirements You Are: * An engineer by education, by calling and by choice * Calm, analytical and methodical troubleshooter * An AWS power user and evangelist, intimately familiar with the ecosystem, including IAM, Cloudwatch, EKS, Kafka, ES, RDS, MSK and others * Comfortable with and deeply understand Kubernetes and EKS * Proficient with Apache Kafka and Kafka Streams application deployment, monitoring, resiliency, fault tolerance, cluster planning and operations, applications troubleshooting * Sufficiently familiar with Java or other JVM languages, as well as Python to troubleshoot and deploy minor updates * Fluent in Terraform * Not afraid of challenges and bold yet calculated and safe infrastructure updates * A can-do person Ive done my bit has no place at Vortexa Awesome If You: * Have some relevant AWS certifications * Can code in Java or other JVM languages like Kotlin * Can write some good Python * Understand data lakes like Parquet, Orc, Athena * Worked with Pandas Dataframe, Spark and alike We Are: * A vibrant, diverse company pushing ourselves and the technology to deliver beyond the cutting edge * A team of motivated characters and top minds striving to be the best at what we do at all times * Constantly learning and exploring new tools and technologies * Acting as company owners, which all of us are in a business-savvy and responsible way * Enjoying a friendly working environment * Motivated by being collaborative, working and achieving together * Headquartered in Aldgate, a really cool part of London in the City near Shoreditch. Were on the 8th floor in a big open space, with weekly drinks & pizza and usual startup perks. We also have offices in the US and Singapore. * Not only teammates but friends, often finishing the week enjoying a glass of a favourite drink and a game of 3D Connect 4 together Benefits In return we offer... * An open, collaborative and supportive working culture built on merit, which celebrates diversity of thought, creative thinking and getting things done * A competitive remuneration package * Private health care * The opportunity to work with AI driven technology in a start-up environment with industry experts in commodity trading * Equity - we're all acting as business-savvy and responsible company owners, whilst having fun and enjoying the journey together along the way * Continuous training and development opportunities * Work Perks - discounted travel and cinema tickets, shoppin

Get a Free CV Review

Let the professionals help you find a job.

Learn More

Senior Site Reliability Engineer

Overview

Job Description

People also viewed

Get a Free CV Review

Related Jobs

Get a Free CV Review