Big Data AI Engineer, R&D Innovation

GlaxoSmithKline ,
Stevenage, Hertfordshire

Overview

Job Description

Big Data AI Engineer, R&D Innovation Stevenage Site Location Are you excited at the prospect of working on ground-breaking projects that leverage and develop novel, cutting-edge Big Data and AI/ML technologies that have a huge impact on health, wellness, and patient outcomes around the globe? If so, then read on to learn more about an exciting opportunity for you in the GSK's R&D Innovation Team! As a Big Data AI Engineer, R&D Innovation , you will work on the design, development, and implementation of the Enterprise Big Data Platform for GSK using Containers (e.g. Dockers and Kubernetes) and Cloud Technologies (e.g., Azure). You will also utilize Site Reliability Engineering and Infrastructure automation techniques (e.g., Terraform) to create, automate and use AI/ML ecosystems. You will be part of a multi-disciplinary team within GSK's R&D Technology vertical with experience in Entrepreneurship, Data Science and High-Performance Computing. Our members are passionate about technology innovation. Some hold multiple PhDs have worked in Silicon Valley (California) and at the IBM TJ Watson Research Centre (New York) for the US government. We have consistently delivered excellent results on mission-critical problems using scalable expertise and bleeding-edge technologies and methods. GSK leadership is leveraging us to ideate, create, and productionize novel, ground-breaking, high-value solutions for increasingly complex and critical challenges. This role will provide YOU the opportunity to lead key activities to progress YOUR career. Considering your skillset and abilities, your growth opportunities will include some or all of the following : * Designing automated Infrastructures that creates new auto-healing capabilities * Creation and integration of storage technology and DFS independency into the solution landscape * Data Pipeline development leveraging DevOps standards * Use of Continuous Integration (CI) and Continuous Deployment (CD) to build Data Engines * Creation of secure and private anonymization data systems using declarative programming languages that will interface between Data Silos, Data Engines and Graph Databases. These systems are fundamental for executing AI/ML workflows to accelerate drug discovery and to optimize the manufacturing processes * Creation of holistic (e.g. integrated) data views through the ingestion, cleaning, linking, harmonization and contextualization of multiple systems. These views will enable our AI/ML work on complex high-value, multi-root cause problems * Active involvement in all stages of the project lifecycle - from ideation to industrialization - in an Agile development environment. You will discover and develop new promising technologies in a collaborative way, create Proof-of-Concepts (POCs), Proof-of-Values (POVs) and Minimal-Viable Products (MVPs). Whatever we design and prototype, we make it scalable, flexible and robust. Our projects do not sit on the shelf! They are Industrialized by us and later handed-over to the R&D Support teams to drive their further adoption across GSK Why you? Basic Qualifications: We are looking for professionals with these required skills to achieve our goals: * Bachelor's Degree - Engineering, Mathematics, Statistics, or Computer Science * Extensive experience as a full-time software engineer * Expertise with Data Engineering or Site Reliability Engineering * Expertise with non-imperative paradigms - Scala, Haskell, F#, Typescript or OPA Rego * Experience working on Big Data platforms, preferably Spark * Experience deploying solutions on Cloud Platforms, preferably Azure or GCP * Infrastructure-as-Code experience: Terraform, Ansible or Cloud templates (Azure, GCP) * Expertise with container technologies: Kubernetes, Helm or Docker * Professional DevOps experience: Jenkins, Azure DevOps, CI/CD or Junit * Ability to design and implement logging, tracing, and application monitoring systems * Experience building and maintaining APIs Additional Qualifications: If you have the following characteristics, it would be a plus: * Streaming data experience with technologies like Apache Kafka * Cryptography / Cyber Securityexperience * Experience operating in a highly regulated and secure environment Why GSK? Our values and expectations are at the heart of everything we do and form an important part of our culture. These include Patient focus, Transparency, Respect, Integrity along with Courage, Accountability, Development, and Teamwork. As GSK focuses on our values and expectations and a culture of innovation, performance, and trust, the successful candidate will demonstrate the following capabilities: Operating at pace and Agile decision-making Using of evidence and applying judgement to balance pace, rigour and risk Committed to delivering high quality results, overcoming challenges, focusing on what matters, execution Continuously looking for opportunities to learn, build skills and share learning Sustaining energy and well-being Building s