Senior NLP / Computational Linguist Engineer (Information Extraction)

Causaly ,
London, Greater London

Overview

Job Description

Snapshot This is a rare opportunity for people who want to solve complex problems and be part of the development of a cutting edge, transformative knowledge product. Causaly is a VC backed startup working on a technology for Biomedical Cause & Effect Discovery and Epidemiology data extraction, empowering researchers and decision makers to quickly find causal evidence and generate insights from vast amounts of documents. Having already processed tens of millions of academic publications (and reading more than a hundred thousand new papers every month), our machine-reading platform turns free-flow text into causal knowledge graphs and descriptive data structures and applies machine learning to surface new knowledge. We are looking for engineers who will take the development of our NLP pipelines to the next level. This specific opening implies work with predominantly rule based systems and initial acquisition of the existing codebase of an information extraction module(s) to get ownership quickly. The role Were looking for independent and self-confident individuals who enjoy solving complex NLP problems, get motivated by seeing their works direct impact on growing customer base and want to apply their strategic mind to ground-breaking AI technology. Youll be working closely with the CTO and Product Owner, joining a close-knit team of highly committed and bright people. You are a resilient engineer, experienced in information extraction and thrive in a fast-paced complex environment. Responsibilities * Work predominantly with rule-based systems * Understand solutions aimed at the development of our NLP pipelines using all aspects of information extraction, text processing and text understanding including design, architecture, algorithms, correctness, and performance * Be able to inherit and develop further an existing codebase for particular information extraction modules * Work with a Computational Linguist Product Manager, iteratively engineering and implementing algorithm and linguistic rule improvements. The role requires working with large volumes of data to identify patterns for language modeling and to be able to efficiently evaluate outcomes of each such iteration before passing it for QA. * Strong focus on quality and performance based design * Staying up to date with the state of the art in information extraction and general NLP methods and techniques Requirements Minimum Qualifications: * Ph.D in NLP or Computational Linguistics or equivalent experience (MSc with 4+ years of research experience) * Strong programming/software engineering background enabling rapid codebase acquisition and scalable development * Fluency in fundamental NLP pipeline algorithms and tools (e.g. preprocessing and normalization, POS, dependency parsing, NER) * Experience with common NLP and Machine Learning libraries (e.g. nltk, spacy, scikit-learn) * Fluency with at least one of the modern distributed ML frameworks (e.g. TensorFlow, PyTorch) * Experience in dealing with a vast amount of textual data (optimization, performance enhancement) * Ability to drive technical projects autonomously and work in a diverse and collaborative environment * Proficiency in Python or similar scripting language Preferred Qualifications: * Empirical research experience in natural language processing and machine learning ideally in the biomedical domain with a focus on information extraction Benefits * Competitive Salary * Share option, were all in this together and you too shall own a small part of the company * Health care insurance * Pension contribution * Individual training budget for professional development * Plenty of opportunity to take on more responsibility as we grow * Be part of a multinational, diverse and exceptional early team that builds a transformative knowledge product with the potential to have real impact * Regular team outings * Annual team retreat to secret destination * Easily accessible office in the heart of Angel, Islington (right by the station) * Free coffee and tea (because you cant be a startup without caffeine) Causaly welcomes applications from all backgrounds. We are committed to diversity regardless of gender, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation or gender identity. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.