**Description
ABOUT NORSTELLA**
Norstella is a group of prominent pharmaceutical solutions providers – Evaluate, MMIT, Panalgo, The Dedham Group, Citeline – that help clients navigate complexities at each step of the drug development life cycle, from pipeline to patient. For more information, please visit Norstella.com.
THE TEAM
Our dedicated Data Science team is at the forefront of revolutionizing pharma intelligence and how
patients gain access to life-saving therapies. Armed with cutting-edge technology and a passion for
innovation, we leverage the vast landscape of data to extract actionable insights that drive informed
decision making. Our unique collaborative approach fosters a dynamic synergy between data science, product development, pharmaceutical industry experts, and engineering. Our deep expertise in artificial intelligence, machine learning, and advanced statistical modelling, combined with our domain knowledge, enables us to deliver comprehensive solutions that empower our clients to stay ahead in a rapidly evolving industry. We have delivered multiple products live to customers, with many more in development.
SCOPE OF THE ROLE
In this role as a Senior Data Scientist, you will:
- Collaborate with product partners and others to identify new opportunities to apply AI / ML to
our content and products
- Conduct research and identify AI / ML algorithms and methods to solve specific business problems
- Implement rigorous testing and evaluation of potential solutions; review indicative results and
proof-of-concepts with multi-disciplinary teams as part of customer-led product development
- Deploy your solutions as production-grade microservices in collaboration with engineering and
DevOps teams
- Contribute towards common data science platforms and data assets
- Stay up-to-date, constantly learning about advances in the field, and deliver periodic
presentations to internal teams on these developments
As a senior-level scientist, you will primarily be responsible for one major release at a time with a high degree of individual ownership. While not all research leads to successful deployed systems, past
successes have seen a general availability launch typically ~6 months from project start. Our solutionshave used a wide range of approaches, including: agentic systems and model context protocol (MCP) servers, retrieval augmented generation (RAG), classical machine learning, knowledge graphs, and simply well-designed data transformations and business logic – our focus is on solving problems rather than the technology used.
WHAT IT TAKES
Requirements:
- Graduate degree in a STEM field such as Computer Science, Engineering, Statistics, or equivalent
practical experience
- 5+ years of experience developing AI / ML applications and data driven solutions
- Excellent knowledge of Python and core data science libraries such as pandas, scikit-learn,
LangChain, etc.
- Experience with LLMs, e.g. foundation model APIs, prompt engineering, retrieval augmented
generation
- Substantial depth and breadth across state-of-the-art AI / ML techniques such as Generative AI,
NLP, Deep Learning, etc.
- Understanding of CS fundamentals, computational complexity, and algorithm design
- Ability to engineer and deploy well-architected software packages
- Strong communication and stakeholder management skills, ability to independently own and
execute a project holistically at a high level
- Experience mentoring junior team members
Preferred Qualifications:
- Deep expertise in engineering agentic AI systems
- Knowledge of the healthcare / pharma domain and experience applying AI to healthcare data
- Experience with AWS, especially ECS, Bedrock, SageMaker, serverless compute and storage
- Ability to prototype PoC webapps with familiarity across the full stack
- Expert usage of AI coding tools and workflows