Role overview
- Design and implement transformer architectures for biological sequence and multimodal data
- Build and scale distributed training pipelines (multi-GPU / multi-node)
- Optimize large-model training (FSDP, DeepSpeed, mixed precision, etc.)
- Deploy models into production research platforms
- Improve inference performance (quantization, distillation, optimization)
- Collaborate with computational biologists and platform engineers
What we're looking for
- 4+ years of hands-on deep learning experience
- Strong expertise with transformers and large-scale model training
- Production experience deploying ML systems
- Advanced proficiency in PyTorch (or similar framework)
- Experience working in high-performance compute environments
Biotech or biological sequence modeling experience is a strong plus, but strong transformer experience from other domains (LLMs, multimodal models, etc.) is also highly valued.
This is an opportunity to work on foundation-style models in biology with real-world scientific impact.
The role pays a salary of up to $400,000 per annum and comes with a comprehensive benefits package. The role is hybrid, with three days a week on-site in SF.
If this role interests you, please apply.