Role overview
- Build frontier image/video models and the VLM captioning systems that power them
- Join a lean, senior team that holds a high engineering + research bar
- Direct product impact: your training runs become real user-facing capabilities
- Benchmark against the best in the world and compete on model quality through what we ship
What we're looking for
- Hands-on experience with diffusion/flow-based image or video generation, or large-scale generative modeling in adjacent domains
- Experience with distributed training at scale (multi-node) and performance tuning (throughput/memory)
- Experience building evaluation frameworks (offline metrics + human eval + regression tracking)
- Strong intuition for data quality and dataset/labeling tradeoffs for training and captioning
- Publications are a plus, but shipped impact and strong technical evidence matter more
- Visa sponsorship (where applicable); we’ll make a strong effort to relocate you to Switzerland or Poland if desired
- Remote-friendly: work fully remote, hybrid, or on-site from our hubs
- Regular offsites and in-person events to collaborate and connect
- Flexible PTO
Tags & focus areas
Used for matching and alerts on DevFound Fulltime Remote Ai Generative Ai