Nous Research logo N

NOUS RESEARCH

French artificial intelligence company

About

Architect and implement efficient ML Inference pipelines for large language models. Responsibilities: Design and implement high-performance inference pipelines Optimize model serving for throughput, latency, and cost across different workloads Collaborate with research and product teams to integrate inference into real-world applications Help enhance and manage the deployment pipeline and monitor production clusters Debug production inference issues Stay up-to-date with the latest in inferen...

Updates & Highlights

Recent Openings

View all 2 jobs