Role overview
All your information will be kept confidential according to EEO guidelines.
What we're looking for
- Bachelor’s or master’s degree in computer science, Engineering, or related field.
- 6+ years of experience in software engineering, ML engineering, or platform engineering.
- Strong proficiency in writing production-grade Python, and experience with Claude Code or Cursor.
- Hands-on experience with LLM-based systems, including:
- LangChain / LangGraph
- MCP
- Langsmith
- Claude or comparable frontier models
- AWS AgentCore or comparable agentic frameworks
- Solid understanding of RAG architectures, embeddings, and vector search.
- Experience designing and consuming APIs (REST and/or async/event-driven).
- Strong cloud engineering experience on AWS.
- Knowledge of how to fine-tune frontier models to specific domain knowledge
- Experience with distillation, quantization and small language models is a plus
- Experience deploying traditional machine learning models into production environments using MLOps tools and best practices.
- Knowledge of distributed systems, large-scale model optimization, and API development.
- Exceptional ability to work on a team – especially a dynamic, innovative “tiger team” developing early stage PoC systems.
- Strong understanding of container orchestration and cloud-native application design.
- Ability to work in dynamic environments, handling rapid experimentation and iterative development.
Tags & focus areas
Used for matching and alerts on DevFound Fulltime Machine Learning Ai