Role overview
**AI Engineer**
*Sunnyvale, CA (2-3 Days On-Site)*
Our client is a rapidly growing venture-backed software company pioneering AI innovation for next-generation technology. Their team is dedicated to transforming traditional hardware into intelligent, software-defined platforms deployed at a global scale.
**About the Role**
As an AI Engineer, you will work on breakthrough AI products that redefine how devices interact with software - building, iterating, and deploying solutions that reach millions of users. Embedded in a nimble, high-impact engineering team, you will help drive AI initiatives from proof-of-concept to production, collaborating directly with technical leadership and shaping the trajectory of AI.
**Responsibilities**
* Research, develop, and deploy innovative AI/ML models for next-generation in-device experiences and edge computing.
* Architect, fine-tune, and optimize large language models (LLMs), agentic frameworks, and retrieval-augmented generation (RAG) systems for real-world deployment.
* Deliver production-ready code in Python, leveraging frameworks such as TensorFlow and PyTorch.
* Collaborate closely with the CTO and technical team on greenfield projects - from ideation through scalable implementation.
* Benchmark, evaluate, and iterate on models for performance, efficiency, and reliability in constrained hardware environments.
* Work hands-on across the ML lifecycle, including MLOps, IoT/embedded integration, and inference optimization.
* Remain up to date on the latest research and technologies to rapidly prototype and validate new concepts.
**Qualifications**
* Degree in Mathematics, Physics, Computer Science, or Data Science from a top-tier university.
* 6+ years of progressive experience in AI/ML, with a successful track record shipping production AI systems.
* Expert-level Python programming skills.
* Hands-on experience with TensorFlow or PyTorch.
* Strong foundation in LLM fine-tuning, RAG systems, and agentic frameworks.
* Background in traditional machine learning or data science.
* Experience in startup environments or companies where AI/ML is a core mission.
* Familiarity with compiler technologies (e.g., ONNX, TensorFlow Lite), optimization techniques, and embedded system architectures.
* Willingness to work 2–3 days per week onsite in the Bay Area.
**Preferred Skills**
* Experience leading the development of visionary AI/ML proof-of-concept projects.
* Demonstrated ability to wear multiple hats (coding, MLOps, LLM benchmarking).
* Prior exposure to IoT or related industry.