Role overview
*Job TItle: Principal GenAI Systems Engineer
Location: REMOTE
Direct Hire
About the Principal GenAI Systems Engineer Role:**
Principal Generative AI Systems Engineer, you will be responsible for designing, developing, and deploying applications that leverage generative AI models. You will work closely with founders, machine learning engineers (DataML Engineers) and software engineers to develop a specialized and scalable generative AI application interfacing with telemetry-focused AI products. Your role will involve both front-end and back-end development, ensuring seamless functionality and performance consistency for a unique implementation of Retrieval Augmented Generation (RAG). A significant aspect of this role will involve architecting, engineering, implementing and testing system prompting configurations and pipelines, which are essential for unlocking the vast insights of AI products for downstream automated Actions and semi-autonomous Agents.
- Design, configure and optimize the GenAI-tech stack including: LLM, Vector DB, Encoder / Decoder, prompt framework (ex. DSPy) and supporting cloud compute and service resources.
- Design and implement RAG pipelines that enhance generative AI models by integrating external data sources.
- Architect and engineer efficient retrieval systems that can fetch relevant data from databases, knowledge graphs, or external APIs to augment AI-generated responses.
- Develop prompting pipelines that leverage context and retrieved information to generate accurate and contextually relevant responses.
- Collaborate with machine learning engineers to implement advanced techniques such as vector search, semantic search, and embeddings to improve data retrieval accuracy.
- Build and maintain robust pipelines for data retrieval, preprocessing, and integration into the generation process.
- Implement automated testing frameworks to validate the performance of RAG and prompting pipelines.
- Ensure that the retrieval and generation pipelines are scalable, reliable, and maintainable.
- Continuously monitor and refine pipelines to improve efficiency and reduce latency.
- Implement monitoring, logging, and alerting to maintain system health and uptime
- Collaborate with cross-functional teams including UX/UI designers, product managers, and DevOps engineers to deliver high-quality products.
- Collaborate with DataML Engineers, Integration Engineers & GenAI Engineers for customer-specific deployments & configurations
- Write clean, maintainable code and conduct code reviews.
- Document technical architecture, processes, and best practices.
What we're looking for
- Masters degree in Computer Science, Software Engineering or a related field
- Experience with scalable and high-performance application development in a cloud environment (AWS, GCP, Azure)
- Familiarity with technologies and/or data architectures such as: Product Analytics (e.g Pendo, Mixpanel), and Observability systems (e.. Grafana, New Relic, Dynatrace)