Role overview
We are seeking a talented LLM Engineer to join our customer's team in Washington, DC. This is an opportunity to work on cutting-edge AI/ML projects that combine Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and full-stack development. If you are passionate about building innovative solutions and thrive in a collaborative on-site environment, we want to hear from you.
What you'll work on
- Design, develop, and maintain applications utilizing LLMs and RAG methodologies.
- Implement and scale AI/ML models using frameworks like TensorFlow and PyTorch.
- Build and maintain robust front-end interfaces with React and scalable back-end systems with Node.js.
- Collaborate closely with cross-functional teams to translate business requirements into technical solutions.
- Deploy and manage applications on AWS, ensuring reliability, security, and scalability.
- Continuously research and integrate the latest technologies in AI/ML and cloud computing.
- Communicate complex technical concepts clearly, both in writing and verbally, to diverse stakeholders.
What we're looking for
- Experience with AI/ML frameworks beyond TensorFlow and PyTorch, and with cloud-native technologies.
- Background in Retrieval-Augmented Generation (RAG) architectures and prompt engineering.
- Track record of delivering production-grade AI/ML solutions in a full-stack environment.