Role overview
Posted Date
3/10/2026
Description
We are seeking a highly skilled and motivated Machine Learning Engineer to join our technology team. The ideal candidate will be responsible for designing, developing, and deploying cutting-edge applications that leverage Generative AI and Large Language Models (LLMs). This role involves architecting end-to-end platform services with a focus on scalability, reliability, and performance. You will be a key player in driving the implementation of new AI-driven systems and programs that will impact all business lines at Citi globally.
What you'll work on
- Architect and Build AI Solutions: Design end-to-end platform capabilities and services that integrate with Generative AI technologies.
- Implement RAG Solutions: Develop and deploy solutions using VectorRAG and GraphRAG techniques with both open-source and commercial Large Language Models.
- Develop AI Workflows: Apply a strong understanding of Agentic AI workflows and frameworks to build sophisticated, automated systems.
- Drive DevOps Best Practices: Implement robust CI/CD pipelines, automated testing, and containerization (Docker, Kubernetes) to streamline the development and deployment of machine learning models.
- End-to-End Project Ownership: Manage your deliverables from conception to deployment, proactively identifying and mitigating risks to ensure project goals are met.
- Rapid Prototyping: Lead and carry out quick proof-of-concept (POC) projects to evaluate and demonstrate the feasibility of new GenAI use cases.
- Ensure System Integrity: Appropriately assess and manage risk in all technical and business decisions, ensuring compliance with applicable laws, rules, and regulations, and maintaining transparency in reporting and managing control issues.
- Collaborate and Innovate: Work closely with cross-functional teams to solve complex problems, adapt quickly to changing priorities, and contribute to a culture of innovation.
What we're looking for
AI Frameworks, Generative AI, Large Language Model (LLM) Fine-Tuning, ML Frameworks, Python (Programming Language).