Role overview
Job Title: AI ML Infrastructure Engineer
Job Category: Information Technology
Time Type: Full time
Minimum Clearance Required to Start: None
Employee Type: Regular
Percentage of Travel Required: None
Type of Travel: None
Anticipated Posting End: 5/31/2026
The Opportunity:
Join our innovative AI Research and Development team as an AI/ML Infrastructure Engineer. In this role, you'll collaborate with world-class scientists and engineers to design, deploy, and maintain a state-of-the-art Linux-based infrastructure. You'll be at the cutting edge of machine learning and artificial intelligence, contributing to projects in computer vision, natural language processing, large language models, and more. Be a part of shaping the future of AI in the defense industry!
What you'll work on
- Design, procure, build, implement, and maintain on-prem servers, workstations, and software tooling across multiple development environments.
- Implement infrastructure components to support distributed compute, storage, and dataset management.
- Ensure system compliance with security requirements, including applying system updates and developing a System Security Plan (SSP).
- Manage access and integration with commercial cloud providers such as AWS.
- Design, implement, and maintain hybrid cloud architectures connecting on-premises environments with cloud infrastructure.
- Develop and support cross-domain solutions for secure data transfer between networks of varying classification levels.
- Configure and manage networking components (VPNs, Direct Connect, gateways, firewalls) to support hybrid connectivity.
- Collaborate with cybersecurity, infrastructure, and application teams to ensure secure, scalable, and resilient cloud integrations.
- Monitor and optimize cloud connectivity performance, availability, and cost efficiency.
- Support accreditation and compliance activities related to cloud and cross-domain architectures.
- Coordinate with data scientists and software engineers to understand and plan infrastructure requirements.
- Support other hardware, such as edge devices, as required by projects/customers.
What we're looking for
Required:
Desired:
- Experience managing a GPU-enabled compute cluster.
- Experience managing a FIPS-enabled Linux environment.
- Experience with high availability data solutions such as Ceph.
- Experience with cross-domain/hybrid solutions.
- Experience with DevSecOps tooling, such as CI/CD pipelines and logging and monitoring systems.
- Experience with virtualization, including VMware or KVM.
- Understanding of AI/ML concepts and the AI/ML development lifecycle.
- Security+ Certification.
- Experience working in environments with high regulatory and compliance requirements.
- Active and current Top Secret or TS-SCI clearance.
- Experience working with hardware in classified environments.
- Experience working on proposals.
_________________________________________________________________________
What You Can Expect:
A culture of integrity.
At CACI, we place character and innovation at the center of everything we do. As a valued team member, you’ll be part of a high-performing group dedicated to our customers’ missions and driven by a higher purpose: ensuring the safety of our nation.