AI Infrastructure Engineer

Intellectt Inc

Actively hiring Posted 27 days ago

Role overview

**Job Title:** AI Infrastructure Engineer

**Location:** Remote, USA

**Duration:** 6 Months

**Overview:**

Seeking an AI Infrastructure Engineer to manage and optimize AI systems and tools, ensuring smooth operations and high performance. The role focuses on infrastructure maintenance, AI operations, and team training.

**Key Responsibilities:**

* Manage and optimize AI infrastructure for performance and reliability.
* Use tools like **NVIDIA Mission Control** and **Run:AI** for operations.
* Support AI workloads across teams and ensure efficient resource use.
* Train and mentor team members on infrastructure tools and best practices.
* Monitor systems, troubleshoot issues, and document configurations.

**Required Skills:**

* Hands-on experience with **AI infrastructure** and **HPC environments**.
* Proficiency in **Linux (Ubuntu/RHEL)**, **Run:AI**, and **NVIDIA Mission Control**.
* Knowledge of **cloud** and **on-premises** systems.
* Strong problem-solving and communication skills.

**Preferred:**

* Experience with **Docker/Kubernetes**, **TensorFlow/PyTorch**, **SLURM**, and AI storage/network solutions.

Tags & focus areas

AI, PyTorch, TensorFlow, Full-time