Role overview
Role Overview: We are seeking a DevOps Engineer with familiarity in Python and machine learning to enhance our infrastructure and automation processes. The ideal candidate will have hands-on experience with Terraform, Ansible, and monitoring tools to support our infrastructure, deployment pipelines, and machine learning operations.
What you'll work on
- Infrastructure Management: Design, deploy, and manage scalable infrastructure using Terraform and cloud platforms (e.g., AWS, Azure, GCP). Automate infrastructure provisioning and configuration with Ansible.
- CI/CD Pipelines: Develop and maintain CI/CD pipelines to support code integration, testing, and deployment. Implement best practices for version control and automated
- Automation: Use Python for scripting and automating tasks related to infrastructure, deployment, and Create and maintain automation tools to improve system efficiency and reliability.
- Machine Learning Integration: Work with data science teams to deploy and manage machine learning Optimize and monitor ML model performance in production environments.
- Monitoring and Troubleshooting: Implement and maintain monitoring solutions to ensure system health and performance. Use monitoring tools to quickly identify and resolve issues related to infrastructure and applications.
- Documentation: Maintain clear and comprehensive documentation for infrastructure, deployment processes, and operational procedures.
What we're looking for
- Bachelor’s Degree in Computer Science, Engineering, or a related
- Certifications: Relevant certifications in cloud platforms (e.g., AWS Certified DevOps Engineer) and Kubernetes.
- Experience with ML Ops: Understanding of ML Ops practices and tools for managing the lifecycle of machine learning models.