Role overview
We are looking for an experienced DevOps Engineer to implement and optimize our software development methodologies and architecture, and to execute and automate operational processes quickly, accurately, and securely. In this role, you should be able to work independently with little supervision and have excellent organizational and problem-solving skills. If you also have hands-on experience in software development and agile methodologies, we’d like to meet you.
What you'll work on
- Develop, maintain and administer tools that automate operational activities and improve engineering productivity.
- Design and implement procedures for system troubleshooting and maintenance, and perform root cause analysis for production errors.
- (Re)design and improve the infrastructure and application monitoring and alerting stack (e.g. Prometheus, OpsGenie).
- Automate infrastructure management using configuration management and provisioning tools (e.g. Ansible, Terraform).
- Design, implement, support and improve tools the team depends on (e.g. CI/CD, monitoring, data processing, research cluster management).
- Provide continuity and stability to the production environment.
- Respond quickly to production issues to help eliminate downtime.
What we're looking for
- Cloud services: AWS, Alibaba Cloud, Azure, GCP, colocation knowledge
- IaC: Terraform, Ansible
- Monitoring: Prometheus, Grafana, ELK
- Alerting: OpsGenie
- Database: Amazon Aurora (MySQL 5.6 compatible), general database administration, optimization a plus
- CI/CD: CircleCI
- Code version control and management: GitHub