Role overview
Consensys is the leading blockchain and web3 software company founded by Joe Lubin, CEO of Consensys and Co-Founder of Ethereum. Since 2014, Consensys has been at the forefront of innovation, pioneering technological developments within the web3 ecosystem.
Through our product suite, including the MetaMask platform, Infura, Linea, Diligence, and our NFT toolkit Phosphor, we have become the trusted collaborator for users, creators, and developers on their path to build and belong in the world they want to see.
Whether building a dapp, an NFT collection, a portfolio, or a better future, the instinct to build is universal. Consensys inspires and champions the builder instinct in everyone by making web3 universally easy to use and develop on.
What you'll work on
- Architect, build, and maintain AWS cloud infrastructure supporting Linea Mainnet/L2 nodes and supporting services.
- Design and optimize Kubernetes clusters for scaling, resiliency, and healthy node operations, leveraging Karpenter autoscaling and spot instance capabilities.
- Implement infrastructure-as-code using Terraform for reproducible, secure, and audit-ready deployments.
- Own monitoring, alerting, and observability pipelines (Grafana, Prometheus, Loki) for end-to-end production health and actionable insights.
- Drive automation of deployment and operational workflows, enabling zero-downtime upgrades and rapid rollbacks.
- Participate in incident response, root cause analysis, and postmortem reporting for production blockchain services.
- Collaborate closely with engineering teams to identify opportunities for reliability and performance improvements.
- Document infrastructure, operational protocols, and DevOps best practices, ensuring knowledge sharing and team alignment.
- Stay current with new tools and cloud advancements relevant to blockchain, DevOps, and Kubernetes ecosystems.
What we're looking for
- Senior/Staff experience with AWS cloud services and advanced Kubernetes ops, in production-grade environments.
- Proven experience in infrastructure-as-code, particularly Terraform, and delivering best practices for security and scalability.
- Proficiency in monitoring/observability tools: Grafana, Prometheus, Loki.
- Hands-on with Kubernetes autoscaling (Karpenter or Cluster Autoscaler) and EC2 spot instance cost optimization.
- Strong scripting/programming skills (Bash, Python, Go, or similar).
- Strong troubleshooting, communication, and documentation abilities.
- Blockchain/Layer 2 protocol, cryptography, and Web3 infrastructure experience preferred but not required.
- Experience working in agile, high-performance teams within top technology environments.