Job Summary
A technology company is seeking a Telecommute Cluster Operations Engineering Manager.
Core responsibilities of this position include:
- Developing processes for on-time delivery of products
- Building tools that enable self-service utilization of Heptio's infrastructure
- Recruiting and ensuring mentorship of new site reliability engineers (SREs)
Must meet the following requirements for consideration:
- You have been responsible for hosted services and understand the challenges, complexities and tradeoffs involved in building and deploying new systems to production
- You have operational experience with containers and container orchestration technologies, particularly Docker and Kubernetes
- You have implemented automation for the deployment, monitoring and observability of production services
- You have led software teams that have successfully delivered complex products
- You are capable of conducting and participating in design and code reviews with your team