Job Summary
A growing social impact platform is searching for a person to fill their position for a Remote Senior Site Reliability Engineer.
Must be able to:
- Engage with engineering teams to design, build and maintain services for high availability and resiliency
- Build reusable software layers, scripts, deployment frameworks, alarms, probes, and self-healing tools
- Maintain documentation and standards for engineering teams to follow for onboarding and operating services
Skills and Requirements Include:
- 7+ years of experience with operations, DevOps, and/or software engineering
- Experience operating large scale distributed systems, especially in cloud environments
- Strong programming fundamentals with 3+ years of experience in one of the following languages: C#, Python, Scala, Go, or Java
- Experience supporting and managing Docker containers from local development to production on Kubernetes
- Committed to infrastructure as code and config using tools
- Experience with configuring and operating Kubernetes clusters in Azure or AWS