Job Summary
A software company has an open position for a Telecommute Central Engineering Site Reliability Engineer.
Must be able to:
- Provide on-call and incident management support for production systems
- Enhance, maintain, and scale the Packet Platform infrastructure
- Ensure a high quality of service for our customers and developers through monitoring
Position Requirements Include:
- Expertise writing and maintaining code in Python, Go, or a similar language
- Experience with Infrastructure as Code
- Experience with container orchestration
- Insatiable curiosity and strong problem-solving skills
- Experience with Prometheus, Grafana, and log aggregation systems
- Experience with OS and container security and data center networking