Job Summary
A digital marketing company is searching for a person to fill their position for a Remote Senior Site Reliability Engineer.
Core Responsibilities of this position include:
- Improve the tooling and automation of our infrastructure to minimize manual work, increase performance, and decrease the frequency and severity of incidents
- Build, maintain, and support core applications
- Monitor our systems for capacity, performance, and troubleshooting issues
Must meet the following requirements for consideration:
- Experience in Site Reliability or Software Engineering, building and maintaining scalable, resilient services
- Building the tooling and automation to manage those services, as well as investigating system and application metrics to diagnose and resolve performance issues
- 4+ years experience as an SRE or Software Engineer, with a focus on Cloud platforms
- Experience and willingness to operate in an on-call environment, evaluating and improving monitoring and alerting systems, and developing run books to investigate and debug issues
- Strong experience with infrastructure as code tools
- Linux experience (Ubuntu/Debian) is a must