Job Summary
A software company is in need of a Telecommuting Site Reliability Engineer.
Individual must be able to fulfill the following responsibilities:
- Diving into problems with an eye to both immediate remediation and the follow-through changes and automation
- Working to constantly improve our resiliency by developing self-healing, self-assembling infrastructure
- Participating in a 24/7 on-call rotation that supports our production infrastructure
Skills and Requirements Include:
- Working knowledge of industry best practices with regard to information security
- Experience operating and maintaining production systems in a Linux and public cloud environment
- Familiarity with the concepts and ecosystem of infrastructure management and the operations lifecycle
- Experience building and scaling distributed, highly available systems
- Comfort working in Go or another low-level programming language, such as C