Job Summary
A software solutions company has an open position for a Remote Infrastructure Principal Site Reliability Engineer.
Candidates will be responsible for the following:
- Developing the automation and tooling needed to keep track of the health of our infrastructure
- Working with developers, architects, product management, operations, and other cross-functional teams
- Outlining our plan for reducing the risk associated with technology operations
Skills and Requirements Include:
- 10+ years of relevant experience building distributed systems on a public cloud (AWS, GCP)
- Strong foundation in software application design and system internals
- Industry experience with SaaS architecture
- Ability to collaborate effectively with other engineers to solve the most complex technical problems
- Experience with distributed systems in a production operations environment
- Experience working with large-scale Internet infrastructure