Job Summary
A venture-backed startup working at the forefront of AI is filling a position for a Remote Principal Site Reliability Engineer.
Must be able to:
- Write software to build, maintain, automate, and introspect our production systems
- Mentor teams to reliably and cost-effectively operate and maintain their services
- Take proactive steps to improve our availability, reliability, and efficiency
Applicants must meet the following qualifications:
- 5 years of experience as a Software Engineer, Systems Administrator, Operations Engineer, Site Reliability Engineer, or in a similar role
- A systematic problem-solving approach, coupled with good communication skills, a sense of ownership, and drive
- Experience operating large-scale, distributed systems on top of cloud infrastructure
- Experience programming in one or more of the following: Go, Python, Node.js, Bash, or similar languages
- A proven grasp of Linux systems administration and programming concepts