Manager of Reliability Operations
Location: Remote
Compensation: To Be Discussed
Reviewed: Fri, Jun 05, 2026
This job expires in: 30 days
Job Summary
The Manager of Reliability Operations is a full-time remote role that leads the evolution of incident management and reliability practices. The manager oversees incident response, drives accountability and learning, and ensures operational excellence across a complex platform ecosystem.
Key Responsibilities
- Continuously improve incident management and establish clear standards for incident declaration and communication
- Own post-incident reviews and translate incident trends into actionable reliability improvements
- Operate across diverse environments and provide data-driven reporting on availability and operational performance
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
- 7+ years of experience in systems operations, site reliability, or platform engineering
- 2+ years of experience leading teams or major operational functions
- Proven experience managing incidents in a 24/7 production environment
- Strong background in troubleshooting, root cause analysis, and operational improvement