Lead Site Reliability Engineer
Location: Remote
Compensation: Salary
Reviewed: Fri, Jun 19, 2026
This job expires in: 28 days
Job Summary
To enhance reliability across the organization, the full-time Lead Site Reliability Engineer will analyze incidents, prioritize improvements, and collaborate with engineering teams remotely to implement effective solutions and best practices.
Key responsibilities
- Identify patterns of failure and analyze incidents to determine root causes and prevent future occurrences
- Prioritize and implement engineering interventions that improve reliability and reduce incident response times
- Collaborate with cross-functional teams to disseminate best practices and lead post-incident reviews
Required qualifications
- 5+ years of hands-on experience in Site Reliability, Platform, or Infrastructure Engineering in a large-scale environment
- Proficiency in at least one programming language (e.g., Python, Go, TypeScript, Java) with a track record of code shipped to production
- Experience driving the adoption of reliability patterns across teams with measurable outcomes
- Hands-on experience with major cloud platforms and observability tools
- Exposure to Artificial Intelligence and Large Language Model tooling in an engineering context
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...