Lead and grow a high-performing SRE team responsible for the reliability, performance, and scalability of production systems
Own the incident management process, postmortems, and root cause analysis to improve system resilience
Drive implementation of SLAs, SLOs, and error budgets across services to align operational goals with business objectives

Required Qualifications

Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
Proven success in leading high-performing SRE or DevOps teams in a large-scale, fast-paced environment
Extensive experience running high-availability web services at large scale, with comprehensive knowledge of cloud-native architectures
Strong technical background with hands-on experience in cloud computing, system architecture, automation, and monitoring
Experience with tools and technologies such as AWS, Kubernetes, Terraform, Prometheus, Grafana, and Jenkins


Senior Manager, Site Reliability Engineering
