Senior Manager, Site Reliability Engineering

This job has been removed
Location: Remote
Compensation: To Be Discussed
Reviewed: Wed, May 14, 2025
This job expires in: 16 days
SRE DevOps Incident Management Postmortems

Job Summary

A company is looking for a Senior Manager, Site Reliability Engineering (EU CET).

Key Responsibilities
  • Lead and grow a high-performing SRE team responsible for the reliability, performance, and scalability of production systems
  • Own the incident management process, postmortems, and root cause analysis to improve system resilience
  • Drive implementation of SLAs, SLOs, and error budgets across services to align operational goals with business objectives
Required Qualifications
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
  • Proven success in leading high-performing SRE or DevOps teams in a large-scale, fast-paced environment
  • Extensive experience running high-availability web services at a large scale, with comprehensive knowledge of cloud-native architectures
  • Strong technical background with hands-on experience in cloud computing, system architecture, automation, and monitoring
  • Experience with tools and technologies such as AWS, Kubernetes, Terraform, Prometheus, Grafana, and Jenkins
FREE TOOLS
Unlock Expert Career Tools

Register free for worksheets, guides, and on-demand coaching to support your job search.

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...