Senior Site Reliability Engineer
Location: Remote
Compensation: Salary
Reviewed: Tue, Jun 30, 2026
This job expires in: 27 days
Job Summary
As a remote full-time Senior Site Reliability Engineer, the successful candidate will be responsible for ensuring the availability, performance, and reliability of a large federal enterprise cloud platform, while defining service level objectives and leading incident response practices.
Key responsibilities:
- Defining and maintaining service level objectives (SLOs) and driving the platform toward them
- Designing and operating observability across metrics, logging, tracing, and alerting
- Leading incident response and on-call practices, including escalation and time-to-recovery improvements
Required qualifications:
- Bachelor's degree and 7+ years of relevant experience, or equivalent experience in lieu of education
- Demonstrated experience owning reliability for production systems, including SLOs and incident response
- Expert-level knowledge of at least one infrastructure-as-code tool, preferably Terraform
- Deep command of cloud infrastructure, containerization, and networking
- Ability to obtain and maintain a U.S. Public Trust/suitability determination
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...