Site Reliability Engineer
Location: Remote
Compensation: To Be Discussed
Reviewed: Thu, Jul 02, 2026
This job expires in: 29 days
Job Summary
Working remotely, the full-time Site Reliability Engineer will ensure 99.99% availability across a mission-critical Department of Veterans Affairs cloud platform by managing incident response, capacity planning, and automation.
Key responsibilities
- Lead incident triage and root cause analysis while driving automation to reduce operational toil
- Perform daily monitoring and reporting of performance metrics, including the four Golden Signals
- Participate in 24x7 on-call support rotations and manage incident response activities
Required qualifications
- Bachelor's Degree in computer science, information technology, or related field
- 3 years of experience in site reliability engineering or platform operations
- AWS Certified DevOps Engineer - Associate or Certified Kubernetes Administrator (CKA) preferred
- Experience with Infrastructure as Code (IaC) practices and Terraform-based environment provisioning
- Active Federal Civilian Public Trust clearance or eligibility for it
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...