Site Reliability Engineer

Location: Remote
Compensation: Salary
Reviewed: Fri, Apr 03, 2026
This job expires in: 30 days

Job Summary

A company is looking for a Senior Site Reliability Engineer to ensure the reliability, scalability, and efficiency of its systems.

Key Responsibilities
  • Maintain system uptime and reduce Mean Time to Recovery (MTTR)
  • Design and execute chaos experiments to validate system reliability and improve resilience
  • Analyze performance metrics and implement optimization strategies for Kubernetes clusters and cloud resources

Required Qualifications
  • Strong experience with Kubernetes administration and container orchestration
  • Hands-on experience with chaos engineering tools and best practices
  • Proficiency in cloud platforms (AWS, OCI, or GCP) and cost optimization strategies
  • Familiarity with performance testing tools (e.g., JMeter, Locust, k6)
  • Expertise in core DevOps and SRE technologies such as Ansible, Docker, and Terraform
