Site Reliability Engineer

Location: Remote

Compensation: Salary

Reviewed: Wed, May 20, 2026

This job expires in: 30 days

Job Category: Information Technology

Weekly Hours: Full Time

Employment Status: Independent Contractor

Employer Type: Employer

Career Level: Experienced

Education Level: Bachelors

Job Summary

Seeking an experienced Site Reliability Engineer (SRE) to ensure the availability and performance of large-scale distributed systems in a full-time, remote position, where the candidate will manage incident responses, design monitoring strategies, and automate operational processes.

Key Responsibilities

Define and refine service-level objectives (SLOs) and indicators (SLIs) while driving engineering decisions based on these metrics
Lead incident response efforts for production issues, ensuring effective post-incident reviews and improvements
Design and implement monitoring, logging, and tracing strategies to enhance system visibility and operational efficiency

Required Qualifications

Bachelor's degree in Computer Science, Engineering, or a related technical discipline
Five or more years of SRE, DevOps, or production engineering experience with large-scale distributed systems
Strong programming skills in Python, Go, or Java for automation and tooling development
Hands-on experience operating Kubernetes and container-based workloads
Deep knowledge of observability tools such as Prometheus, Grafana, and ELK/EFK

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...

Apply

Company Overview

Company Company Name

Headquarters Headquarters

Founded Founded

Website

The company description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...