Site Reliability Engineer

Location: Remote

Compensation: To Be Discussed

Reviewed: Fri, Mar 13, 2026

This job expires in: 30 days

Job Category: Information Technology

Weekly Hours: Full Time

Employer Type: Employer

Career Level: Experienced

Education Level: Masters

Job Summary

A company is looking for a Site Reliability Engineer.

Key Responsibilities

Develop and maintain observability solutions using platforms like Datadog, Prometheus, and Grafana
Lead incident management efforts, including coordinating responses and troubleshooting issues
Collaborate with product engineering teams to architect reliable systems and implement monitoring strategies

Required Qualifications

4+ years of experience in Site Reliability Engineering or similar DevOps roles
2+ years of hands-on experience with Kubernetes and managing its infrastructure
Strong experience with modern monitoring stacks including Prometheus, Grafana, and Datadog
Experience with Infrastructure as Code tools, like Terraform and Helm
Expertise with at least one major cloud service provider (AWS, GCP, Azure)

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...

Apply

Company Overview

Company Company Name

Headquarters Headquarters

Founded Founded

Website

The company description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...