Senior Site Reliability Engineer
Location: Remote
Compensation: To Be Discussed
Reviewed: Wed, Jun 17, 2026
This job expires in: 30 days
Job Summary
Seeking a Senior Site Reliability Engineer for a full-time remote position who will ensure the reliability of services, set up monitoring systems, and manage incident investigations while collaborating closely with developers.
Key responsibilities
- Ensure the reliability of services through SLIs/SLOs, availability management, and bottleneck elimination
- Establish and maintain monitoring systems, metrics, alerts, and user-friendly dashboards
- Conduct load testing, analyze results, and provide resource and scaling recommendations
Required qualifications
- 5+ years of experience in SRE/DevOps with a focus on high-load production systems
- Deep practical knowledge of Docker and Kubernetes, with production experience
- Hands-on experience with Prometheus, Alertmanager, and Grafana for metrics and alerts
- Proficiency in Python for automation and tooling tasks
- Experience with cloud platforms (GCP and/or AWS) and strong Linux skills
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...