Senior Site Reliability Engineer

Location: Remote
Compensation: To Be Discussed
Reviewed: Thu, May 21, 2026
This job expires in: 30 days

Job Summary

Owning the performance, reliability, and cost efficiency of a large-scale production platform, the full-time Senior Site Reliability Engineer will drive improvements in latency and availability while leading the shift to AI-assisted reliability engineering in a remote environment.

Key responsibilities
  • Drive measurable improvements in system performance and eliminate bottlenecks across production environments
  • Define and enforce SLIs, SLOs, and error budgets to balance speed, reliability, and growth
  • Lead database performance optimization and oversee AI-assisted load testing and capacity planning workflows
Required qualifications
  • 7+ years of experience operating and evolving large-scale production systems
  • Deep expertise in Linux systems with hands-on performance tuning across various resources
  • Strong Python skills for automation and tooling in AI-assisted systems workflows
  • Advanced experience with Kubernetes, including workload sizing and multi-tenant operations
  • Proven ability to diagnose and resolve complex database performance issues with MySQL/MariaDB or PostgreSQL

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...