Senior Site Reliability Engineer
Location: Remote
Compensation: To Be Discussed
Reviewed: Thu, May 21, 2026
This job expires in: 30 days
Job Summary
Owning the performance, reliability, and cost efficiency of a large-scale production platform, the full-time Senior Site Reliability Engineer will drive improvements in latency and availability while leading the shift to AI-assisted reliability engineering in a remote environment.
Key responsibilities
- Drive measurable improvements in system performance and eliminate bottlenecks across production environments
- Define and enforce SLIs, SLOs, and error budgets to balance speed, reliability, and growth
- Lead database performance optimization and oversee AI-assisted load testing and capacity planning workflows
Required qualifications
- 7+ years of experience operating and evolving large-scale production systems
- Deep expertise in Linux systems with hands-on performance tuning across various resources
- Strong Python skills for automation and tooling in AI-assisted systems workflows
- Advanced experience with Kubernetes, including workload sizing and multi-tenant operations
- Proven ability to diagnose and resolve complex database performance issues with MySQL/MariaDB or PostgreSQL
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...