Senior Site Reliability Engineer

Job is Expired
Location: Remote
Compensation: Salary
Reviewed: Sun, Jul 06, 2025

Job Summary

A company is looking for a Senior Site Reliability Engineer, AI Infrastructure.

Key Responsibilities
  • Develop and maintain large-scale systems for AI Infrastructure, ensuring reliability and scalability
  • Implement SRE fundamentals, including incident management and automation tools to enhance operational efficiency
  • Establish frameworks for operational maturity and lead incident response protocols to improve system resilience

Required Qualifications
  • Degree in Computer Science or a related field, or equivalent experience, with 12+ years in Software Development, SRE, or Production Engineering
  • Proficiency in Python and at least one additional programming language (C/C++, Go, Perl, Ruby)
  • Expertise in systems engineering within Linux or Windows environments and cloud platforms (AWS, OCI, Azure, GCP)
  • Strong understanding of SRE principles, including error budgets and Infrastructure as Code tools
  • Hands-on experience with observability platforms and CI/CD systems
