Platform Reliability Engineer

Location: Remote
Compensation: Salary
Reviewed: Fri, May 08, 2026
This job expires in: 26 days

Job Summary

A company is looking for a Platform Reliability Engineer.

Key Responsibilities
  • Design and maintain a Kubernetes-based platform for autonomous AI execution
  • Automate infrastructure processes using Terraform and other tools to achieve zero manual intervention
  • Establish reliability metrics and implement self-healing systems for AI workflows
Required Qualifications
  • 6+ years of experience in Platform Engineering, SRE, or Infrastructure roles with a focus on AI/ML systems
  • Mastery of Terraform, ArgoCD, and GitOps workflows
  • Expert-level knowledge of Kubernetes networking, scaling, and security
  • Hands-on experience with MLOps pipelines and scaling AI inference services
  • Proficiency in Python for automation and platform tool development

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...