Platform Reliability Engineer
Location: Remote
Compensation: Salary
Reviewed: Fri, May 08, 2026
This job expires in: 26 days
Job Summary
A company is looking for a Platform Reliability Engineer.
Key Responsibilities
- Design and maintain a Kubernetes-based platform for autonomous AI execution
- Automate infrastructure processes using Terraform and other tools to achieve zero manual intervention
- Establish reliability metrics and implement self-healing systems for AI workflows
Required Qualifications
- 6+ years of experience in Platform Engineering, SRE, or Infrastructure roles with a focus on AI/ML systems
- Mastery of Terraform, ArgoCD, and GitOps workflows
- Expert-level knowledge of Kubernetes networking, scaling, and security
- Hands-on experience with MLOps pipelines and scaling AI inference services
- Proficiency in Python for automation and platform tool development
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...