Site Reliability Engineer
Location: Remote
Compensation: To Be Discussed
Reviewed: Mon, Jun 22, 2026
This job expires in: 30 days
Job Summary
This full-time, remote Site Reliability Engineer role supports a growing AI infrastructure. The engineer will provision and operate Kubernetes-based clusters, build automation tools, and debug customer issues.
Key Responsibilities
- Provision, configure, and operate Kubernetes-based clusters for customers across multiple providers
- Build automation and tooling to streamline cluster deployments and integrations
- Design and implement monitoring, alerting, and observability for critical systems
Required Qualifications
- 5+ years of experience in SRE, DevOps, or infrastructure engineering roles
- Strong Linux systems and networking fundamentals
- Deep experience with Kubernetes and container orchestration at scale
- Proficiency with Infrastructure-as-Code tools such as Terraform, Helm, or Ansible
- Strong automation and scripting skills in Python, Go, or Bash