Principal AI/ML Infra Engineer

Location: Remote
Compensation: Salary
Reviewed: Mon, May 05, 2025
This job expires in: 15 days
AI/ML Platforms DevOps Azure AWS

Job Summary

A company is looking for a Principal AI/ML Infra and Ops Engineer to manage operations for an enterprise AI/ML platform.

Key Responsibilities:
  • Implement automation across the infrastructure lifecycle using Infrastructure as Code (IaC) and DevOps practices
  • Develop monitoring frameworks for infrastructure to ensure high availability and performance optimization
  • Provide SRE support for users on the platform, including ticket triage and customer liaison

Required Qualifications:
  • Bachelor's degree in computer science, information technology, or a STEM-related field
  • 8+ years of infrastructure experience with large-scale, cloud-based software platforms
  • 6+ years of experience with Infrastructure-as-Code and CI/CD tools, such as Terraform and GitHub Actions
  • 4+ years of experience with containerization technologies like Kubernetes and Docker
  • 4+ years of scripting and automation experience, particularly in Python and Bash