Senior AI HPC Cluster Engineer

Job is Expired
Location: Remote
Compensation: Salary
Reviewed: Fri, Feb 20, 2026

Job Summary

A company is looking for a Senior AI-HPC Cluster Engineer - MLOps.

Key Responsibilities
  • Lead the management of large-scale HPC systems, including compute, networking, and storage deployment
  • Develop scalable automation solutions for GPU-accelerated computing ecosystems
  • Support researchers with workload management, performance analysis, and optimizations
Required Qualifications
  • Bachelor's degree in Computer Science, Electrical Engineering, or related field, or equivalent experience
  • Minimum of 8+ years of experience in large-scale compute infrastructure
  • Experience with AI/HPC job schedulers and orchestrators (e.g., Slurm, K8s, LSF)
  • Proficient in Linux (Centos/RHEL, Ubuntu) and container technologies (e.g., Docker, Podman)
  • Proficiency in one scripting language (Python, Bash) and one compiled language (Golang, Rust, C, C++)

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...