Senior AI HPC Cluster Engineer

Job is Expired

Location: Remote

Compensation: Salary

Reviewed: Fri, Feb 20, 2026

Job Category: Information Technology

Weekly Hours: Full Time

Employment Status: Permanent

Employer Type: Employer

Career Level: Experienced, Senior Level

Education Level: Bachelors

Job Summary

A company is looking for a Senior AI-HPC Cluster Engineer - MLOps.

Key Responsibilities

Lead the management of large-scale HPC systems, including compute, networking, and storage deployment
Develop scalable automation solutions for GPU-accelerated computing ecosystems
Support researchers with workload management, performance analysis, and optimizations

Required Qualifications

Bachelor's degree in Computer Science, Electrical Engineering, or related field, or equivalent experience
Minimum of 8+ years of experience in large-scale compute infrastructure
Experience with AI/HPC job schedulers and orchestrators (e.g., Slurm, K8s, LSF)
Proficient in Linux (Centos/RHEL, Ubuntu) and container technologies (e.g., Docker, Podman)
Proficiency in one scripting language (Python, Bash) and one compiled language (Golang, Rust, C, C++)

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...

Job is Expired

Company Overview

Company Company Name

Headquarters Headquarters

Founded Founded

Website

Wikipedia Wikipedia URL

BBB URL BBB URL

The company description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...