AI Infrastructure Engineer

Location: Remote

Compensation: Salary

Reviewed: Fri, May 29, 2026

This job expires in: 30 days

Job Category: Information Technology

Weekly Hours: Full Time

Employment Status: Independent Contractor

Employer Type: Employer

Career Level: Experienced

Education Level: Bachelors, Masters

Job Summary

Seeking a skilled AI Infrastructure Engineer to design, build, and operate the platform layer for large-scale AI training and inference workloads. This full-time remote position focuses on GPU clusters, distributed training frameworks, and improving the developer experience for ML engineers and researchers.

Key Responsibilities

Design and operate GPU and accelerator infrastructure for training and inference, spanning on-premises clusters and cloud-managed services
Build scheduling and resource-sharing systems to maximize accelerator utilization across teams
Integrate ML frameworks into a unified platform and maintain high-performance storage systems and data pipelines

Required Qualifications

Bachelor's or Master's degree in Computer Science or a related field
Six or more years of experience in infrastructure, platform, or HPC engineering
Hands-on experience with GPU clusters or large-scale ML training infrastructure
Strong proficiency in Python and at least one systems language such as Go or C++
Deep understanding of distributed training, accelerator architectures, and collective communication

