AI Performance Optimization Engineer

Location: Remote

Compensation: Salary

Reviewed: Tue, May 26, 2026

This job expires in: 30 days

Job Category: Project Management

Weekly Hours: Full Time

Employment Status: Independent Contractor

Employer Type: Employer

Career Level: Experienced, Management

Education Level: Bachelors, Masters

Job Summary

Focused on maximizing throughput and minimizing latency, the full-time AI Performance Optimization Engineer will work remotely to optimize training and inference workloads for large neural network systems, utilizing deep knowledge of GPU architecture, memory management, and compiler-level optimization.

Key Responsibilities

Profile and optimize end-to-end AI training and inference pipelines for throughput, latency, and cost
Identify and eliminate bottlenecks across data loading, model compute, communication, and memory
Implement and tune quantization, sparsity, and pruning strategies to reduce model footprint and accelerate inference

Required Qualifications

Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field
Six or more years of experience in performance engineering, ML systems, or HPC
Strong proficiency in Python and C++
Hands-on experience optimizing deep learning workloads on modern GPUs
Deep understanding of distributed training and inference techniques

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...

Apply

Company Overview

Company Company Name

Headquarters Headquarters

Founded Founded

Website

The company description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...