Machine Learning Systems Engineer

Location: Remote
Compensation: Salary
Reviewed: Tue, May 26, 2026
This job expires in: 30 days

Job Summary

Focused on optimizing performance and scalability, the full-time Machine Learning Systems Engineer will enhance core systems for training frontier models, utilizing profiling tools and optimizing distributed training pipelines, with opportunities for hybrid or fully remote work.

Key responsibilities:
  • Utilize profiling tools to identify bottlenecks and implement optimizations for data loading and gradient computation
  • Optimize distributed training pipelines using frameworks such as PyTorch Distributed
  • Design and maintain high-performance GPU kernels for machine learning workloads
Required qualifications:
  • Bachelor's, Master's degree, or PhD in Computer Science, Computer Engineering, or a related technical discipline
  • Strong proficiency in Python programming
  • Extensive hands-on experience with PyTorch
  • Experience optimizing machine learning model execution during training and inference
  • Exceptional analytical and problem-solving skills with a data-driven approach

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...