Machine Learning Systems Engineer
Location: Remote
Compensation: Salary
Reviewed: Tue, May 26, 2026
This job expires in: 30 days
Job Summary
Focused on optimizing performance and scalability, the full-time Machine Learning Systems Engineer will enhance core systems for training frontier models, utilizing profiling tools and optimizing distributed training pipelines, with opportunities for hybrid or fully remote work.
Key responsibilities:
- Utilize profiling tools to identify bottlenecks and implement optimizations for data loading and gradient computation
- Optimize distributed training pipelines using frameworks such as PyTorch Distributed
- Design and maintain high-performance GPU kernels for machine learning workloads
Required qualifications:
- Bachelor's, Master's degree, or PhD in Computer Science, Computer Engineering, or a related technical discipline
- Strong proficiency in Python programming
- Extensive hands-on experience with PyTorch
- Experience optimizing machine learning model execution during training and inference
- Exceptional analytical and problem-solving skills with a data-driven approach
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...