AI Data Infrastructure Engineer

Location: Remote
Compensation: Salary
Reviewed: Tue, May 26, 2026
This job expires in: 30 days

Job Summary

The full-time AI Data Infrastructure Engineer will design and operate large-scale data pipelines for AI training and evaluation, ensuring high-quality data delivery across diverse modalities. The position is remote.

Key Responsibilities
  • Design and operate large-scale data pipelines supporting AI training and evaluation workflows
  • Build ingestion systems for various data modalities and implement quality assurance processes
  • Collaborate with ML researchers to align data systems with model development needs

Required Qualifications
  • Bachelor's or Master's degree in Computer Science or a related field
  • Six or more years of data engineering experience, particularly with ML or AI workloads
  • Strong proficiency in Python and at least one JVM or systems language
  • Deep experience with modern data processing frameworks such as Spark, Ray, or Beam
  • Hands-on experience operating petabyte-scale storage and pipeline systems
