AI Data Infrastructure Engineer
Location: Remote
Compensation: Salary
Reviewed: Thu, Jun 04, 2026
Job Summary
We are seeking an experienced AI Data Infrastructure Engineer for a full-time remote position. The engineer will design and operate large-scale data pipelines, build ingestion systems for diverse modalities, and implement data quality assurance processes to support AI training and evaluation workflows.
Key Responsibilities
- Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows
- Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals
- Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale
Required Qualifications
- Bachelor's or Master's degree in Computer Science or a related field
- Six or more years of data engineering experience, with significant work supporting ML or AI workloads
- Strong proficiency in Python and at least one JVM or systems language
- Deep experience with modern data processing frameworks such as Spark, Ray, or Beam
- Hands-on experience operating petabyte-scale storage and pipeline systems