Data Engineer

Job is Expired
Location: Remote
Compensation: To Be Discussed
Reviewed: Wed, Jun 25, 2025

Job Summary

A company is looking for a Member of Technical Staff, Pre-Training Data Engineer.

Key Responsibilities
  • Design and build scalable data pipelines for diverse datasets, ensuring effective ingestion, cleaning, filtering, and optimization
  • Conduct data ablations to assess quality and experiment with data mixtures to enhance model performance
  • Develop robust data modeling techniques to structure datasets for optimal training efficiency
Required Qualifications
  • Strong software engineering skills with proficiency in Python
  • Familiarity with data processing frameworks such as Apache Spark, Apache Beam, or Pandas
  • Experience working with large-scale datasets, including web and multilingual data
  • Knowledge of data quality assessment techniques
  • A passion for bridging research and engineering in AI model training

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...