Data Engineer
Job is Expired
Location: Remote
Compensation: To Be Discussed
Reviewed: Wed, Jun 25, 2025
Job Summary
A company is looking for a Member of Technical Staff, Pre-Training Data Engineer.
Key Responsibilities
- Design and build scalable data pipelines for diverse datasets, ensuring effective ingestion, cleaning, filtering, and optimization
- Conduct data ablations to assess quality and experiment with data mixtures to enhance model performance
- Develop robust data modeling techniques to structure datasets for optimal training efficiency
Required Qualifications
- Strong software engineering skills with proficiency in Python
- Familiarity with data processing frameworks such as Apache Spark, Apache Beam, or Pandas
- Experience working with large-scale datasets, including web and multilingual data
- Knowledge of data quality assessment techniques
- A passion for bridging research and engineering in AI model training
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...
Job is Expired