Principal GenAI Data Engineer

Location: Remote
Compensation: Salary
Reviewed: Tue, May 26, 2026
This job expires in: 30 days

Job Summary

Seeking a fully remote Principal GenAI Data Engineer, the role focuses on architecting enterprise-scale data platforms and pipelines for Generative AI applications, driving the design and implementation of data ingestion and processing workflows.

Key responsibilities
  • Architect enterprise-scale GenAI data platforms for ingestion, transformation, enrichment, and serving of structured and unstructured data
  • Design scalable pipelines for enterprise knowledge ingestion from diverse data sources
  • Define architecture for metadata extraction, chunking, enrichment, and knowledge preparation workflows
Required qualifications
  • Expert-level Python programming and software engineering capabilities
  • Experience building distributed/scalable data pipelines for AI workloads
  • Strong understanding of unstructured data extraction and processing pipelines
  • Experience with vector databases, graph databases, and metadata/knowledge storage systems
  • Hands-on experience with clustering, entity recognition algorithms, and modern retrieval strategies

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...