Principal GenAI Data Engineer
Location: Remote
Compensation: Salary
Reviewed: Tue, May 26, 2026
This job expires in: 30 days
Job Summary
Seeking a fully remote Principal GenAI Data Engineer, the role focuses on architecting enterprise-scale data platforms and pipelines for Generative AI applications, driving the design and implementation of data ingestion and processing workflows.
Key responsibilities
- Architect enterprise-scale GenAI data platforms for ingestion, transformation, enrichment, and serving of structured and unstructured data
- Design scalable pipelines for enterprise knowledge ingestion from diverse data sources
- Define architecture for metadata extraction, chunking, enrichment, and knowledge preparation workflows
Required qualifications
- Expert-level Python programming and software engineering capabilities
- Experience building distributed/scalable data pipelines for AI workloads
- Strong understanding of unstructured data extraction and processing pipelines
- Experience with vector databases, graph databases, and metadata/knowledge storage systems
- Hands-on experience with clustering, entity recognition algorithms, and modern retrieval strategies
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...