Senior AI Infrastructure Engineer
Location: Remote
Compensation: To Be Discussed
Reviewed: Thu, Jul 02, 2026
This job expires in: 28 days
Job Summary
Serving as a technical leader within the operations organization, the full-time Senior AI Infrastructure & Platform Operations Engineer will manage complex AI infrastructure environments powered by NVIDIA GPUs and Kubernetes, focusing on operational excellence and incident resolution in a remote capacity across the EU.
Key responsibilities:
- Lead the investigation and resolution of complex infrastructure and platform-related incidents
- Drive improvements in platform reliability, observability, and operational processes
- Mentor AI Infrastructure & Platform Operations Engineers and develop operational standards
Required qualifications:
- 7+ years of experience in infrastructure operations or related technical roles
- Expert-level Linux administration and troubleshooting skills
- Strong experience operating Kubernetes in production environments
- Proven experience leading technical investigations and managing complex incidents
- Strong understanding of observability and service reliability practices
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...