AI Infrastructure Operations Engineer
Location: Remote
Compensation: Salary
Reviewed: Fri, Jun 05, 2026
This job expires in: 30 days
Job Summary
The full-time, remote AI Infrastructure Operations Engineer will support the operationalization of an AI-enabled clinical platform by managing the reliability, observability, and security of its Azure-based infrastructure, in collaboration with technology leadership and security stakeholders.
Key Responsibilities
- Establish operational reliability for the Companion platform across AKS infrastructure and deployment pipelines
- Develop observability practices to monitor platform behavior and identify operational risks
- Maintain security and operational hygiene through regular updates, CVE remediation, and compliance practices
Required Qualifications
- Strong hands-on experience with Kubernetes operations and troubleshooting
- Experience with cloud-native infrastructure in Azure environments, particularly AKS
- Demonstrated ability in monitoring, observability, and incident response
- SRE mindset with experience in operational prioritization and post-incident analysis
- Comfort operating in fast-paced environments with evolving processes and broad ownership