Senior Software Engineer
Location: Remote
Compensation: Salary
Reviewed: Wed, May 27, 2026
This job expires in: 30 days
Job Summary
Building and operating automation for large-scale GPU clusters, the full-time Senior Software Engineer will focus on Kubernetes-based infrastructure, reliability, and operational systems in a remote environment.
Key responsibilities
- Develop automation and tooling for provisioning, monitoring, and lifecycle operations of GPU clusters
- Enhance workflows for cluster operations and reduce manual interventions through automation and GitOps
- Collaborate with cross-functional teams to ensure infrastructure is production-ready and participate in on-call incident response
Required qualifications
- 8+ years of experience in building or operating production infrastructure
- Strong programming skills in Python, Go, or similar languages
- Experience with Linux, Kubernetes, containers, and infrastructure automation
- Ability to troubleshoot distributed systems in a production environment
- BS/MS in Computer Science or equivalent experience
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...