Platform Lifecycle Team Manager
Location: Remote
Compensation: To Be Discussed
Reviewed: Fri, Jun 05, 2026
This job expires in: 29 days
Job Summary
This full-time remote position seeks a highly skilled Platform & Lifecycle Team Manager to lead a team focused on technical troubleshooting and lifecycle management for a large-scale server fleet, ensuring high performance and uptime in cloud environments.
Key Responsibilities
- Lead the Platform, Lifecycle & Troubleshooting team in resolving complex incidents and platform issues
- Own server repurposing, migrations, and deeper lifecycle management
- Perform advanced troubleshooting for RDMA links, GPU, storage, and server-side networking
Required Qualifications
- 8+ years of experience in Linux systems administration, platform engineering, or SRE-style operations
- Deep expertise in troubleshooting GPU, storage, RDMA, and high-performance networking issues
- Proven track record leading technical teams and managing on-call rotations
- Strong scripting/automation skills (Python, Bash, Ansible, etc.) and experience with monitoring tools
- Bachelor's degree in Computer Science, Engineering, or equivalent experience