Manager, HPC Fleet Operations

Location: Remote
Compensation: Salary
Staff Reviewed: Thu, Feb 08, 2024
This job expires in: 14 days

Job Summary

A company is looking for a Manager, HPC Fleet Operations.

Key Responsibilities:
  • Build and lead a 24/7 team of process-oriented, reliability- and observability-focused engineers
  • Lead the socialization and documentation of clear, consistent processes for provisioning, validating, and troubleshooting nodes in the server fleet
  • Provide a 24/7 engineering support function for high-criticality, time-sensitive node delivery and maintenance

Required Qualifications, Training, and Education:
  • Seven or more years of experience in software or infrastructure engineering, with at least two years in a leadership capacity
  • Background in SRE fundamentals, incident management, blameless culture, observability, and change management
  • Experience building and leading a 24/7 team of high-performing, diverse engineers
  • Strong belief in the value of automation and driving reliability through cross-team processes and tooling
  • Ability to mentor team members and support their personal growth and capabilities
