Principal Systems Engineer

Location: Remote
Compensation: Salary
Reviewed: Wed, Mar 25, 2026
This job expires in: 30 days

Job Summary

A company is looking for a Principal Systems At-Scale Engineer.

Key Responsibilities
  • Deploy strategies for collecting and analyzing debugging signals from large clusters to enhance quality and experience
  • Build and expand debugging tools for diagnosing and recovering out-of-service systems
  • Lead cross-team task forces to address undefined failure modes in AI/GPU systems

Required Qualifications
  • 15+ years of experience in systems debugging at scale
  • BS/MS in Computer Science or related field (or equivalent experience)
  • Proven understanding of performance clusters and workload patterns
  • Experience with telemetry and at-scale analytics for large platforms
  • Programming/scripting experience in C/Python/Bash/Lua
