Principal Systems Engineer
Location: Remote
Compensation: Salary
Reviewed: Wed, Mar 25, 2026
This job expires in: 30 days
Job Summary
A company is looking for a Principal Systems At-Scale Engineer.
Key Responsibilities
- Deploy strategies for analyzing and collecting debugging signals from large clusters to enhance quality and experience
- Build and expand debugging tools for diagnosing and recovering out-of-service systems
- Lead cross-team task forces to address undefined failure modes in AI/GPU systems
Required Qualifications
- 15+ years of experience in systems debugging at scale
- BS/MS in Computer Science or related field (or equivalent experience)
- Proven understanding of performance clusters and workload patterns
- Experience with telemetry and at-scale analytics for large platforms
- Programming/scripting experience in C/Python/Bash/Lua
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...