Systems Engineer, HPC
Location: Remote
Compensation: Salary
Reviewed: Fri, Jun 19, 2026
This job expires in: 29 days
Job Summary
To support the infrastructure behind cutting-edge AI platforms, the full-time remote Systems Engineer, HPC will operate and maintain large-scale Linux environments, monitor system health, and collaborate with cross-functional teams to ensure reliable performance and scalability.
Key responsibilities
- Operate and maintain large-scale Linux environments, ensuring high availability and troubleshooting incidents
- Scale clusters and improve performance for systems handling petabyte-scale storage
- Automate operational tasks using tools like Python, Bash, Ansible, or Terraform, and contribute to system design decisions
Required qualifications
- Strong Linux systems administration experience
- Experience working in large-scale environments such as HPC clusters or cloud infrastructure
- Familiarity with job schedulers (e.g., Slurm)
- Solid troubleshooting skills across systems, hardware, and networks
- Experience with automation tooling or Infrastructure as Code
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...