Senior Solutions Architect
Location: Remote
Compensation: Salary
Reviewed: Wed, Jun 24, 2026
This job expires in: 21 days
Job Summary
The Senior Solutions Architect is a full-time remote role focused on improving the performance of HPC systems and AI factories. The architect will develop observability solutions, interpret telemetry data, and collaborate with cross-functional teams to ensure system readiness and health.
Key Responsibilities
- Run validation tools and microbenchmarks to assess system health and performance
- Establish metrics and thresholds that define system health across the stack
- Develop automation for data collection and transformation to improve system visibility and reporting
Required Qualifications
- Bachelor's degree or equivalent experience in Computer Science, Mathematics, Engineering, Physics, or a related field
- 6+ years of experience managing Linux-based systems in HPC or large AI/ML environments
- Hands-on experience with multi-GPU and multi-node cluster architecture
- Proficiency in Python and Shell/Bash for scripting and automation
- Practical experience with observability systems such as Prometheus or Grafana