Forward Deployed Engineer
Location: Remote
Compensation: Salary
Reviewed: Wed, Jan 07, 2026
This job expires in: 27 days
Job Summary
A company is looking for a Forward Deployed Engineer, AI Inference (vLLM and Kubernetes).
Key Responsibilities
- Deploy and configure LLM-D and vLLM on Kubernetes clusters to maximize hardware utilization
- Run performance benchmarks and tune parameters to meet SLOs for latency and throughput
- Collaborate with customer engineers to write production-quality code that integrates the inference engine
Required Qualifications
- 8+ years of engineering experience in Backend Systems, SRE, or Infrastructure Engineering
- Deep expertise in Kubernetes, including custom resources and high-performance networking
- Proficiency in Python and Go for systems programming
- Experience with Infrastructure as Code tools like Helm or Terraform
- Familiarity with deploying LLMs on bare-metal and hyperscaler Kubernetes clusters
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...