Forward Deployed Engineer

Location: Remote
Compensation: Salary
Reviewed: Wed, Jan 07, 2026
This job expires in: 27 days

Job Summary

A company is looking for a Forward Deployed Engineer, AI Inference (vLLM and Kubernetes).

Key Responsibilities
  • Deploy and configure LLM-D and vLLM on Kubernetes clusters to maximize hardware utilization
  • Run performance benchmarks and tune parameters to meet SLOs for latency and throughput
  • Collaborate with customer engineers to write production-quality code that integrates the inference engine
Required Qualifications
  • 8+ years of engineering experience in Backend Systems, SRE, or Infrastructure Engineering
  • Deep expertise in Kubernetes, including custom resources and high-performance networking
  • Proficiency in Python and Go for systems programming
  • Experience with Infrastructure as Code tools like Helm or Terraform
  • Familiarity with deploying LLMs on bare-metal and hyperscaler Kubernetes clusters

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...