ML Ops Engineer

Location: Remote
Compensation: To Be Discussed
Reviewed: Mon, May 12, 2025
This job expires in: 22 days
ML Inference VLLM AWS GCP

Job Summary

A company is looking for an ML Ops Engineer to architect and implement efficient ML inference pipelines for large language models.

Responsibilities:
  • Design and implement high-performance inference pipelines
  • Optimize model serving for throughput, latency, and cost across different workloads
  • Collaborate with research and product teams to integrate inference into real-world applications
Qualifications:
  • Deep experience developing and tuning LLM inference frameworks (e.g. vLLM)
  • Experience with cloud infrastructure (AWS, GCP, Azure) and Kubernetes
  • Passion for AI and practical ML systems
  • Experience building, deploying, and operating highly available, scalable, distributed cloud services
FREE TOOLS
Unlock Expert Career Tools

Register free for worksheets, guides, and on-demand coaching to support your job search.

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...