Design and implement high-performance inference pipelines
Optimize model serving for throughput, latency, and cost across different workloads
Collaborate with research and product teams to integrate inference into real-world applications

Qualifications:

Deep experience developing and tuning LLM inference frameworks (e.g. vLLM)
Experience with cloud infrastructure (AWS, GCP, Azure) and Kubernetes
Passion for AI and practical ML systems
Experience building, deploying, and operating highly available, scalable, distributed cloud services

FREE TOOLS

Unlock Expert Career Tools

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...

Apply

ML Ops Engineer

Job Summary

Responsibilities:

Qualifications:

COMPLETE JOB DESCRIPTION

Company Overview

Related Jobs!

Applied for this Job?