ML Ops Engineer
Location: Remote
Compensation: To Be Discussed
Reviewed: Mon, May 12, 2025
This job expires in: 22 days
Job Summary
A company is looking for an ML Ops Engineer to architect and implement efficient ML inference pipelines for large language models.
Responsibilities:
- Design and implement high-performance inference pipelines
- Optimize model serving for throughput, latency, and cost across different workloads
- Collaborate with research and product teams to integrate inference into real-world applications
Qualifications:
- Deep experience developing and tuning LLM inference frameworks (e.g. vLLM)
- Experience with cloud infrastructure (AWS, GCP, Azure) and Kubernetes
- Passion for AI and practical ML systems
- Experience building, deploying, and operating highly available, scalable, distributed cloud services
FREE TOOLS
Unlock Expert Career Tools
Register free for worksheets, guides, and on-demand coaching to support your job search.
COMPLETE JOB DESCRIPTION
The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...