Model Performance Engineer

Location: Remote

Compensation: To Be Discussed

Reviewed: Thu, Apr 30, 2026

This job expires in: 30 days

Job Category: Information Technology

Employment Status: Permanent

Employer Type: Employer

Education Level: Doctorate

Job Summary

A company is looking for an AI Engineer - Model Performance to optimize model inference and build fine-tuning infrastructure.

Key Responsibilities

Optimize inference performance for models, focusing on speed and cost-effectiveness
Develop fine-tuning pipelines to streamline the model training process
Debug production inference issues and evaluate serving frameworks for optimal performance

Required Qualifications

Deep experience with LLM serving frameworks and tuning strategies
Hands-on quantization experience with a strong understanding of various techniques
Production fine-tuning experience with familiarity in training frameworks
Strong proficiency in Python for infrastructure and pipeline development
Comfort with GPU profiling and performance analysis

COMPLETE JOB DESCRIPTION

The job description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...

Apply

Company Overview

Company Company Name

Headquarters Headquarters

Founded Founded

Website

The company description is available to subscribers. Subscribe today to get the full benefits of a premium membership with Virtual Vocations. We offer the largest remote database online...