Model Performance Engineer

Location: Remote
Compensation: To Be Discussed
Reviewed: Thu, Apr 30, 2026
This job expires in: 30 days

Job Summary

A company is looking for an AI Engineer - Model Performance to optimize model inference and build fine-tuning infrastructure.

Key Responsibilities
  • Optimize inference performance for models, focusing on speed and cost-effectiveness
  • Develop fine-tuning pipelines to streamline the model training process
  • Debug production inference issues and evaluate serving frameworks for optimal performance

Required Qualifications
  • Deep experience with LLM serving frameworks and tuning strategies
  • Hands-on quantization experience with a strong understanding of various techniques
  • Production fine-tuning experience and familiarity with training frameworks
  • Strong proficiency in Python for infrastructure and pipeline development
  • Comfort with GPU profiling and performance analysis
