Model Performance Engineer
Location: Remote
Compensation: To Be Discussed
Reviewed: Thu, Apr 30, 2026
This job expires in: 30 days
Job Summary
A company is looking for an AI Engineer - Model Performance to optimize model inference and build fine-tuning infrastructure.
Key Responsibilities
- Optimize inference performance for models, focusing on speed and cost-effectiveness
- Develop fine-tuning pipelines to streamline the model training process
- Debug production inference issues and evaluate serving frameworks for optimal performance
Required Qualifications
- Deep experience with LLM serving frameworks and tuning strategies
- Hands-on quantization experience with a strong understanding of various techniques
- Production fine-tuning experience and familiarity with training frameworks
- Strong proficiency in Python for infrastructure and pipeline development
- Comfort with GPU profiling and performance analysis