Research Scientist in Reinforcement Learning
Location: Remote
Compensation: Salary
Reviewed: Fri, Apr 17, 2026
This job expires in: 22 days
Job Summary
A company is looking for a Research Scientist, RL Training.
Key Responsibilities
- Research and implement reinforcement learning techniques and translate them into data products for training large language models
- Design and build data pipelines to generate high-quality training signals for reinforcement learning workflows
- Prototype end-to-end RL training recipes and collaborate with teams to translate research into customer-ready data products
Required Qualifications
- Deep expertise in reinforcement learning from human or AI feedback and reward modeling
- Experience training or fine-tuning large language models at scale, with knowledge of distributed training infrastructure
- Strong proficiency in Python and ML frameworks, particularly PyTorch and Hugging Face
- Solid software engineering fundamentals for building research prototypes
- Ph.D. in machine learning, reinforcement learning, or a related field strongly preferred; exceptional industry experience considered