Role overview
youre a hands-on ML engineer with 4-6 years of experience building and fine-tuning large language models (LLMs) and transformer-based models
Implement RLHF (Reinforcement Learning from Human Feedback) pipelines for model alignment and preference optimization . Design experiments for automated hyperparameter tuning,training strategies,and model selection .