Role overview
A focused opportunity to apply deep ML expertise to the evaluation of next-generation AI systems, with fully remote work and flexible engagement.
What you'll work on
- Design and write detailed evaluation suites for machine learning engineering tasks
- Assess AI-generated solutions across model training, debugging, optimization, and experimentation
- Translate real-world ML research and engineering workflows into structured benchmarks
- Review and validate technical outputs for correctness, robustness, and clarity
Tags & focus areas
Used for matching and alerts on DevFound Parttime Remote Ai Machine Learning