Role overview
**Benefits:**
* 401(k) matching
* Dental insurance
* Health insurance
What you'll work on
* Design and engineer effective prompts to guide LLMs in generating accurate, relevant, and context-aware outputs.
* Conduct prompt experiments and A/B tests to evaluate model performance and optimize results.
* Collaborate with AI researchers, data scientists, and product teams to develop prompt libraries and evaluation datasets.
* Generate complex, edge-case queries to test reasoning, factuality, and robustness of AI models.
* Evaluate and annotate AI-generated responses for accuracy, tone, bias, and hallucination detection.
* Document prompt structures, test results, and performance improvements.
* Work with multi-modal inputs (text, image, video) when applicable.
This is a remote position.