Research Scientist, Interpretability

Anthropic · Anywhere · $90k - $100k

Actively hiring · Posted 6 months ago

Role overview

About the position

Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights
Design and run robust experiments, both quickly in toy scenarios and at scale in large models
Build infrastructure for running experiments and visualizing results
Work with colleagues to communicate results internally and publicly

You have a strong track record of scientific research (in any field) and have done some work on interpretability
You enjoy team science: working collaboratively to make big discoveries
You are comfortable with messy experimental science. We're inventing the field as we work, and the first textbook is years away
You view research and engineering as two sides of the same coin. Every team member writes code, designs and runs experiments, and interprets results
You can clearly articulate and discuss the motivations behind your work, and teach us about what you've learned. You like writing up and communicating your results, even when they're null
Familiarity with Python is required for this role


Research Scientist · Remote · Python · Full-time