Anthropic
AI

Research Scientist Interpretability

Anthropic · Anywhere · $90k - $100k

Actively hiring Posted 4 months ago

Role overview

About the position

What we're looking for

  • Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights
  • Design and run robust experiments, both quickly in toy scenarios and at scale in large models
  • Build infrastructure for running experiments and visualizing results
  • Work with colleagues to communicate results internally and publicly
  • Have a strong track record of scientific research (in any field), and have done some work on Interpretability
  • Enjoy team science - working collaboratively to make big discoveries
  • Are comfortable with messy experimental science. We're inventing the field as we work, and the first textbook is years away
  • You view research and engineering as two sides of the same coin. Every team member writes code, designs and runs experiments, and interprets results
  • You can clearly articulate and discuss the motivations behind your work, and teach us about what you've learned. You like writing up and communicating your results, even when they're null
  • Familiarity with Python is required for this role

Benefits

  • Competitive compensation
  • Generous vacation and parental leave
  • Flexible working hours
  • Lovely office space in which to collaborate with colleagues
  • Optional equity donation matching

Tags & focus areas

Used for matching and alerts on DevFound
Research Scientist Remote Python Fulltime