Role overview
About the team
Frontier AI models have the potential to benefit all of humanity, but also pose increasingly severe risks. To ensure that AI promotes positive change, we have dedicated a team to help us best prepare for the development of increasingly capable frontier AI models. This team, Preparedness, reports directly to our CTO and is tasked with identifying, tracking, and preparing for catastrophic risks related to frontier AI models.
Specifically, the mission of the Preparedness team is to:
- Closely monitor and predict the evolving capabilities of frontier AI systems, with an eye towards misuse risks whose impact could be catastrophic (not necessarily existential) to our society; and
- Ensure we have concrete procedures, infrastructure and partnerships to mitigate these risks and, more broadly, to safely handle the development of powerful AI systems.
Our team will tightly connect capability assessment, evaluations, and internal red teaming for frontier models, as well as overall coordination on AGI preparedness. The team’s core goal is to ensure that we have the infrastructure needed for the safety of highly-capable AI systems—from the models we develop in the near future to those with AGI-level capabilities.
About you
We are looking to hire an exceptional technical program manager that can push the boundaries of our frontier models. Specifically, we are looking for those that will help us shape our empirical grasp of the whole spectrum of AI safety concerns and will own individual threads within this endeavor end-to-end. We are running complex evaluations and looking for a strong TPM to help support them.
In this role, you will:
- Work on identifying emerging AI safety risks and new methodologies for exploring the impact of these risks
- Design evaluations of frontier AI models that assess the extent of identified risks, using the latest research on capability elicitation
- Perform statistical analyses on our frontier evaluations
- Collaborate with cross-functional teams within and outside of OpenAI to translate our technical findings into policy recommendations
- Contribute to the refinement of risk management and the overall development of "best practice" guidelines for AI safety evaluations
Ideal (but not necessarily hard-and-fast) background: