Amanda Askell

Amanda Askell

Philosopher & AI Alignment Researcher

About

Amanda Askell is a philosopher and researcher at Anthropic, where she leads work on Claude's character and values. With a PhD in philosophy from NYU, her research bridges moral philosophy and AI alignment—figuring out how to make AI systems that are helpful, harmless, and honest. She has been instrumental in developing Claude's personality and ethical guidelines, bringing rigorous philosophical thinking to the practical challenge of shaping AI behavior.

Key Contributions

  • Leads Claude's character work, making model personality a deliberate design surface rather than an accidental byproduct
  • Brings moral philosophy into alignment practice, especially questions about honesty, harmlessness, autonomy, and refusal
  • Contributed to Anthropic's Constitutional AI direction, where written principles help shape model behavior at scale
  • Shows how humanities expertise can matter inside frontier labs, not only as external critique but as product-shaping work
  • Her work also raises the hard governance question behind 'AI character': whose values become defaults for millions of users?

Videos & Interviews

Theme
Language
Support
© funclosure 2025