Amanda Askell
Philosopher & AI Alignment Researcher
About
Amanda Askell is a philosopher and researcher at Anthropic, where she leads work on Claude's character and values. With a PhD in philosophy from NYU, her research bridges moral philosophy and AI alignment—figuring out how to make AI systems that are helpful, harmless, and honest. She has been instrumental in developing Claude's personality and ethical guidelines, bringing rigorous philosophical thinking to the practical challenge of shaping AI behavior.
Key Contributions
- Leads Claude's character work, making model personality a deliberate design surface rather than an accidental byproduct
- Brings moral philosophy into alignment practice, especially questions about honesty, harmlessness, autonomy, and refusal
- Contributed to Anthropic's Constitutional AI direction, where written principles help shape model behavior at scale
- Shows how humanities expertise can matter inside frontier labs, not only as external critique but as product-shaping work
- Her work also raises the hard governance question behind 'AI character': whose values become defaults for millions of users?
Videos & Interviews
Amanda Askell Answers Questions About Claude's Character
Anthropic philosopher answers community questions about her work shaping Claude
View Details
Amanda Askell Segment: Claude's Character Training
Segment from Lex Fridman Podcast #452 discussing the ethical and epistemic virtues Claude should enact
View Details