Philosophical insights for AI alignment
Askell is a philosophical inquiry tool focused on fine-tuning and AI alignment, developed by a team at Anthropic. It aims to improve the honesty and character traits of AI models through fine-tuning, and it is designed so that these interventions scale to more capable models, reflecting a commitment to AI safety. Its creator is a philosopher with a PhD from NYU and a background in ethics, decision theory, and formal epistemology. The creator has also pledged to donate a significant portion of their income to charities that address global poverty.