
Sitting alongside Pope Leo XIV at the launch of Magnifica humanitas, Anthropic’s interpretability lead conceded that frontier-lab incentives can pull researchers away from doing the right thing. Christopher Olah, an Anthropic co-founder and the head of its interpretability research, used his seat…