OpenAI researchers show small doses of "beneficial trait" training make AI models broadly safer and harder to manipulate — Blankdot