Evaluating chain-of-thought monitorability — Blankdot