Validating agentic behavior when “correct” isn’t deterministic — Blankdot