What AI benchmarks miss about real-world performance — Blankdot