Researchers pinpoint why larger language models pick up skills that small ones miss — Blankdot