Large Language Models’ Emergent Abilities Are a Mirage

March 24, 2024

1

The original version of this story appeared in Quanta Magazine.

Two years ago, in a project called the Beyond the Imitation Game benchmark, or BIG-bench, 450 researchers compiled a list of 204 tasks designed to test the capabilities of large language models, which power chatbots like ChatGPT. On most tasks, performance improved predictably and smoothly as the models scaled up—the larger the model, the better it got. But with other tasks, the jump in ability wasn’t smooth. The performance remained near zero for a while, then performance jumped. Other studies found similar leaps in ability.

The authors described this as “breakthrough” behavior; other researchers have likened

→ Continue reading at WIRED

Large Language Models’ Emergent Abilities Are a Mirage

Similar Articles

Most Popular

Large Language Models’ Emergent Abilities Are a Mirage

Similar Articles

Programming in Assembly Is Brutal, Beautiful, and Maybe Even a Path to Better AI

New Rules Could Force Tesla to Redesign Its Door Handles. That’s Harder Than It Sounds

Most Popular

The first products with Apple’s M5 chip could make their debut this week

Jim Furyk is as much into sports rivalries as anyone else

Today’s best iPad deals include the iPad A16 for $279