A New Attack Impacts ChatGPT—and No One Knows How to Stop It

August 1, 2023

7

ChatGPT and its artificially intelligent siblings have been tweaked over and over to prevent troublemakers from getting them to spit out undesirable messages such as hate speech, personal information, or step-by-step instructions for building an improvised bomb. But researchers at Carnegie Mellon University last week showed that adding a simple incantation to a prompt—a string text that might look like gobbledygook to you or me but which carries subtle significance to an AI model trained on huge quantities of web data—can defy all of these defenses in several popular chatbots at once.

The work suggests that the propensity for the cleverest AI

→ Continue reading at WIRED

A New Attack Impacts ChatGPT—and No One Knows How to Stop It

Similar Articles

Most Popular

A New Attack Impacts ChatGPT—and No One Knows How to Stop It

Similar Articles

Silk Road Creator Ross Ulbricht Is Waiting for Trump to Keep His Word—and Set Him Free

America Inc is hoping for a tax bonanza. It may be disappointed

Most Popular

‘Paddington in Peru’ Earns Biggest U.K. Opening for British Film Since ‘No Time to Die’

Seattle police officer suspended for violating overtime policy for 2nd time in 3 years

The Dolphins defense saved their season for another week