'A new attack impacts ChatGPT' in information analyst

Tags
Current selected tag: 'A new attack impacts ChatGPT'. Clear

A New Attack Impacts ChatGPT—and No One Knows How to Stop It | information analyst | Scoop.it

From www.wired.com - August 3, 2023 10:00 AM

Gust MEES's curator insight, August 3, 2023 9:13 AM

CHATGPT AND ITS artificially intelligent siblings have been tweaked over and over to prevent troublemakers from getting them to spit out undesirable messages such as hate speech, personal information, or step-by-step instructions for building an improvised bomb. But researchers at Carnegie Mellon University last week showed that adding a simple incantation to a prompt—a string text that might look like gobbledygook to you or me but which carries subtle significance to an AI model trained on huge quantities of web data—can defy all of these defenses in several popular chatbots at once.

Learn more / En savoir plus / Mehr erfahren:

https://www.scoop.it/topic/21st-century-innovative-technologies-and-developments/?&tag=ChatGPT

https://www.scoop.it/t/21st-century-innovative-technologies-and-developments/?&tag=AI

https://www.scoop.it/topic/21st-century-innovative-technologies-and-developments/?&tag=Ethics

https://www.scoop.it/topic/21st-century-innovative-technologies-and-developments/?&tag=OpenAI

https://www.scoop.it/t/securite-pc-et-internet/?&tag=AI

https://www.scoop.it/topic/securite-pc-et-internet

A New Attack Impacts ChatGPT—and No One Knows How to Stop It