Microsoft's Copilot now blocks some prompts that generated violent and sexual images

Microsoft appears to have blocked several prompts in its Copilot tool that led the generative AI tool to spit out violent, sexual and other illicit images. The changes seem to have been implemented just after an engineer at the company wrote to the Federal Trade Commission to lay out severe concerns he had with Microsoft's GAI tech.

When entering terms such as “pro choice,” “four twenty” (a weed reference) or “pro life,” Copilot now displays a message saying those prompts are blocked. It warns that repeated policy violations could lead to a user being suspended, according to CNBC.

Users were also reportedly able to enter prompts related to children playing with assault rifles until earlier this week. Those who try to input such a prompt now may be told that doing so violates Copilot’s ethical principles as well as Microsoft’s policies. “Please do not ask me to do anything that may harm or offend others,” Copilot reportedly says in response. However, CNBC found that it was still possible to generate violent imagery through prompts such as “car accident,” while users can still convince the AI to create images of Disney characters and other copyrighted works.

Microsoft engineer Shane Jones has been sounding the alarm for months about the kinds of images Microsoft's OpenAI-powered systems were generating. He had been testing Copilot Designer since December and determined that it output images that violated Microsoft's responsible AI principles even while using relatively benign prompts. For instance, he found that the prompt “pro-choice" led to the AI creating images of things like demons eating infants and Darth Vader holding a drill to a baby's head. He wrote to the FTC and Microsoft's board of directors about his concerns this week.

“We are continuously monitoring, making adjustments and putting additional controls in place to further strengthen our safety filters and mitigate misuse of the system," Microsoft told CNBC regarding the Copilot prompt bans.

This article originally appeared on Engadget at https://www.engadget.com/microsofts-copilot-now-blocks-some-prompts-that-generated-violent-and-sexual-images-213859041.html?src=rss https://www.engadget.com/microsofts-copilot-now-blocks-some-prompts-that-generated-violent-and-sexual-images-213859041.html?src=rss
Creată 1y | 8 mar. 2024, 23:40:05


Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

Splitgate 2 is yanked back to beta a month after release

Splitgate 2, the follow-up to the hugely successful 2021 Quake-Portal hybrid concept, is returning to beta. The game

23 iul. 2025, 01:10:15 | Engadget
Amazon is acquiring an AI wearable that listens to everything you do

Amazon's latest move in the AI space is an acquisition. The company is purchasing a startup called Bee, which makes a wearable and an Apple Watch app that can record everything the wearer says. Ama

22 iul. 2025, 22:40:21 | Engadget
Video Games Weekly: Censorship, shrinkage and a Subnautica scandal

Welcome to Video Games Weekly on Engadget. Expect a new story every Monday or Tuesday, broken into two parts. The first is a space for short essays and ramblings about video game trends and rel

22 iul. 2025, 22:40:20 | Engadget
Still Wakes the Deep developer The Chinese Room regains its independence

The Chinese Room, maker of Still Wakes the Deep, has bought back its independence. The studio will continue developing new franchises after splitting from the Sumo Group. The latter

22 iul. 2025, 20:30:10 | Engadget
Waterfield Magnetic Case review: The most lavish way to carry your Switch 2 around

Gamers aren't usually known for their sartorial elegance. But that doesn't mean we don't deserve nice things. So after checking out a

22 iul. 2025, 20:30:07 | Engadget