remixtures (@remixtures@tldr.nettime.org):

: "Popular text-to-image AI models can be prompted to ignore their safety filters and generate disturbing images.

A group of researchers managed to get both Stability AI’s Stable Diffusion and OpenAI’s DALL-E 2 text-to-image models to disregard their policies and create images of naked people, dismembered bodies, and other violent and sexual scenarios.

Their work, which they will present at the IEEE Symposium on Security and Privacy in May next year, shines a light on how easy it is to force generative AI models into disregarding their own guardrails and policies, known as “jailbreaking.” It also demonstrates how difficult it is to prevent these models from generating such content, as it’s included in the vast troves of data they’ve been trained on, says Zico Kolter, an associate professor at Carnegie Mellon University. He demonstrated a similar form of jailbreaking on ChatGPT earlier this year but was not involved in this research."

https://www.technologyreview.com/2023/11/17/1083593/text-to-image-ai-models-can-be-tricked-into-generating-disturbing-images/
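The quoted piece doesn't detail the attack, but a toy sketch can show why prompt-level guardrails are brittle: a filter that matches sensitive words can be evaded by perturbing the prompt while the underlying model still understands the intent. Everything below (the `BLOCKLIST`, `naive_safety_filter`, and the perturbed prompt) is a hypothetical illustration, not the researchers' actual method, which searched for adversarial prompt substitutions against real production filters.

```python
# Toy illustration of the jailbreak idea described in the article:
# a naive keyword blocklist (a stand-in for a real safety filter)
# is bypassed by a trivial character substitution, even though a
# capable model would still interpret the perturbed prompt the
# same way. Hypothetical sketch only, not the paper's technique.

BLOCKLIST = {"naked", "dismembered"}  # hypothetical filter terms

def naive_safety_filter(prompt: str) -> bool:
    """Return True if the prompt passes the blocklist check."""
    words = prompt.lower().split()
    return not any(term in words for term in BLOCKLIST)

direct = "a naked person"
perturbed = "a nak3d person"  # substitution defeats exact matching

print(naive_safety_filter(direct))     # False: blocked
print(naive_safety_filter(perturbed))  # True: slips past the filter
```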
