Just ran into this entertaining and accessible explainer by #LiveOverflow about... - Random - kbin.social

db, 1 year ago

Just ran into this entertaining and accessible explainer by #LiveOverflow about why large language models like #ChatGPT sometimes 'misbehave' and present output to the user that they're not supposed to see.

Long story short: both the system's 'filters' and the user input are presented as one big prompt to the model, which means you can influence the filters.

"Accidental LLM Backdoor - Prompt Tricks" by LiveOverflow: https://youtu.be/h74oXb4Kk8k

#ai #artificialintelligence #llm #nevertrustuserinput

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ HistoPol

Image

Image alternative text

Federation

Status:

Instances:

/m/random

Threads (61377)

Microblog (4231105)

Thread

db

@db@bla.daanberg.net

Added: 1 year ago
Online: -
Boosts: 1

Magazine

Random

@random@kbin.social

"Random" is the place where all the content from the Fediverse that couldn't be classified into any other magazine ends up.

Created: 1 year ago
Owner: ernest
Subscribers: 4349
Online: -

Threads 61377
Comments 48968
Posts 4231105
Replies 5381148
Moderators 1
Moderation log 18

Moderators

ernest

Active people

Related posts

TIL the original Zamboni machine was built on a Jeep, and was popularized by a touring ice skater who happened to witness it: Sonja Henie....

Show more

7 months ago to Trivia

Being #creative is simply...

Show more

6 months ago to creative

Another random video ur welcome :D...

Show more

8 months ago to instagramreality

I hate how laggy this page is, but I guess this is the price of going more independent. Same as crypto vs paypal....

Show more

1 year ago to ethfinance

Related threads

Combat GoPro | Western Volunteers Dodge Artillery (Early-War Footage) (Youtube, 2:42)

Show more

11 months ago to Ukraine

Minnesota Vikings Mini Camp: Danielle Hunter’s Holdout, Kirk Cousins’ Potential Extension, and MORE!

Show more

11 months ago to NFL

A relatively easy way to disable the "random posts" or "random threads" sections (desktop web browser) + suggestions for sidebar

Show more

10 months ago to kbinMeta

Tim Caine (Creator of Fallout) discusses the Challenges Facing Game Artists (Youtube, 11:27)

Show more

11 months ago to gaming

Support Us