It is a commonplace #onhere that LLMs reflect the interests/biases of their training data.
But we also must recognize that LLMs don't exist in a vacuum.
They are trained on data provided by actors with interests/biases, and
they generate results for other actors who promulgate them according to their own interests/biases,
to audiences who have their own interests/biases.
My concern is not whether LLMs can or cannot give an "intelligent" perspective on reality, but rather, in the process of doing so, whose interests and biases are being promulgated, and whose are being served.
#LargeLanguageModels are increasingly trained on content created in part by #AI platforms like #ChatGPT. An endless photocopy of a photocopy. Is the future of this stuff inevitably hobbled by a digital Habsburg jaw?
LLMs such as GPT-4 have proven surprisingly successful for a wide range of tasks. We explore the potential of leveraging LLMs as simulators of biological systems.
This text-based simulation paradigm is well-suited for modeling & understanding complex living systems that are difficult to describe with physics-based first-principles simulations.
Researchers look at how AI (large language models or LLMs in particular) could change the nature of social science research.
Igor Grossmann et al. (2023) AI and the transformation of social science research. Science. 380: 1108 DOI: 10.1126/science.adi1778
Careful bias management and data fidelity are key https://www.science.org/doi/10.1126/science.adi1778
While I disagree with u/spez's actions, I understand his perspective. Reddit's most valuable asset is its curated text data for training Large Language Models like ChatGPT. Closing down the API protects that asset. He's likely betting that subreddit moderation will be solved with LLMs so the mods that generated that data are of little concern going forward. There will be only one chance to monetize this data asset.
The reason I disagree with u/spez's actions is that I don't believe this asset belongs to him. I'm sure he's protected from a legal perspective, but from a philosophical perspective, when a user writes an idea on an online forum, they don't forfeit ownership of that idea. People who contributed to Reddit even 12 months ago had no idea their thoughts would be monetized and consumed by LLMs. We need laws to protect people's data and to democratize data assets.
I also wish u/spez would just be honest about what he's doing. Telling possibly career-ending lies about developers and disregarding the mods that made Reddit is inexcusable. It's clear that Reddit has succeeded despite u/spez's leadership.
How come #transformer models aren't made to go back and change their answer as they work? If you ask a human to write something, they will very rarely just spit out an entire document word for word and be done. Most human work involves revising your own output as you go. If you prompt an #LLM to do this, you will get a better result, so why not build the model to do this from the get-go?
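The "prompt an LLM to revise its own output" idea above can be sketched as a simple draft-then-revise loop. This is a minimal illustration, not anyone's published method; `generate` is a hypothetical stand-in for whatever LLM completion call you actually use.

```python
def generate(prompt: str) -> str:
    # Placeholder for a real LLM call (e.g. an API client or a local model).
    # Here it just echoes a stub so the sketch is self-contained and runnable.
    return f"[model output for: {prompt[:40]}...]"

def draft_then_revise(task: str, passes: int = 2) -> str:
    """Ask for a first draft, then feed it back to the model for revision."""
    text = generate(f"Write a first draft: {task}")
    for _ in range(passes):
        # Each pass hands the previous output back as material to improve,
        # mimicking the human revise-as-you-work loop described above.
        text = generate(
            "Revise the following draft for clarity and correctness, "
            f"keeping the same content:\n\n{text}"
        )
    return text
```

The point of the sketch is that revision here lives outside the model, in the prompting loop, rather than in the architecture itself.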
The #bullshit regarding #quantum is reaching a fever pitch on #tiktok now. I suspect this is a coordinated campaign. But by whom? Who is paying to spin up quantum bullshit on tiktok of all places? Things are getting really weird. This is not the technology dystopia I planned for.
It makes me imagine an alternate plot for Terminator/War Games/The Forbin Project. What if the machine never actually works, but a cult forms around it who believe it works, and that cult seizes power? What if "skynet" was just a sock puppet for fascist generals and tech CEOs, who are the actual ones using machines to kill us all?
Your Wizard of Oz "man behind the curtain" alternate plot to Terminator is what we're living through right now - what with The #Singularity cult latching onto the #AGI possibilities suggested by #ChatGPT, when in reality these #LargeLanguageModels rely on a gig economy of low paid data labellers to even work at all.
Not the results many were expecting given all the #AIHype
Recall [Two Minute Papers]' gushing review of that "Sparks of AGI" paper assessing #GPT4, where at 5:20 https://youtu.be/wHiOKDlA8Ac?t=5m20s
it nailed an IMO question almost instantly
#GenerativeAI #LargeLanguageModels rely a lot on the human to do the reasoning for them, and even then #BingChat (Creative) has problems following the guidance. Notice I only specified the use of "unwieldy" and never required it to use "beard" or "weird", yet the #LLM got fixated on those instead.
Google says that Bard "is intended to…not replicate existing content at length" and that, if it does quote at length, it'll cite the page. But I wrote most of this Wikipedia entry, and I immediately recognized that Bard is copying it, word-for-word, at length. Sure, there's a footnote, but this is a straight-up duplicate of a webpage. https://en.wikipedia.org/wiki/Southwest_Mountains
@eyeinhand @waldoj
Trouble is, that's not how #LargeLanguageModels work. The token embeddings that a deep transformer network has been trained on do not encode the information source from which the token stream was derived https://youtu.be/rURRYI66E54
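A toy illustration of the point above: once text is tokenized, the token IDs carry no record of where the text came from. The vocabulary and "source" fields here are made up purely for demonstration.

```python
# Hypothetical three-word vocabulary; real tokenizers have tens of thousands
# of entries, but the provenance-loss point is the same.
vocab = {"the": 1, "cat": 2, "sat": 3}

def tokenize(text: str) -> list[int]:
    """Map whitespace-split words to integer token IDs."""
    return [vocab[w] for w in text.lower().split()]

# The same sentence scraped from two different sources yields identical
# token streams; the "source" field never enters the model's training input.
wiki = {"source": "wikipedia.org", "text": "the cat sat"}
blog = {"source": "someblog.example", "text": "The cat sat"}

assert tokenize(wiki["text"]) == tokenize(blog["text"])  # provenance is gone
```

Since only the token IDs reach the network, asking a trained model which page a passage came from is asking for information it was never given.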
Weighing in on that study that found #ChatGPT to be more empathetic than human physicians, @rebeccawatson finds the research lacked rigor - the authors themselves participated in the "blind" ratings, and whether the diagnosis was even correct didn't feature very highly in their assessment of quality.
I pointed #BingChat at that #ChatGPT vs human physicians empathy study and it still reassured me that #AI was not suitable for professional medical advice.
I then referred it to Mike Hansen's video : https://youtu.be/Gk8LQfAe6f8
where ChatGPT instantly nailed a diagnosis that had taken him and his team weeks to reach, and then Bing pretended to have watched the video (it probably just read the transcript) and "hallucinated" things that were not in the video.