#LLMs - kbin.social

metin, 26 days ago (edited 26 days ago) to ai

Interesting article, discussing the status of OpenAI and the recent resignations of Ilya Sutskever and Jan Leike…

https://www.vox.com/future-perfect/2024/5/17/24158478/openai-departures-sam-altman-employees-chatgpt-release

#AI #ArtificialIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #tech #technology #OpenAI #SamAltman #BigTech #article #read #reading #news

Lazarou, 26 days ago to stackoverflow

This just makes me want to delete everything of mine on corporate social media, and I pretty much have tbh

#StackOverflow #AI #LLMs

LChoshen, 27 days ago to llm

Do LLMs learn foundational concepts required to build world models? (less than expected)

We address this question with 🌐🐨EWoK (Elements of World Knowledge)🐨🌐

a flexible cognition-inspired framework to test knowledge across physical and social domains

https://ewok-core.github.io

#llm #llms #evaluation #ml #machinelearning

metin, 27 days ago to ai

AI… There's no escaping the scraping.

#AI #ArtificialIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #tech #technology #BigTech #Google #Microsoft #OpenAI #Meta #Apple

ai6yr, 27 days ago to ai

Giant sucking sounds from over there on Reddit https://www.bbc.com/news/articles/cxe92v47850o #AI #LLMs #reddit #openai

remixtures, 28 days ago to ai Portuguese

#AI #GenerativeAI #LLMs #ParetoCurves: "Which is the most accurate AI system for generating code? Surprisingly, there isn’t currently a good way to answer questions like these.

Based on HumanEval, a widely used benchmark for code generation, the most accurate publicly available system is LDB (short for LLM debugger). But there’s a catch. The most accurate generative AI systems, including LDB, tend to be agents, which repeatedly invoke language models like GPT-4. That means they can be orders of magnitude more costly to run than the models themselves (which are already pretty costly). If we eke out a 2% accuracy improvement for 100x the cost, is that really better?

In this post, we argue that:

AI agent accuracy measurements that don’t control for cost aren’t useful.

Pareto curves can help visualize the accuracy-cost tradeoff.

Current state-of-the-art agent architectures are complex and costly but no more accurate than extremely simple baseline agents that cost 50x less in some cases.

Proxies for cost such as parameter count are misleading if the goal is to identify the best system for a given task. We should directly measure dollar costs instead.

Published agent evaluations are difficult to reproduce because of a lack of standardization and questionable, undocumented evaluation methods in some cases."

https://www.aisnakeoil.com/p/ai-leaderboards-are-no-longer-useful
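
As a rough illustration of the accuracy-cost tradeoff described in the quote above, here is a minimal sketch of how a Pareto frontier could be computed from (cost, accuracy) pairs. The agent names and all numbers are made up for illustration and do not come from the linked article; `Agent` and `pareto_frontier` are hypothetical names.

```python
from typing import NamedTuple


class Agent(NamedTuple):
    name: str
    cost_usd: float  # hypothetical dollar cost of one benchmark run
    accuracy: float  # hypothetical fraction of tasks solved


def pareto_frontier(agents: list[Agent]) -> list[Agent]:
    """Keep only agents that no cheaper-or-equal agent matches or beats on accuracy."""
    frontier = []
    best_accuracy = float("-inf")
    # Walk from cheapest to most expensive; on equal cost, try the more accurate one first.
    for agent in sorted(agents, key=lambda a: (a.cost_usd, -a.accuracy)):
        # An agent earns a place only if it improves on every cheaper option.
        if agent.accuracy > best_accuracy:
            frontier.append(agent)
            best_accuracy = agent.accuracy
    return frontier


# Made-up numbers, for illustration only.
agents = [
    Agent("simple_baseline", 1.0, 0.90),
    Agent("retry_baseline", 5.0, 0.93),
    Agent("complex_agent", 50.0, 0.92),
    Agent("debugger_agent", 100.0, 0.95),
]

for agent in pareto_frontier(agents):
    print(f"{agent.name}: ${agent.cost_usd:.2f}, accuracy {agent.accuracy:.2f}")
```

In this toy data, `complex_agent` drops off the frontier because `retry_baseline` is both cheaper and more accurate. That is the kind of comparison an accuracy-only leaderboard hides.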

leanpub, 28 days ago to ai

AI for Efficient Programming: Harnessing the Power of Large Language Models http://leanpub.com/courses/fredhutch/ai_for_software is the featured online course on the Leanpub homepage! https://leanpub.com #AI #courses #programming #LLMs

doctorambient, 28 days ago to ai

"The biggest question raised by a future populated by unexceptional A.I., however, is existential. Should we as a society be investing tens of billions of dollars, our precious electricity that could be used toward moving away from fossil fuels, and a generation of the brightest math and science minds on incremental improvements in mediocre email writing?" (From an NYT article. See original thread.)

#ai #llm #llms #chatbot #chatbots

@peter https://thepit.social/@peter/112445916259675495

AccordionGuy, 29 days ago to ai

Do you REALLY want to get a feel for how GPT-4o does what it does? Just complete this poem — by doing so, you’ll have performed a computation similar to the one it does when you feed it a text-plus-image prompt.

#AI #ArtificialIntelligence #LLM #LLMs #LargeLanguageModel #LargeLanguageModels

https://www.globalnerdy.com/2024/05/15/the-simplest-way-to-illustrate-how-gpt-4o-works/

iammannyj, 29 days ago to opensource

IBM open-sources its Granite AI models - and they mean business

Many companies claim to have open-sourced their LLMs, but IBM actually did it.

https://www.zdnet.com/article/ibm-open-sources-its-granite-ai-models-and-they-mean-business/

#OpenSource #IBM #LLMs

ai6yr, 29 days ago to ai

IEEE Spectrum: telling it like it is. #ai #jobs

ai6yr, 29 days ago

Alas, they wimped out and changed the title online, probably after a bunch of tech-bros or engineers-in-love-with-AI complained. #Ai #llms

tayarndt, 29 days ago to LLMs

Live, talking about the dev keynote for Google. Watch here: https://www.youtube.com/watch?v=6F7hZYygx1c https://www.twitch.tv/tayarndt #YouTube_Number_i #LLMs #Twitch #programmer

CatherineFlick, 30 days ago to LLMs

Just FYI, if you have older parents or other family members, set up some sort of shibboleth with them so they know what to ask you if you ever call them asking for something. These new generative models are going to be extremely convincing, and the idiots in charge of these companies think they can use guardrails to stop it being used inappropriately. They can't. #genAI #LLMs #chatgpt

ai6yr, 30 days ago to random

OpenAI or Science Fiction Movie?

https://www.youtube.com/watch?v=ne6p6MfLBxc

ai6yr, 30 days ago

On the one hand, the technology advances!

On the other hand: it'll burn down the planet faster with all that energy use

Also, the complete decline of civilization itself, if you think Futurama had it right here: https://www.youtube.com/watch?v=IrrADTN-dvg

#ai #openai #llms #dating

metin, 30 days ago to ai

Modern times summarized in one headline...

This AI software is helping schools detect guns

https://www.fastcompany.com/91123682/schools-detect-guns-ai-software-zeroeyes-kansas

#AI #ArtificialIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #guns #school #schools #software

vicki, 1 month ago to LLMs

The most interesting stuff in #LLMs right now (to me) is:

figuring out how to do it small

figuring out how to do it on CPU

figuring out how to do it well for specific tasks

Seirdy, 1 month ago to react

New bookmark: React, Electron, and LLMs have a common purpose: the labour arbitrage theory of dev tool popularity.

“React and the component model standardises the software developer and reduces their individual bargaining power excluding them from a proportional share in the gains”. An amazing write-up by @baldur about the de-skilling of developers to reduce their ability to fight back against their employers.

Originally posted on seirdy.one: See Original (POSSE). #GenAI #llms #webdev

ceoln, 1 month ago to Bitcoin

I feel like it would be very consistent if the next thing after #Bitcoin and The #Blockchain and #NFTs and #LLMs turned out to be #Neom.

I don't know if it will actually attract and support tons of scams and media bros and think pieces, but if it did it would feel right somehow.

changelog, 1 month ago to LLMs

💥 New episode of Changelog & Friends!

🎙️ with @anniesexton

🎧 https://changelog.com/friends/43

#career #llms #culture #podcast

happyborg, 1 month ago to ai

The first thing we taught #AI is how to lie convincingly.

WTF could go wrong and who TF decided this was a good way to start?

#LLMs

smach, 1 month ago to LLMs

“The general problem of mixing data with commands is at the root of many of our computer security vulnerabilities.” Great explainer by security researcher Bruce Schneier on why large language models may not be a great choice for tasks like processing your emails.
https://cacm.acm.org/opinion/llms-data-control-path-insecurity/

#GenAI #LLMs #InfoSec

phryk, 1 month ago to LLMs

#LLMs be like

Scene from Adventure Time with the demon cat. Demon Cat: Greetings, Frank the human boy. Finn, astonished: How did you almost know my name? Demon Cat, smug: I have approximate knowledge of many things.

sohkamyung, 1 month ago to singapore

"When the Singaporean government asked local writers if they would agree to having their work used to train a large language model, it probably did not expect the country’s tiny literary community to react so fiercely."

https://restofworld.org/2024/singapore-writers-reject-ai-training/

#Singapore #AI #LLMs #Writers #Literature #Culture #Society

KathyReid, 1 month ago to stackoverflow

I just issued a data deletion request to #StackOverflow to erase all of the associations between my name and the questions, answers and comments I have on the platform.

One of the key ways in which #RAG works to supplement #LLMs is based on proven associations. Higher ranked Stack Overflow members' answers will carry more weight in any #LLM that is produced.

By asking for my name to be disassociated from the textual data, it removes a semantic relationship that is helpful for determining which tokens of text to use in an #LLM.

If you sell out your user base without consultation, expect a backlash.

ai6yr, 1 month ago to LLMs

"Stack Overflow bans users en masse for rebelling against OpenAI partnership — users banned for deleting answers to prevent them being used to train ChatGPT" https://www.tomshardware.com/tech-industry/artificial-intelligence/stack-overflow-bans-users-en-masse-for-rebelling-against-openai-partnership-users-banned-for-deleting-answers-to-prevent-them-being-used-to-train-chatgpt

#llms #ai #chatgpt #allyourbasearebelongtous
