cassidy, to ChatGPT
@cassidy@blaede.family avatar

I was curious if a niche blog post of mine had been slurped up by so I asked a leading question—what I discovered is much worse. So far, it has told me:

• use apt-get on Endless OS
• preview a Jekyll site locally by opening files w/a web browser (w/o building)
• install several non-existent “packages” & extensions

It feels exactly like chatting w/someone talking out of their ass but trying to sound authoritative. need to learn to say, “I don’t know.”

ids1024,
@ids1024@fosstodon.org avatar

@cassidy " need to learn to say, 'I don’t know.'"

Doing that properly might require... something that isn't an LLM. I'd say the LLM generates something that (statistically) looks like an answer, because that's what its trained to do.

Actually modeling some understanding of truth and knowledge might be a different and more difficult task than modeling language.

fizise, to LLMs
@fizise@sigmoid.social avatar

Nice example of how important emphasis can be for language understanding. Depending on which word in the sentence below is emphasized, it completely changes its meaning.
For #LLMs (and for our #ise2024 lecture) this means that learning to understand language purely from written text is probably not an "easy" task....

Picture from Brian Sacash, via LinkedIn, cf. https://www.linkedin.com/feed/update/urn:li:activity:7195767258848067584/

#nlp #languagemodel #computationallinguistics @sourisnumerique @enorouzi @shufan @lysander07

scottjenson, to LLMs
@scottjenson@social.coop avatar

Saying "LLMs will eventually do every job" is a bit like:

  1. Seeing Wifi wireless data
  2. Then predicting "Wireless" Power saws (no electrical cord or battery) are just around the corner

It's a misapplication of the tech. You need to understand how work and extrapolate that capability. It's all text people. Summarizing, collating, template matching. All fair game. But stray outside of that box and things get much harder.

scottjenson, to Figma
@scottjenson@social.coop avatar

I just tried a few AI plugins for and they were all bad. This domain might be a great test for . I predict these failings are unlikely to be fixed any time soon:

  • Layout was poor
  • They can't create components
  • Laughably complex object hierarchies (everything was enclosed in a frame)

Of course things will improve, but I expect fixing these deep structural problems are a function of many new constraints, likely beyond what today's LLMs are actually capable of. @simon ?

scottjenson,
@scottjenson@social.coop avatar

@simon my point being there are limits as to what #LLMs can do:

Structural
There is no clear API to "genAI" components

Training
There is very little training data on how to create a clean Figma object structure

These may be solved, eventually, but they also are likely quite different from the chat based solution patterns offered today. My concern is that it's much harder than boosters believe.

kubikpixel, to gentoo
@kubikpixel@chaos.social avatar

Gentoo and NetBSD ban 'AI' code, but Debian doesn't – yet

The problem isn't just that LLM-bot generated code is bad – it's where it came from.

🐧 https://www.theregister.com/2024/05/18/distros_ai_code/


#gentoo #netbsd #debian #ai #llm #LLMs #bsd #linux #opensource #oss #bot #it

metin, (edited ) to ai
@metin@graphics.social avatar
Lazarou, to stackoverflow
@Lazarou@mastodon.social avatar

This just makes me want to delete everything of mine on corporate social media, and I pretty much have tbh

LChoshen, to llm
@LChoshen@sigmoid.social avatar

Do LLMs learn foundational concepts required to build world models? (less than expected)

We address this question with 🌐🐨EWoK (Elements of World Knowledge)🐨🌐

a flexible cognition-inspired framework to test knowledge across physical and social domains

https://ewok-core.github.io

metin, to ai
@metin@graphics.social avatar
ai6yr, to ai

Giant sucking sounds from over there on Reddit https://www.bbc.com/news/articles/cxe92v47850o #AI #LLMs #reddit #openai

remixtures, to ai Portuguese
@remixtures@tldr.nettime.org avatar

#AI #GenerativeAI #LLMs #ParetoCurves: "Which is the most accurate AI system for generating code? Surprisingly, there isn’t currently a good way to answer questions like these.

Based on HumanEval, a widely used benchmark for code generation, the most accurate publicly available system is LDB (short for LLM debugger).1 But there’s a catch. The most accurate generative AI systems, including LDB, tend to be agents,2 which repeatedly invoke language models like GPT-4. That means they can be orders of magnitude more costly to run than the models themselves (which are already pretty costly). If we eke out a 2% accuracy improvement for 100x the cost, is that really better?

In this post, we argue that:

  • AI agent accuracy measurements that don’t control for cost aren’t useful.

  • Pareto curves can help visualize the accuracy-cost tradeoff.

  • Current state-of-the-art agent architectures are complex and costly but no more accurate than extremely simple baseline agents that cost 50x less in some cases.

  • Proxies for cost such as parameter count are misleading if the goal is to identify the best system for a given task. We should directly measure dollar costs instead.

  • Published agent evaluations are difficult to reproduce because of a lack of standardization and questionable, undocumented evaluation methods in some cases."

https://www.aisnakeoil.com/p/ai-leaderboards-are-no-longer-useful

leanpub, to ai
@leanpub@mastodon.social avatar

AI for Efficient Programming: Harnessing the Power of Large Language Models http://leanpub.com/courses/fredhutch/ai_for_software is the featured online course on the Leanpub homepage! https://leanpub.com

doctorambient, to ai
@doctorambient@mastodon.social avatar

"The biggest question raised by a future populated by unexceptional A.I., however, is existential. Should we as a society be investing tens of billions of dollars, our precious electricity that could be used toward moving away from fossil fuels, and a generation of the brightest math and science minds on incremental improvements in mediocre email writing?" (From an NYT article. See original thread.)

@peter https://thepit.social/@peter/112445916259675495

AccordionGuy, to ai
@AccordionGuy@mastodon.cloud avatar

Do you REALLY want to get a feel for how GPT-4o does what it does? Just complete this poem — by doing so, you’ll have performed a computation similar to the one it does when you feed it a text-plus-image prompt.

https://www.globalnerdy.com/2024/05/15/the-simplest-way-to-illustrate-how-gpt-4o-works/

iammannyj, to opensource
@iammannyj@fosstodon.org avatar

IBM open-sources its Granite AI models - and they mean business

Many companies claim to have open-sourced their LLMs, but IBM actually did it.

https://www.zdnet.com/article/ibm-open-sources-its-granite-ai-models-and-they-mean-business/

ai6yr, to ai

IEEE Spectrum: telling it like it is.

ai6yr,

Alas, they wimped out and changed the title online, probably after a bunch of tech-bros or engineers-in-love-with-AI complained. #Ai #llms

tayarndt, to LLMs
@tayarndt@techopolis.social avatar
CatherineFlick, to LLMs
@CatherineFlick@mastodon.me.uk avatar

Just FYI, if you have older parents or other family members, set up some sort of shibboleth with them so they know what to ask you if you ever call them asking for something. These new generative models are going to be extremely convincing, and the idiots in charge of these companies think they can use guardrails to stop it being used inappropriately. They can't.

ai6yr, to random

OpenAI or Science Fiction Movie?

https://www.youtube.com/watch?v=ne6p6MfLBxc

ai6yr,

On one hand, the technology advance!

On the other hand: it'll burn down the planet faster with all that energy use

Also, the complete decline of civilization itself, if you think Futurama had it right here: https://www.youtube.com/watch?v=IrrADTN-dvg

metin, to ai
@metin@graphics.social avatar
  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • everett
  • rosin
  • Youngstown
  • ngwrru68w68
  • khanakhh
  • slotface
  • InstantRegret
  • mdbf
  • GTA5RPClips
  • kavyap
  • thenastyranch
  • DreamBathrooms
  • magazineikmin
  • anitta
  • tacticalgear
  • tester
  • Durango
  • cubers
  • ethstaker
  • cisconetworking
  • modclub
  • osvaldo12
  • Leos
  • normalnudes
  • megavids
  • provamag3
  • lostlight
  • All magazines