i’m very excited about the interpretability work that #anthropic has been doing with #LLMs.
in this paper, they used dictionary learning, a classical machine learning technique, to discover concepts: if a concept like “golden gate bridge” is present in the text, they can identify the associated pattern of neuron activations.
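(for the curious, here's a minimal sketch of that idea: a sparse autoencoder doing dictionary learning over a model's internal activations. none of this is anthropic's actual code; the dimensions, feature count, and training data are placeholders made up for illustration.)

```python
# minimal sparse-autoencoder sketch for dictionary learning over activations.
# everything here is illustrative: d_model, n_features, and the "activations"
# tensor are placeholders, not values from the paper.
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    def __init__(self, d_model: int, n_features: int):
        super().__init__()
        self.encoder = nn.Linear(d_model, n_features)
        self.decoder = nn.Linear(n_features, d_model)

    def forward(self, x: torch.Tensor):
        f = torch.relu(self.encoder(x))  # sparse feature activations
        x_hat = self.decoder(f)          # reconstruction of the input activation
        return x_hat, f

def loss_fn(x, x_hat, f, l1_coeff: float = 1e-3):
    # reconstruction error plus an L1 penalty pushing features toward sparsity,
    # so each feature ideally ends up corresponding to one concept
    return ((x - x_hat) ** 2).mean() + l1_coeff * f.abs().mean()

sae = SparseAutoencoder(d_model=512, n_features=4096)
opt = torch.optim.Adam(sae.parameters(), lr=1e-4)

# stand-in for real residual-stream activations captured from an LLM
activations = torch.randn(1024, 512)

for step in range(100):
    x_hat, f = sae(activations)
    loss = loss_fn(activations, x_hat, f)
    opt.zero_grad()
    loss.backward()
    opt.step()
```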
this means that you can monitor LLM responses for concepts and behaviors, like “illicit behavior” or “fart jokes”
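(a toy version of what that monitoring could look like, reusing the SparseAutoencoder sketch above. the feature index and threshold are invented; in practice you'd first have to figure out which learned feature corresponds to the concept you care about.)

```python
# hypothetical monitor: flag a response when a concept feature fires strongly.
# FEATURE_ID and THRESHOLD are made-up values for illustration only.
FEATURE_ID = 1337
THRESHOLD = 4.0

def response_triggers_concept(activations: torch.Tensor) -> bool:
    with torch.no_grad():
        _, f = sae(activations)  # per-token feature activations, from the sketch above
    return bool((f[:, FEATURE_ID] > THRESHOLD).any())

# usage: run the model, capture its activations, then check the flag
sample = torch.randn(32, 512)  # stand-in for one response's activations
print(response_triggers_concept(sample))
```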
this is great work. i’m excited to see where this goes next
i hope #anthropic exposes this via their API. right now, most of the promising interpretability work is only available on open-source models that you can run yourself. it would be great to have these capabilities available from #AI vendors too
Google to invest up to $2B in Anthropic - and… the race is on between, on one side, Microsoft and OpenAI; and on the other side, Google and Anthropic. My $$ is on MS & OpenAI at the moment - and I don’t expect that to change. OpenAI is the clear leader in AI, with a considerable head start and a top-shelf team. Anthropic will have a lot of catching up to do unless they’ve got some kind of killer, breakthrough tech they’re hiding until launch. #AI #Microsoft #Google #OpenAI #Anthropic https://www.reuters.com/technology/google-agrees-invest-up-2-bln-openai-rival-anthropic-wsj-2023-10-27/
My first troublesome hallucination with a #LLM in a while: #Claude3 #Opus (200k context) insisting that I could configure my existing #Yubikey #GPG keys to work with #Kerberos PKINIT, and helping me for a couple of hours to try to do so, before realising that GPG keys aren't supported for this use case (PKINIT needs X.509 certificates, not OpenPGP keys). Whoops.
No real bother other than some wasted time, but a bit painful and disappointing.
#Anthropic is killing it with their AI game, especially for a small startup. Their models are way better than #OpenAI's, but they're focusing more on enterprise stuff rather than hyping it up. This might be a risky move since they don't have a cult following like other AI companies. Still, gotta give them props for their impressive tech. It'll be interesting to see how they balance enterprise with getting more attention from the AI community.
Big news in the #AI world: Current and former employees of #OpenAI and other AI companies like #DeepMind and #Anthropic warn of ethical and safety risks and want a way to publicly whistleblow about these without fear of retaliation.
Back in 2022, Anthropic CEO Dario Amodei chose not to release the super-powerful AI chatbot, Claude, that his company had just finished training, opting instead to focus on further internal safety testing. That move likely cost the company billions — three months later, OpenAI launched ChatGPT.
Having a reputation for credibility and caution in an industry that appears to have thrown much of both to the wind is not a bad thing, though. Claude is now in its third iteration, but that caution remains, with the company pledging not to release AIs above certain capability levels until it can develop sufficiently robust safety measures.
TIME’s interview with Amodei gives an insight into what the AI industry might look like when safety is considered a core part of the strategy.
“Today we report a significant advance in understanding the inner workings of AI models. We have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, production-grade large language model. This interpretability discovery could, in future, help us make AI models safer.”