vicki

@vicki@jawns.club

#Recsys. #MachineLearning. #Engineering and distributed systems. Python, the JVM, and two small children. Writing. Terrible puns. Nutella. Building https://viberary.pizza 📚🍕


vicki, 28 days ago to LLMs

The most interesting stuff in #LLMs right now (to me) is:

figuring out how to do it small

figuring out how to do it on CPU

figuring out how to do it well for specific tasks


vicki, 1 month ago to random

new post on new redis ❤️‍🩹

https://vickiboykis.com/2024/04/16/redis-is-forked/


vicki, 1 month ago to random

There is this widely-held belief that current models are magic, instead of carefully-calibrated systems of components:

Millions of people generating training data

People aligning models through supervised RLHF / constitutional AI

Human filters for model outputs

People building human-centric UIs


vicki, 2 months ago to random

Something I've been wondering about: what was different about the invention of the printing press, which also potentially allowed multiple copies of low-quality content, compared with LLMs? I'd love to read some historical context around what people did to combat low-quality copies in previous eras and how it worked or didn't work.

https://www.wheresyoured.at/are-we-watching-the-internet-die/


MozillaAI, 3 months ago to ai

After a year of breakneck innovation and amidst the neverending #AI hype, how do we know if a model is any "good"?

We're excited to share our team’s learnings written by @vicki at @MozillaAI

https://blog.mozilla.ai/exploring-llm-evaluation-at-scale-with-the-neurips-large-language-model-efficiency-challenge/

Given how complex the architectures of these models are, it is crucial that the community start seriously addressing the #LLMevaluation minefield.


vicki, 3 months ago

@kellogh @MozillaAI @vicki yup sorry this is the only kind of content I produce 😂😂😂


vicki, 4 months ago to machinelearning

New post: I’ve been meaning to write something around what has fundamentally changed around the process of putting ML into prod now that we have LLMs.

TL;DR: It's still just compression, we just don't control as much anymore.

#machinelearning #llm #engineering

https://vickiboykis.com/2024/01/15/whats-new-with-ml-in-production/


Edent, 5 months ago to random

Does anyone have a good primer on why Telegram seems to be the default social media platform for Russia?


vicki, 5 months ago

@Edent here’s a couple posts on it https://vicki.substack.com/p/whats-up-with-russias-internet

https://vicki.substack.com/p/with-the-feed-as-my-witness


vicki, 5 months ago to python

Went down a rabbithole around why we use the TYPE_CHECKING constant in #python.

https://vickiboykis.com/2023/12/11/why-if-type_checking/
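For readers who have not met the constant, here is a minimal sketch of the usual pattern. It is an illustration under assumptions, not code from the linked post: the function name `double` and the choice of `Decimal` as the type-only import are hypothetical.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # TYPE_CHECKING is False at runtime, so this import never executes.
    # Static type checkers treat it as True and do read the import,
    # which avoids import cycles and runtime import cost.
    from decimal import Decimal


def double(amount: "Decimal") -> "Decimal":
    # The annotations are strings, so Python does not need the name
    # Decimal to exist when this function is defined or called.
    return amount * 2
```

The string annotations are what let the module load even though `Decimal` was never imported at runtime; a type checker still resolves them against the guarded import.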


vicki, 7 months ago to random

I’m never complaining about CI that takes a minute again. https://news.ycombinator.com/item?id=38170845


vicki, 7 months ago to random

Joined a bunch of Discords to catch up on the LLM space. The energy of building is amazing but absolutely all of the information and ghost knowledge (https://vickiboykis.com/2021/03/26/the-ghosts-in-the-data/) is getting lost.


vicki, 7 months ago

@hrbrmstr @astrojuanlu yeah it’s not so much saying “Discord sucks for community”, it’s a wonderful tool for bringing people together. It’s “Discord sucks for discovery” which is important for community continuity


kellogh, 7 months ago to LLMs

i wish i knew more about comparing #embeddings. anyone have resources? one thing i’ve wondered is how to convert an embedding from a “point” to an “area” or “volume”. e.g. an embedding of a 5 paragraph essay will occupy a single point in embedding space, but if you broke it down (e.g. by paragraph), there would be several points and the whole would presumably be at the center. is there a way to trace the full space a text occupies in #embedding space? #LLMs #LLM #AI #NLP


vicki, 7 months ago

@kellogh what are you looking to solve ultimately?


vicki, 7 months ago

@kellogh so somewhere between sentiment analysis and topic detection at scale?


vicki, 7 months ago

@kellogh it sounds a bit like what you’re talking about is averaging embeddings, or coming up with unit embeddings that you can then do math on, but that collapses, like you said, depending on the context. There might be a different way to formulate your problem.
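A rough sketch of why averaging collapses information, assuming made-up 2-d vectors in place of real paragraph embeddings. The helper names (`centroid`, `cosine`, `spread`) are hypothetical, and mean cosine distance from the centroid is only one possible stand-in for the "area" a text occupies.

```python
import math


def centroid(vectors):
    """Element-wise mean of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def spread(vectors):
    """Mean cosine distance of each vector from the centroid."""
    center = centroid(vectors)
    return sum(1 - cosine(v, center) for v in vectors) / len(vectors)


# Toy "paragraph embeddings" (invented numbers, not model output).
wide_essay = [[1.0, 0.0], [0.0, 1.0]]    # paragraphs point in different directions
tight_essay = [[0.5, 0.5], [0.5, 0.5]]   # paragraphs all say the same thing

# Both essays average to the same point, so the mean alone cannot tell them apart.
same_point = centroid(wide_essay) == centroid(tight_essay)

# The spread around that point is what differs between the two.
wide_spread = spread(wide_essay)
tight_spread = spread(tight_essay)
```

Keeping the per-paragraph vectors alongside the mean, rather than the mean alone, preserves that difference for whatever comparison comes later.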


vicki, 7 months ago to Java

Does anyone have any recommended reading about the context behind Java’s virtual threads, the system architecture behind them and the related systems concepts?

#java #java21 #programming


vicki, 8 months ago to random

An extremely low-effort but high-impact thing you can do in documentation, emails, PRs, etc., is to add a sentence at the top that is just "context," giving the context of the code you're working with, links to the project, whatever. Not everyone coming into the doc knows what it's about.


vicki, 8 months ago to random

What a banger of a post about what it means to care about software quality and thinking about it in a larger frame.

“When you work day in and day out with something where Quality isn't important, you take psychic damage.”

https://ludic.mataroa.blog/blog/reverence-for-the-craft-business-and-your-immortal-soul/


vicki, 8 months ago to random

The biggest insult I’ve heard software developers I respect use about software that they don’t think is good is that it is “not elegant.” Somehow it hits much harder than “this is crap.”


amcasari, 8 months ago to random

My alarm goes off when it's still dark, so that my brain can wake up a bit before the morning family routines. Sometimes I open Mastodon, just to glance through for any big news to start the day.

This morning, I was immediately riveted in place by @vicki's keynote on "Build and keep your context window."

Vicki expertly weaves together multiple concepts across literature, social science, history, engineering, and familiar concepts for "data people".

#RecommendedRead

https://jawns.club/@vicki/111062827076966684


vicki, 8 months ago

@amcasari ❤️❤️❤️


vicki, 8 months ago to machinelearning

Enormous thank you to PyData Amsterdam for inviting me to keynote. Slides and notes from my talk, "Build and keep your context window" are all here, and if you're in Amsterdam say hi. :)

https://vickiboykis.com/2023/09/13/build-and-keep-your-context-window/

#data #machinelearning #stroopwafel


vicki, 9 months ago to random

It’s AI if you need money for GPUs, it’s ML if you need money for K8s.


dabeaz, 9 months ago to random

As Python continues its ascendance, I sometimes wonder if I even have a place in that world. I'm not so sure.


vicki, 9 months ago

@dabeaz not true, as the ecosystem continues to grow and experience eternal September and draw people new to both Python and programming as a skill, we need experienced voices of reason more than ever to give context.


webology, 9 months ago to random

🏕️ "Leave it better than when you found it."

🤔 I am sure it will take me a while to update my various email and social media footers, but my ~5-year term as @ThePSF director and vice chair ended today.

🙏 It was an honor to serve, work with everyone, and leave a small mark on the community and our direction.

👟 If you are a runner and have ever finished a race a little faster than you thought you could or met a new distance goal and surprised yourself, that's how I feel today. ❤️


vicki, 9 months ago

@webology @ThePSF thank you so much for serving! 🙏🐍


vicki, 9 months ago to random

Starting a reading list for only normcore content around LLMs. What would you add?

Normcore = no hype, no langchain, no "AI is going to destroy us all", just practical, technical readings on navigating this brave new world.

https://gist.github.com/veekaybee/be375ab33085102f9027853128dc5f0e


vicki, 9 months ago to random

So funny: we tried to outsource the data management layer by using pre-trained LLMs that scraped the entire internet, but now we are also doing RAG, so not only do we have to manage remote API latency, we also still have to do in-house data management.
