osma

@osma@sigmoid.social

Inf Sys Specialist at NatLibFi. Creator of #Annif automated subject indexing (text classification) tool built on AI and ML methods.
Opted in to tootfinder

This profile is from a federated server and may be incomplete. Browse more on the original instance.

qlp, 22 days ago to python

This joke has probably been made a bunch of times, but...

Python 3.14, not to be confused with PyPI.

#Python

reply

expand (10)

collapse (10)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ juliobiason, acdha, amcasari, adnan +13 more

osma, 22 days ago

@qlp
What about PyPy then?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 1 month ago to random

I just got baited into applying Annif on a new task...

Ari Hershowitz published a data set of US Congress bills and posted about it on LinkedIn, including some results on applying LLMs for classifying them.

https://huggingface.co/datasets/dreamproit/bill_labels_us
https://www.linkedin.com/posts/ari-hershowitz_dreamproitbilllabelsus-datasets-at-hugging-activity-7193325364230721536-fz61

I applied #Annif on the data set to predict policy areas with 90% accuracy and legislative subjects with a F1 score of nearly 74%. These are much better results using only cheap traditional ML approaches.

https://github.com/osma/annif-us-congress-bills

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ trochee, acdha

mcc, 1 month ago to random

Hard to imagine a signal that a website is a rugpull more intense than banning users for trying to delete their own posts

https://www.tomshardware.com/tech-industry/artificial-intelligence/stack-overflow-bans-users-en-masse-for-rebelling-against-openai-partnership-users-banned-for-deleting-answers-to-prevent-them-being-used-to-train-chatgpt

Like just incredible "burning the future to power the present" energy here

reply

expand (121)

collapse (121)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ futureisfoss, apq, demofox, noodlejetski +62 more

osma, 1 month ago

@mcc
@WomanCorn That's exactly what they've done. https://stackoverflow.com/help/gen-ai-policy

As noted above, all content published on SO is available under the CC BY-SA license, which is usually taken to mean that training LLMs is permitted. https://stackoverflow.com/help/licensing

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 1 month ago to ai

AI Sauna is about to start!
I will give a talk about Civilized AI in around half an hour. You can follow the livestream.

In the evening you can meet me at the sauna. Tomorrow will be an AI hackathon.

https://meta.wikimedia.org/wiki/AI_Sauna

#AISauna #AI #Sauna #AvoinGLAM #OpenGLAM #Wikimedia #Helsinki #Allas

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ nemobis

osma, 3 months ago to random

OTD in 1712, in Sweden (and Finland) it was not the first of March, but the 30th of February.

#impossibledates #otd
https://en.m.wikipedia.org/wiki/1712_in_Sweden#Events

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ OskarImKeller

osma, 3 months ago

@SemAntiKast @osma
A few other instances of February 30th are listed here: https://en.m.wikipedia.org/wiki/List_of_non-standard_dates#February_30

...but none of the others are as "real" as the one in Sweden 1712.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

SmudgeTheInsultCat, 4 months ago to random

reply

expand (9)

collapse (9)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ qurlyjoe, okapilips, gdinwiddie, revk +41 more

osma, 4 months ago

@SmudgeTheInsultCat The same happened with a lake in Northern Karelia, near the Finnish-Russian border. It was called simply "jaur" (lake) by locals speaking Sámi language. In Finnish maps it thus became "Jaurjärvi" (lake lake). Russian mapmakers called it "Jaurjärviozero" (you can guess...) and then during WW2, Germans based their maps on Russian maps and dutifully named it "Jaurjärviozerosee" (lake lake lake lake).

Recounted e.g. here (in Finnish): http://karirydman.blogspot.com/2010/12/jaurjarviozerosee.html

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ qurlyjoe

dgar, 4 months ago to random

Finland has closed it borders.

Now nobody can cross the Finnish line.

reply

expand (15)

collapse (15)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ fkamiah17, wonka

osma, 4 months ago

@tessarakt
Because we Finns like to punnish them.
@dgar

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 5 months ago to random Finnish

Uudelleensyntynyt Väri-Signe-bottini @varisigne sai aika kivasti seuraajia, joten päätin perustaa sille kollegan: Väri-Timiri @varitimiri julkaisee vastaavaan tapaan Ivan Timiriasewin, Helsinkiin rakastuneen venäläissyntyisen upseerin, historiallisia valokuvia mm. ensimmäisen maailmansodan ja sisällissodan ajoilta. Tervetuloa seuraamaan sitäkin!

Molemmat kuvabotit julkaisevat nyt tekoälyväritetyn kuvan lisäksi myös alkuperäisen mustavalkoisen kuvan.

#tekoäly #valokuvat #Helsinki #BotArt

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ antti404

simon, 5 months ago to random

Many options for running Mistral models in your terminal using LLM

I wrote about a whole bunch of different ways you can use my LLM tool to run prompts through Mistral 7B, Mixtral 8x7B and the new Mistral-medium from the terminal:

https://simonwillison.net/2023/Dec/18/mistral/

reply

expand (11)

collapse (11)

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 5 months ago

@simon
Excellent as always! Thanks!

Minor nitpick: You say that Mistral Small beats GPT-3.5 on every metric. But in the table it has slightly lower scores for WinoGrande and MT Bench.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

b0rk, 7 months ago (edited 7 months ago) to random

if you're an infrequent command line user -- what text editor do you use if you need to occasionally edit a file on the command line (other than vim/emacs)?

curious about what people use to edit a git commit message etc

if you picked 'other', I'd love to hear what you do in the replies!

reply

expand (258)

collapse (258)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ juandesant, alter_unicorn, ryanc, brainwane +1 more

osma, 7 months ago

@b0rk
joe!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 7 months ago to random

New LLM paper highlighting quite how weird and ridiculous these things are https://arxiv.org/abs/2307.11760

Adding "it's important to my career" can produce better results, across every model they tested!

reply

expand (32)

collapse (32)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ lacey, pluralistic, ronald, rakyat +5 more

osma, 7 months ago

@simon
How does this compare to taking a deep breath? https://arstechnica.com/information-technology/2023/09/telling-ai-model-to-take-a-deep-breath-causes-math-scores-to-soar-in-study/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 7 months ago

@simon I propose that this kind of prompt engineering should be called "silly computing", or "sillyputing" for short. With nods to Silly Putty and of course Monty Python.

"I'm afraid your prompt isn't silly enough. Can you make it sillier?"

#sillyputing

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

b0rk, 7 months ago to random

today I'm thinking about the tradeoffs of using git rebase a bit. I think the goal of rebase is to have a nice linear commit history, which is something I like.

but what are the costs of using rebase? what problems has it caused for you in practice? I'm really only interested in specific bad experiences you've had here -- not opinions or general statements like “rewriting history is bad”

reply

expand (183)

collapse (183)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ odddev, alcinnz, _chrismay

osma, 7 months ago

@b0rk
It can be painful to review feature branches that are rebased during development (often for good reasons). My local branch gets out of sync with the remote, rebased branch so I can't just pull in the most recent work on top of what I had from before. I usually just delete the local branch and re-fetch and checkout it. I don't know if there's a more elegant way.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 7 months ago

@sarajw
@b0rk Seconded!

Also, if you had to fix any conflicts during earlier merges, when you do a rebase, you will often have to fix them again.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

aarontay, 7 months ago to random

I understand whenever there's some new technology, librarians need to say things like rah rah things like "users will still need us Librarians to guide them to use those tools" to encourage ourselves but this means nothing if this is all words and no action. Think you are the best person to guide users for generative AI tools? Then really study them, as deeply as you can & not just wait for vendors to "educate" you & you endup a mouthpiece of a product you paid for.

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 7 months ago

@brewsterkahle
@aarontay
Definitely interested, this is basically what I've been doing with my colleagues for the last few years. Especially #Annif but also other AI-related things and projects, including LLMs.

I think the chatbot made by the National Library of Luxembourg is an interesting example of AI helping library patrons: https://bnl.public.lu/en/a-la-une/actualites/communiques/2023/chatbot-eluxemburgensia.html

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

b0rk, 7 months ago to random

confusing git terminology https://jvns.ca/blog/2023/11/01/confusing-git-terminology/

reply

expand (71)

collapse (71)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Paxxi, c0dec0dec0de, stuartmarks, paulox +16 more

osma, 7 months ago

@b0rk Thanks a lot for this, you did an amazing job here!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 7 months ago to random Finnish

Väittävät, että 15-pikaratikan reitti Viikissä olisi jotenkin poikkeuksellisen vaikea ymmärtää. Enpä tiedä. Kiskojen reitillä on kiveä, liikenneympyrän ajoradalla asfaltointi. Kiskojen suuntaan on ihan selvä ajokiellon liikennemerkki.

Toisaalta kun Kalasatamassa asuu, niin päivittäin näkee kymmenien autoilijoiden täysin surutta ajavan vasten moottoriajoneuvolla ajokiellon merkkejä useammalla kujalla ja sillalla. Olisko kuitenkin kyse kuskien osaamisesta tai asenteesta?

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 7 months ago

@osma Olen ajanut Viikin liikenneympyrästä aika monta kertaa viime vuosina ja aina se on hämmentävä, pimeällä suorastaan pelottava kokemus vilkkuvaloineen ja takavasemmalta tulevine ratikoineen. Jo ennen pikaratikkaa, kun Viikinmäen siltaa käyttivät 550-bussit. Olen nähnyt henkilöauton eksyvän sinne jo ennen raiteiden rakentamista. Jos minulta kysytään, niin ei kovin onnistunut risteys.

Tämä ei tietenkään millään tavoin poista kuljettajan vastuuta tarkkailla ympäristöään ja liikennemerkkejä.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

simon, 8 months ago to random

I'm on the latest episode of the Rooftop Ruby podcast with @collin and @joeldrapper talking about Large Language Models

It was a really excellent conversation - we covered a huge amount of ground

I'm trying something new: I put together my own transcript with Whisper, then cleaned that up and added inline links and section headings. Here's the result, complete with an embedded audio player that can jump to each different section: https://simonwillison.net/2023/Sep/29/llms-podcast/

reply

expand (23)

collapse (23)

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 8 months ago

@simon
@collin @joeldrapper

Once again you've done an an amazing job, both in terms of content and form! I really liked the transcript but I would never had listened to such a long podcast. Thanks so much!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

arstechnica, 8 months ago to random

Can you melt eggs? Quora’s AI says “yes,” and Google is sharing the result

Incorrect AI-generated answers are forming a feedback loop of misinformation online.

https://arstechnica.com/information-technology/2023/09/can-you-melt-eggs-quoras-ai-says-yes-and-google-is-sharing-the-result/?utm_brand=arstechnica&utm_social-type=owned&utm_source=mastodon&utm_medium=social

reply

expand (18)

collapse (18)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ dgoldsmith, baldur, Nerdfest, 50Percent_Cooler +5 more

osma, 8 months ago

@arstechnica Apparently this kind of misinformation feedback loop happens with Google's own Bard conversations as well: https://mastodon.social/@nixCraft/111132139532868852

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

treyhunner, 9 months ago to python

What's a #Python feature you wish existed but doesn't? 🤔

It can even be something others would think is absurd. 🛸

Dream big! 💭

reply

expand (18)

collapse (18)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ hugovk

osma, 8 months ago

@treyhunner
Bing able to run computations over big data structures in parallel without absurd amounts of overhead. Something like:

results = set()
parallel for big_obj in big_obj_list:
results.add(analyze(big_obj))

Maybe the no-GIL work will enable this eventually.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jhilden, 9 months ago to random

The Holy See implies the existence of the Holy Hear, Holy Smell and Holy Feel

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 9 months ago

@jhilden Not to mention the Holy Taste

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 9 months ago

@jhilden
If we extend this to not just using our senses but also other everyday activities, then we have things like Holy Talk, Holy Walk, Holy Sleep etc. And, well, eventually, Holy Shit.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 9 months ago to Finland

Big demonstration against government racism and fascism in #Helsinki, #Finland.

#MeEmmeVaikene

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ immersfer, DrDottir, antti404, hugovk

simon, 9 months ago to random

Here's the video, full set of slides and annotated transcript for the talk I gave at WordCamp US #WCUS on Friday: "Making Large Language Models work for you"
https://simonwillison.net/2023/Aug/27/wordcamp-llms/

reply

expand (31)

collapse (31)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ TechDesk, paulox, lewiscowles1986, ElZoido +3 more

osma, 9 months ago

@simon
Thanks again for a fantastic talk and a superb transcript! You're doing an awesome job!

One question: you have a wide repertoire of skills and tricks for using pretrained LLMs, but you didn't mention fine-tuning, why? To me this is perhaps the single most exciting way of applying LLMs to solve practical problems - teaching them new skills just by example. And with techniques like PEFT and QLoRA it's very cheap and easy. Of course many things are possible without fine-tuning as well...

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

osma, 9 months ago

@simon
Very good points!

I've tried fine-tuning with the GPT-3 API, and it was surprisingly easy, but a bit costly. Lately I've also fine-tuned Llama 2 locally, and it was a lot more difficult to put together all the pieces in the right way. Even ChatGPT is of limited help because the tools and libraries are so new. A good cookbook would certainly help! There are of course many examples in blog posts etc. but it's hard to adapt those to your own situation.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...