Try a local #LLM using #Mozilla's #Llamafile:... - Large Language Models

happyborg, 6 months ago

Try a local #LLM using #Mozilla's #Llamafile:

Get the 4.3GB file:
curl -LO https://huggingface.co/jartine/llava-v1.5-7B-GGUF/resolve/main/llava-v1.5-7b-q4-server.llamafile

Make executable

Run it

Visit http://127.0.0.1:8080/

You're welcome!

This is a text and image capable LLM, so ~ #ChatGPT 4

h/t @simon

More in Simon's blog: https://simonwillison.net/2023/Nov/29/llamafile/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ markhughes

Image

Image alternative text

happyborg, 6 months ago

@simon
Has anyone found a use for #Llama #LLM (see above)?

Serious question. It's interesting to play with but I haven't found anything I'd use it for yet. The only thing it seems it may be good at is summarising a news page and articles filtered by subject - though I'm not sure that's really useful to me.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ markhughes, happyborg

heysupratim, 6 months ago

@happyborg @simon The mental model with decently large models (>7b params ) is you have to treat them as just nicer thought assistants.

Basically an improved focused search and text generator.

Anytime you require generation of some volume of textual data, just use your LLM as a take off point

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

happyborg, 6 months ago

@heysupratim

> Anytime you require generation of some volume of textual data, just use your LLM as a take off point

Can you elaborate. I don't see how that would help me tbh. IME it's valuable to sit with a task until I am ready to write and then it writes itself.

I've never accepted that gmail or other response generators are doing anything other than taking creativity and autonomy away from humans and moulding them into components of a machine which has anti-human goals.

Easy is not good.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

heysupratim, 6 months ago

@happyborg yeah those are discussions that i am sure no-one would be able to justify their own sides for ever.

Over time, i have just hardened my skin to just accept the changing dynamics of how we interact with our computer. I totally get though it can be a personal values based decision but try giving it a chance in the next few years, who knows you might like it then

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mkhoury, 6 months ago

@happyborg @simon I use it to do passive things like categorizing, topic extraction and summarization on some RSS feeds. Not super useful with 4k context window, but I see some ways to extend it which I haven't played with yet. I also haven't really played with fine tuning.

It's nice that I can run it on my computer. As long as I find passive/async use cases, it's a great tool.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

happyborg, 6 months ago

@mkhoury
Yes, the one thing I've found so far that might be useful.

I asked it to summarise a news page, visiting the linked articles and summarising them. It seemed to do a reasonable job - also filtering out all the graphics, ads etc as a bonus. Pretty basic stuff though.

In another session when I hadn't asked it to do anything like that it blurted out that it was unable to access the internet and couldn't do what I asked. 🤷‍♂️

Nobody would use or release this crap if it wasn't human like.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Add comment