#javascript - kbin.social

joe, 3 hours ago to ai

LLaVA (Large Language-and-Vision Assistant) was updated to version 1.6 in February, so I figured it was time to look at how to use it to describe an image in Node.js. LLaVA 1.6 is a vision-language model built for multi-modal tasks: it takes an image together with a text prompt and responds in text. Last month, we looked at how to use the official Ollama JavaScript Library. We are going to use the same library today.

Basic CLI Example

Let’s start with a CLI app. For this example, I am using my remote Ollama server, but if you don’t have one, you will want to install Ollama locally and replace const ollama = new Ollama({ host: 'http://100.74.30.25:11434' }); with const ollama = new Ollama({ host: 'http://localhost:11434' });.

To run it, first run npm i ollama and make sure that you have "type": "module" in your package.json. You can run it from the terminal by running node app.js <image filename>. Let’s take a look at the result.
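The code for app.js isn't reproduced in this post, so what follows is a minimal sketch of what such a script might look like, not the original. It assumes the llava model has already been pulled on the Ollama server. The helper name describeImage, the prompt text, and the choice to pass the client in as a parameter are all illustrative; generate() with model, prompt, and images is the call from the Ollama JavaScript library.

```javascript
// Sketch only: the post's original app.js is not shown, so names here are assumptions.
// `ollama` is any client exposing generate(), e.g. `new Ollama({ host })` from the
// `ollama` npm package; passing it in keeps the helper easy to try with a stub.
async function describeImage(ollama, imageBytes, model = 'llava') {
  const result = await ollama.generate({
    model, // assumes the model was pulled beforehand (`ollama pull llava`)
    prompt: 'Describe this image.', // hypothetical prompt
    images: [Buffer.from(imageBytes).toString('base64')], // base64-encoded image data
  });
  return result.response.trim();
}

// Wiring it up in app.js (ES module), run as: node app.js <image filename>
//   import { readFile } from 'node:fs/promises';
//   import { Ollama } from 'ollama';
//   const ollama = new Ollama({ host: 'http://localhost:11434' });
//   console.log(await describeImage(ollama, await readFile(process.argv[2])));
```

Keeping the client as a parameter means the same helper works against a local or a remote Ollama host without changes.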

The Image	The Description
https://i0.wp.com/jws.news/wp-content/uploads/2024/05/window-sign-580x423.jpg?resize=580%2C423&ssl=1	https://i0.wp.com/jws.news/wp-content/uploads/2024/05/Screenshot-2024-05-18-at-1.06.55%E2%80%AFPM.png?resize=669%2C502&ssl=1
https://i0.wp.com/jws.news/wp-content/uploads/2024/05/sandwiches-580x423.jpg?resize=580%2C423&ssl=1	https://i0.wp.com/jws.news/wp-content/uploads/2024/05/Screenshot-2024-05-18-at-1.12.21%E2%80%AFPM.png?resize=669%2C502&ssl=1
https://i0.wp.com/jws.news/wp-content/uploads/2024/05/concert-580x423.jpg?resize=580%2C423&ssl=1	https://i0.wp.com/jws.news/wp-content/uploads/2024/05/Screenshot-2024-05-18-at-1.18.06%E2%80%AFPM.png?resize=669%2C502&ssl=1

Its ability to describe an image is pretty awesome.

Basic Web Service

So, what if we wanted to run it as a web service? Running Ollama locally is cool and all, but it’s cooler if we can integrate it into an app. Install Express with npm install express and you can run this as a web service.

The web service accepts POST requests at http://localhost:4040/describe-image with a binary body containing the image that you want described. It then returns a JSON object containing the description.
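The server code isn't included in this post either, so here is a hedged sketch of how the endpoint described above might be put together. The factory name makeDescribeHandler, the description field in the JSON response, the prompt, and the 400/502 status codes are assumptions; express.raw() is Express's built-in parser for binary bodies.

```javascript
// Sketch only: the post's server code is not shown. The response field name
// `description` and the status codes are assumptions.
function makeDescribeHandler(ollama, model = 'llava') {
  // Returns an Express-style (req, res) handler; expects req.body to be a Buffer,
  // which is what the express.raw() body parser produces.
  return async function describeImageHandler(req, res) {
    if (!Buffer.isBuffer(req.body) || req.body.length === 0) {
      return res.status(400).json({ error: 'Expected a binary image body' });
    }
    try {
      const result = await ollama.generate({
        model,
        prompt: 'Describe this image.', // hypothetical prompt
        images: [req.body.toString('base64')],
      });
      return res.json({ description: result.response.trim() });
    } catch (err) {
      return res.status(502).json({ error: 'Model request failed' });
    }
  };
}

// Wiring it up (ES module), assuming `express` and `ollama` are installed:
//   import express from 'express';
//   import { Ollama } from 'ollama';
//   const app = express();
//   const ollama = new Ollama({ host: 'http://localhost:11434' });
//   app.post('/describe-image',
//     express.raw({ type: '*/*', limit: '10mb' }),
//     makeDescribeHandler(ollama));
//   app.listen(4040);
```

With that wiring in place, something like curl --data-binary @photo.jpg -H 'Content-Type: image/jpeg' http://localhost:4040/describe-image should exercise it.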

https://i0.wp.com/jws.news/wp-content/uploads/2024/05/Screenshot-2024-05-18-at-1.41.20%E2%80%AFPM.png?resize=1024%2C729&ssl=1

Have any questions, comments, etc? Feel free to drop a comment, below.

https://jws.news/2024/how-can-you-use-llava-and-node-js-to-describe-an-image/

#AI #JavaScript #LLaVA #LLM #NodeJs #Ollama


ovid, 9 hours ago to Lisp

#Perl, #Smalltalk, and #Lisp are three powerful programming languages that share a common feature.

Nobody knows how the hell to capitalize them.

#programming #software


ovid, 8 hours ago

@tripleo I always have to look up the capitalization of Smalltalk because I get it wrong every time.

Hmm ... #JavaScript should be in that list.


ecmascript_news, 18 hours ago to javascript

ECMAScript 2025 feature: duplicate named capturing groups for regular expressions
@rauschma
https://2ality.com/2024/05/proposal-duplicate-named-capturing-groups.html

#ECMAScript #JavaScript


zalasur, 19 hours ago to javascript

It's been almost a decade since I've done a live coding stream. This will be fun!

Today I'll be migrating my website from React to Lit, which is a lightweight framework built around web components. I have the scaffolding set up mostly, so now it's time to get this done.

Come watch. Ask questions in chat! You don't need to create an account, just a username is needed to participate.

https://video.surazal.net/w/5S7FPXJMZh1i1eqZLY9mcV

#Javascript #Node #Development #PeerTube #Stream #Streaming


ecmascript_news, 20 hours ago to javascript

Node v22.2.0 (current)
@targos @nodejs
https://nodejs.org/en/blog/release/v22.2.0

#ECMAScript #JavaScript #NodeJS


ecmascript_news, 20 hours ago to javascript

esbuild v0.21.3: decorator metadata and more
@evanw
https://github.com/evanw/esbuild/releases/tag/v0.21.3

#ECMAScript #JavaScript #esbuild


stefan, 22 hours ago to javascript

I've been really enjoying working with Wikidata lately, setting up automated accounts like @libraries, @parks, and @lighthouses.

To see what else you can do with Wikidata, and to learn how to use it, check out a tutorial I put together: https://stefanbohacek.com/blog/making-a-map-of-unesco-world-heritage-sites/

#tutorial #wikidata #LearnToCode #javascript


ecmascript_news, 23 hours ago to javascript

Web at Google I/O 2024 [YouTube playlist]
https://www.youtube.com/playlist?list=PLOU2XLYxmsIKeQI4KTrrplA_mUPI3Lq5b

#ECMAScript #JavaScript


joelanman, 1 day ago to javascript
trying to implement a form submission progress bar in js, but XHR follows the success redirect without telling me (I want to access it and redirect the browser).

Fetch can opt out of that but doesn't have a progress api!

I think I found a workaround for XHR:
xhr.onreadystatechange = function() {
  if (this.readyState === this.DONE) {
    window.location.href = this.responseURL
    this.abort()
  }
}
#javaScript #webdev
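
For context, here is a rough sketch of how that workaround might sit alongside upload progress reporting. The helper name trackSubmission and its two callbacks are hypothetical; xhr.upload, the progress event's lengthComputable/loaded/total fields, and responseURL are standard XMLHttpRequest features.

```javascript
// Sketch only: trackSubmission and its callback parameters are illustrative names.
function trackSubmission(xhr, onProgress, onDone) {
  // Upload progress events fire on xhr.upload, not on xhr itself.
  xhr.upload.onprogress = (event) => {
    if (event.lengthComputable) onProgress(event.loaded / event.total);
  };
  xhr.onreadystatechange = () => {
    // After redirects are followed, responseURL holds the final URL.
    if (xhr.readyState === xhr.DONE) onDone(xhr.responseURL);
  };
}

// In the browser, assuming a <form> and a <progress> element:
//   const xhr = new XMLHttpRequest();
//   trackSubmission(xhr,
//     (fraction) => { progressBar.value = fraction; },
//     (url) => { window.location.href = url; });
//   xhr.open('POST', form.action);
//   xhr.send(new FormData(form));
```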

craigabbott, 2 days ago to javascript

I was today years old and 4 hours down when I learned you cannot deep clone an instance of a #JavaScript class 😩
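
The post doesn't say which cloning approach was tried. Assuming it was structuredClone (a JSON round-trip behaves similarly), here is a small sketch of the behaviour and one possible workaround. The Point class and the cloneWithPrototype helper are made up for illustration.

```javascript
class Point {
  constructor(x, y) {
    this.x = x;
    this.y = y;
    this.tags = ['origin'];
  }
  norm() {
    return Math.hypot(this.x, this.y);
  }
}

const original = new Point(3, 4);

// structuredClone copies the data but not the prototype, so methods are gone.
const plain = structuredClone(original);
console.log(plain instanceof Point); // false
console.log(typeof plain.norm); // "undefined"

// Hypothetical workaround: deep-copy the data, then reattach the prototype.
// Only the top-level object gets its class back; nested instances stay plain.
function cloneWithPrototype(instance) {
  const data = structuredClone(instance);
  return Object.assign(Object.create(Object.getPrototypeOf(instance)), data);
}

const copy = cloneWithPrototype(original);
console.log(copy instanceof Point); // true
console.log(copy.norm()); // 5
console.log(copy.tags !== original.tags); // true, the array was copied
```

This skips the constructor and does not carry over private fields, so it only suits classes whose state lives in plain, cloneable properties.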


chipx86, 2 days ago to Discord

Hey, developers: The @reviewboard team's starting a new #Discord for devs to hang out, chat, and share what you're building.

https://discord.gg/saMCqHEZ

We have channels for #django, #python, #javascript / #typescript, #opensource, #gamedev, and more.

You don't need to use or contribute to Review Board to hang out. (But you can follow development there, if you want.)

We hope to see people come in and hang out. The aim is a friendly, diverse community of devs.

Feel free to pass along the invite!


stvfrnzl, 2 days ago to Blog

Five years ago was my graduation from #CodingBootcamp on this very day.

I looked back and wrote a GIANT #blog post: https://stevefrenzel.dev/posts/from-boot-camp-to-blog-five-years-in-the-tech-industry/

Sit back, relax and enjoy the ride. #WebDev #Fullstack #Coding #Frontend #Backend #HTML #CSS #JavaScript


a11yclub, 2 days ago to CSS

Just in: Alongside @karlgroves and @erikKroes, @5t3ph will also be offering a workshop in Amsterdam on 9 June:

Beyond #CSS: #JavaScript Requirements for Accessible Components

Participation in half-day workshops costs from € 25, full-day workshops from € 50. Further workshops can be added at any time. Get your tickets: https://ti.to/tollwerk/accessibility-club-summit-2024

More info: https://accessibility.club/event/accessibility-club-summit-2024#schedule-2024-06-09


MaxArt2501, 3 days ago to javascript

A response to @cferdinandi 's recent post(s) on JavaScript and Web Components:
https://dev.to/maxart2501/javascript-is-not-the-problem-k4e

I know he didn't explain his position in detail, so an 1800-word article sounds a little unfair, but I think dry and sharp statements need adequate context and analysis.

#webcomponents #javascript #webdev


decompwlj, 3 days ago to mathematics

One day, one decomposition
A076056: Primes which when read backwards are composite numbers

3D graph, threejs - webGL ➡️ https://decompwlj.com/3Dgraph/A076056.html
2D graph, first 500 terms ➡️ https://decompwlj.com/2Dgraph500terms/A076056.html

#decompwlj #maths #mathematics #sequence #OEIS #javascript #php #3D #primes #composite #numbers #primenumbers #graph #threejs #webGL

Decomposition into weight × level + jump of A076056 in 3D (threejs - WebGL) (log(weight), log(level), log(jump))


SocketSecurity, 3 days ago to programming

LDAPjs, an LDAP client and server API for Node.js, was decommissioned after its maintainer received an abusive email from a user, raising concerns about this form of abuse as a potential attack vector. #nodejs #JavaScript #opensource https://socket.dev/blog/ldapjs-open-source-project-decommissioned-after-maintainer-receives-abusive-email


davidbisset, 3 days ago to javascript

A long list of (advanced) #JavaScript questions, and their explanations (created in 2019). #Programming

https://github.com/lydiahallie/javascript-questions


YurkshireLad, 3 days ago to javascript

These questions/answers make me glad I'm not working with #javascript anymore! #dev

"lydiahallie/javascript-questions: A long list of (advanced) JavaScript questions, and their explanations"

https://github.com/lydiahallie/javascript-questions


masukomi, 3 days ago to javascript

#JavaScript Geeks:

🤔 I really want to figure out how to make #FediThready be able to post to a Mastodon server WITHOUT requiring a back-end.

I'm pretty sure it's possible to do OAuth and store the token locally without one, but i would love it if someone could point me to an example of this rather than figuring it out from first principles.


leanpub, 3 days ago to Java

Modern Thymeleaf Bundle https://leanpub.com/b/modern-thymeleaf-bundle by Wim Deblauwe is the featured bundle on the Leanpub homepage! https://leanpub.com wimdeblauwe@mastodon.social #Java #Html #WebDevelopment #Software #JavaScript


domhabersack, 3 days ago to javascript

You can turn off the “x packages are looking for funding” messages that get logged with each npm install by setting

fund = false

in your project’s or user’s npmrc file.

https://domhabersack.com/blog/npm-fund-false

#javascript


premartinpatrick, 4 days ago to Twitch French

We're working on the server side of the lottery site starting at 10:30 a.m. on my #Twitch channel. Coding in #PHP now that the #HTML/#CSS and #JavaScript part is wrapped up.

Head to https://www.twitch.tv/patrickpremartin to watch.

Yesterday I did a bit of #JS, and it wasn't as laborious as all that. Here is how participants will choose their lottery ticket numbers: https://youtu.be/vdTp7XzNmBE


janl, 4 days ago to random

Funny, in #JavaScript it is unnerving to use a library version that has not had a release in four days. https://social.vivaldi.net/@Patricia/112447339260051370


eldamir, 4 days ago to javascript

How do you check and make sure that some data is null in #javascript?

#lol


davidbisset, 4 days ago to javascript

Five Basic Things About #JavaScript That Will Help Non JavaScript-Focused Web #Designers
https://frontendmasters.com/blog/5-things-designers-can-do-with-javascript/
