@chikim@mastodon.social avatar

chikim

@chikim@mastodon.social

Love music, technology, accessibility! Faculty at Berklee College of Music 👨🏻‍💻🎹🐕‍🦺

This profile is from a federated server and may be incomplete. Browse more on the original instance.

chikim, to random
@chikim@mastodon.social avatar

Let's try again! I haven't found any UI for local LLMs that isn't annoying to use with screen readers, so I just made one for myself for Ollama called VOLlama. lol Hope someone finds it useful.
Windows users: follow the instruction on the release page to install Ollama with Docker.
Mac user: Install Ollama using the instruction on ollama.ai. Also, the app is not signed.
https://github.com/chigkim/VOLlama/releases/tag/v0.1.0-alpha.1
@vick21 @freakyfwoof @tristan @KyleBorah @Bri

chikim, to random
@chikim@mastodon.social avatar

VOCR v2.0.0-alpha.13: It now supports GPT, Ollama, Llama.cpp. Use models submenu in settings. @vick21 @talon @pixelate @KyleBorah @FreakyFwoof @Bri @pitermach https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.13

chikim, to random
@chikim@mastodon.social avatar

VOLlama v0.1.4-beta.1: System Prompt manager; Import Awesome ChatGPT Prompts; Partial support for GPT-4O (Throws an error for token counter in some cases but just ignore for now); Able to attach entire document and feed for long context model. https://chigkim.github.io/VOLlama/

chikim, to random
@chikim@mastodon.social avatar

VOCR v2.0.0-alpha.23 safely stores your GPT API key in your Keychain.
Very Important! Until now your API key has been stored without encryption, so quit VOCR, trash permanently ~/Library/Preferences/com.chikim.VOCR.plist with Command+Option+Delete, and reboot!
https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.23
@tristan @FreakyFwoof @vick21 @KyleBorah @talon @pixelate @Bri @pitermach

chikim, to random
@chikim@mastodon.social avatar

VOCR v2.0.0-alpha.16
Takes a screenshot before asking for a prompt; Able to select which model for Ollama to use if multiple clip models are found
https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.16
@talon @vick21 @pixelate @KyleBorah @FreakyFwoof @Bri @pitermach

chikim, to random
@chikim@mastodon.social avatar

VOCR v2.0.0-alpha.7: 1) Changed diff algorithm for less verbose during realtime OCR; 2) Realtime OCR remains active at its initial location, allowing you to move the VOCursor during the process; 3) no sound during realtime OCR @Bri @FreakyFwoof @pitermach @vick21 https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.7

chikim, to random
@chikim@mastodon.social avatar

VOCR v2.0.0-alpha.18
Autoupdater, logger FOR DEBUG, Realtime OCR shortcut toggles the feature
Eventually I need to split pre-release vs public release for autoupdater, but let's see how it goes.
https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.18
@vick21 @FreakyFwoof @KyleBorah @talon @pixelate @Bri @pitermach

chikim, to random
@chikim@mastodon.social avatar

VOCR v2.0.0-alpha.3 recognizes what's under the VoiceOver cursor (Command+Control+Shift+V) utilizing GPT-4V. Keep the suggestions coming! https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.3

chikim, to random
@chikim@mastodon.social avatar

VOCR-v2.0.0-alpha.6: Now you can OCR VOCursor Realtime, and you're able to toggle object detection. Every scan during realtime OCR triggers a sound, but I'll turn it off in next build. @FreakyFwoof @pitermach @vick21 @Bri https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.6

chikim, to random
@chikim@mastodon.social avatar

VOCR-v2.0.0-alpha.5 has dedicated shortcuts: Command+Control+Shift+W for OCR window, V for OCR VOCursor, E for exploring the window with GPT, and A to ask about the VOCursor using GPT. @pitermach @FreakyFwoof @vick21 https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.5

chikim, to Discord
@chikim@mastodon.social avatar

Does anyone use #Discord on Desktop with screen reader? The UI seems extremely cluttered. Even just moving focus away from the area where you type message to the area that lists messages seems unreliable. Sometimes I'm just trapped in the last message, I can't press up to read previous messages. I can use VO cursor or virtual cursor to read the messages, but there are so many elements to skip over even just one message! I'm sure I can figure out some workarounds, but do I want it that bad? lol

chikim, to random
@chikim@mastodon.social avatar

@FreakyFwoof What are the remote jam apps you tried that were accessible? Jamulus, HPS Jam, SonoBus? Am I missing anything?

chikim, to boston
@chikim@mastodon.social avatar

In #Boston, #Massachusetts, average monthly childcare $2,591, apartment $3,970, health insurance $1,956 (after employer coverage). That's $8,517/month or $102,204/year. The average annual household income in Boston is $120,939. I'm not sure whats left after tax, housing, childcare, health insurance!

chikim, to random
@chikim@mastodon.social avatar

VOCR v2.0.0-alpha.11: Fixed a bug where some global shortcuts are deactivated after performing an OCR scan.
https://github.com/chigkim/VOCR/releases/tag/v2.0.0-alpha.11
@vick21 @KyleBorah @FreakyFwoof @Bri @pitermach

chikim, to random
@chikim@mastodon.social avatar

@ToniBarth I think you own zoom m3 mictrak right? Do you know if they released a driver update to operate in 32-bit float as an usb mic yet like m2? Is this limitation only on Windows, or Mac as well? Thanks!

chikim, to random
@chikim@mastodon.social avatar

The most interesting comment I heard about 32-bit float recording: Saying "you don't need 32-bit float if you know what you're doing" is like saying "I've been driving safely for 20 years, so I don't need airbags in my car!" Of course, you still need good mic technique, but 32-bit float gives you peace of mind when things get loud unexpectedly.

chikim, to random
@chikim@mastodon.social avatar

Try command+option+shift+escape to quickly force quit current app instead of using the force quit dialog on MacOS.

chikim, to random
@chikim@mastodon.social avatar

@vick21 Hey Vic, I just pushed another commit. If you have time, do you think you could look into improving diff function in RealTime.swift?

chikim, to jazz
@chikim@mastodon.social avatar

One of my students, Ciara Moser, who plays the bass, released her #jazz album “Blind. So What?” It sounds awesome! Check it out!
Spotify:https://open.spotify.com/album/0dwFmVgwpLgz1CwuTDK5JP?si=hFxsJhTURh--krlFhsT0Mw
Apple Music: https://music.apple.com/us/album/blind-so-what/1712158793

chikim, to reddit
@chikim@mastodon.social avatar

So So, what happened to the #Reddit blackout in the end? It seems Reddit did what they said they would do, and things are back to normal now?

chikim, to llm
@chikim@mastodon.social avatar

Mark Zuckerberg on Llama 3: Apparently Meta stopped training Llama-3-70b before convergence and decided to move onto Llama-4. Meaning they could have kept training and made it smarter! Also llama3-70b multimodal as well as multilingual and bigger context window are coming. #LLM #AI #ML https://youtu.be/bc6uFV9CJGg

chikim, to random
@chikim@mastodon.social avatar

BeMyEyes Privacy Policy 1/2: We record and store video streams and other images to enforce our Terms of Service, to promote and preserve safety, and to improve our Services and create new Services. We may provide recorded video streams or images to other organizations that are performing research or working to develop products and services that may assist blind and low-vision people or other members of the general public.

chikim, to llm
@chikim@mastodon.social avatar

If you have limited resources or want faster response from a local LLM, try HuggingFaceH4/zephyr-7b-alpha. It's based on mistral-7B, and generates pretty decent responses surpassing 13B models in various benchmarks. ollama run zephyr #LLM #AI https://ollama.ai/library/zephyr

chikim, to random
@chikim@mastodon.social avatar

For those of you who have used VOLlama, thank you for testing my hobby project! I'm considering moving it out of pre-release and marking the latest build as the first public release. Any thoughts on its stability or issues with bugs? Is it fairly stable or unstable? Of course, like all my other projects, it'll be free open source! @vick21 @technowitch @FreakyFwoof @kaveinthran @pixelate @ppatel

chikim, to llm
@chikim@mastodon.social avatar

If you missed it, check out the new GPT4O demo. https://www.youtube.com/watch?v=DQacCB9tDaw

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • InstantRegret
  • mdbf
  • osvaldo12
  • magazineikmin
  • GTA5RPClips
  • rosin
  • thenastyranch
  • Youngstown
  • cubers
  • slotface
  • khanakhh
  • kavyap
  • DreamBathrooms
  • anitta
  • Durango
  • everett
  • ethstaker
  • cisconetworking
  • provamag3
  • Leos
  • modclub
  • ngwrru68w68
  • tacticalgear
  • tester
  • megavids
  • normalnudes
  • lostlight
  • All magazines