#TTS - kbin.social

chikim, 4 days ago to random

I created samples for all 58 voices for xtts-v2. Hopefully it makes it easier for someone to choose a speaker. #coqui #tts https://we.tl/t-9vWd1gO3EN

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ datajake1999

VE3RWJ, 4 days ago to AdobePhotoshop

#ElevenLabs does not disappoint.

They've just released an #app that will read whatever document you throw at it.

You can find out more Here:

https://apps.apple.com/ca/app/elevenlabs-reader-ai-audio/id6479373050
#AI #A11Y #TTS #blind #dyslexia

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jaybird110127

jbzfn, 10 days ago to c64

Making 80s Computers Talk | 1980s Commodore Speech Synthesizer | Kari

https://www.youtube.com/watch?v=1ip7K0CaC7Y

#c64 #tts #retrocomputing

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

jedie, 17 days ago to fdroid German

Gibt es eine bessere Lösung für #TTS mit #calyxos #lineageos und co?

https://www.kuketz-blog.de/android-sprachausgabe-tts-engine-bei-custom-roms-aktivieren/

Listet #RHVoice per #fdroid auf,kann aber noch kein deutsch.

#eSpeak findet das berruntergeladene #mbrola model Datei nicht.

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

jedie, 17 days ago to android German

Gibt es was neues zu #Android: #Sprachausgabe (#TTS-Engine) bei Custom-ROMs aktivieren • Kuketz IT-Security Blog
https://www.kuketz-blog.de/android-sprachausgabe-tts-engine-bei-custom-roms-aktivieren/

#eSpeak bekomme ich nicht zum laufen, findet das runtergeladene #mbrola Modell Datei nicht.

#RHVoice hat immer noch kein deutsch.

#calyxos

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

eeejay, 19 days ago to GNOME

A demo of a #spiel sample app, the voices used in order are: eSpeakNG's "Andy" variant, MBROLA US2, and Piper's Amy. You can observe the different features like word tracking and quality. #speech #tts #gnome #linux

video/mp4

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

skinnylatte, 26 days ago to accessibility

Opening soon: 2 remote jobs for accessibility lead with the federal government. Open to US citizens and nationals only, with background check.

https://join.tts.gsa.gov/join/Solutions-Accessibility-Lead-April2024/

Info on how to write a federal resume: https://handbook.tts.gsa.gov/hiring-staying-or-changing-jobs/resume

#GetFediHired #Jobs #a11y #Accessibility #CivicTech #GSA #TTS #Government

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ thepoliticalcat, impactology

pixelate, 1 month ago to accessibility

Honestly, since the fast variants of the voices are a thing, I think I could really switch to the Sonata Neural Voices in NVDA full time. Now remember folks, these are AI voices. Scary, untrustworthy, AI voices that will smear your reputation all over fedi for using these voices! See, they even react to exclamation marks! Isn't that scary? :) Nah, the worst that'll happen, mainly with the HFC male and female, is that big numbers are garbled together. But every other voice does fine. I use Amy for work, and HFC for reading because those are among the most lively voices I've ever heard. And amazingly enough, we can make our own new voices. So, some people, from the Github repo's readme, are building more professional voices. And there are already versions of old TTS engines from the past that have been brought back to some semblence of life with this tech.

#accessibility #tts #ai #blind

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ datajake1999

ErikJonker, 1 month ago to ai Dutch

Dangerous technology, they decided not to release it yet, but it will be a matter of time before it's available.
https://www.bloomberg.com/news/articles/2024-03-29/openai-previews-new-audio-tool-that-can-read-text-mimic-voices
#ai #openai #voice #tts

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ppatel, 1 month ago to Dragonlance

OpenAI debuts Voice Engine, which lets users generate synthetic copy of a voice from a 15-second sample, available to around 100 partners, including HeyGe. In other words, it's not available to the public just yet.

https://techcrunch.com/2024/03/29/openai-custom-voice-engine-preview/

#audio #AI #voice #TTS

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jaybird110127

chikim, 1 month ago to ai

Maybe we have an open source competitor for ElevenLabs? Check out their demo which they switch between original and synthesized. I can't tell. lol Apparently they're going to fully open source codebase and model weights. #TTS #AI #ML https://jasonppy.github.io/VoiceCraft_web/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ objectinspace, datajake1999, ppatel

KathyReid, 1 month ago to mastodon

A warm welcome to #Mastodon to @thorstenvoice - one of the best communicators about #ASR #TTS and #STT in the world. His #OpenSource #German #Deutsche dataset is in use in many places.

Please make Thorsten welcome 👋

#Introduction

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

thorstenvoice, 1 month ago

Thanks @potungthul for your nice welcome 😊.

To clear up the hashtags a little bit:
Think of the components of a voice assistant / smartspeaker.

You need #stt (speech-to-text) or #asr (automatic speech recognition) on the "input" side of a user request and #tts (text-to-speech) on the "output" side.

To throw in another technology - #nlp (natural language processing) is used in the "middle" to really understand what the user request is all about.

cc: @KathyReid

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ranfdev, 2 months ago to random

We need a dbus interface to get a system-wide Text To Speech provider, and Flatpak apps should be able to register themselves as TTS providers.

In GNOME settings there should be an option to disable the current TTS provider, open its settings or switch to another one. Similarly to how android manages multiple keyboards, which you can install from the play store.

The same goes for Speech To Text. You should be able to install your favorite STT provider, with your preferred voice, from the store

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ranfdev, 2 months ago

This requires integration from multiple GNOME components. I don't even know where to start, but I'd like to help.

#GNOME #tts #accessibility #a11y #AI

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

kaveinthran, 3 months ago to opensource

have anyone tried the new TTS metavoiceio/metavoice-1B-v0.1?
context, MetaVoice open sources a commercially permissive 1B base model for text-to-speech, supporting voice cloning and emotional speech synthesis
https://twitter.com/metavoiceio/status/1754983953193218193
#tts #mlai #openSource #ai #machineLearning

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jaybird110127, datajake1999

niavy, 3 months ago to android French

BON. J'ai besoin d'aide… #Android #ROM #crDroid

La ROM de mon smartphone n'a pas de système de synthèse vocale (#TTS). Geovelo m'invite à télécharger... le système #Google (Speech Recognition & Synthesis)

PAS ENVIE. Plus confiance.
Connaissez-vous un système TTS libre Android (pour LineageOS par exemple) qui supporte 🇨🇵🇬🇧 voire 🇪🇸🇵🇹 ?

Repouet TRÈS apprécié 🔄.

reply

expand (22)

collapse (22)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Phipe, MarSolRivas

kaveinthran, 3 months ago to history

Good, deep #thread on #tts that #blind People enjoy using, #History #eloquence #speech #ScreenReader #synthesizer #Disability @mastoblind @disabledvoices https://dragonscave.space/@MutedTrampet/109942593329052376

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ datajake1999

accessibleandroid, 3 months ago to android

We've just updated the list of languages with available TTS engines on Android, bringing the total number of supported languages to 83 with the new voices provided by the RHVoice and CerePlay TTS engines https://accessibleandroid.com/list-of-languages-with-available-tts-engines-on-android/ #Android #a11y #TTS

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ evilcookies98

jomo, 3 months ago to android German

Ich habe neulich gelernt, dass OsmAnd eine "German (Casual)" TTS Option hat, die nicht so ausführlich ist, wie die normale.

"Im Kreisel erste raus." oder "In 100 Metern links." ist wesentlich angenehmer, als komplett zugetextet zu werden.

#osmand #android #osm

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cantences, 3 months ago

@twomikecharlie
Sprichst du von einer vorinstallierten #TTS Engine?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

britt, 3 months ago to disability

Hey #Disability Mastodon…

What text to speech readers do you find to be the most comprehensive and most worth your money?

I’ve used the free version of Speechify for years but I don’t use it all the time due to limits in reading time/speed and the voices.

I would like to use a text to speech reader in browser, on iOS, for books and textbooks. #TTS
#Accessibility

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ GhostOnTheHalfShell, elduvelle, oblomov, trixter +1 more

modulux, 4 months ago to ai EN

Another interesting #TTS #AI system. I need to look closer into it in order to see if it's a voice cloning approach or something else: https://github.com/yl4579/StyleTTS2

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ alcinnz

tkk13909, 4 months ago to RickAndMorty

@JoeRess I was NOT prepared to hear Morty start talking in your voice in a Code Bullet video! Timestamp: 10:02
https://youtu.be/g39AagVW0s0

#RickAndMorty #TTS

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

KathyReid, 4 months ago to technology

Here's a #BookReview I wrote of Tobias Dengel and Karl Weber's "The Sound of the Future" - which claims that #voice #technology like #ASR, #TTS and #synthetic #speech are transformative, and that businesses should start to invest heavily in them.

While the book covers a lot of ground, it leaves many more critical questions unanswered in its unabashed techno-optimism.

https://blog.kathyreid.id.au/2024/01/04/book-review-the-sound-of-the-future-by-tobias-dengel-with-karl-weber/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

cstross, 4 months ago to random

https://www.theverge.com/2023/12/27/24016791/astrohaus-freewrite-alpha-digital-typewriter-e-ink

I have no idea how such a shitty product can exist. Same price buys you a Chromebook; there are plenty of software libre distraction-free writing apps out there (try opening a terminal and typing "vim"?).

Or you could chicken out and buy a Kindle Fire Max 11 with keyboard case for the same price.

Both of these let you type for more than a day on a charge: the only benefit of the freewrite alpha is an 80 hour battery, which is pointless with USB-C charging everywhere.

reply

expand (37)

collapse (37)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kkarhan, andypiper

kkarhan, 4 months ago

@cstross some #TechIlliterate even recommended this shit to me instead if a #Laptop when I was in school.
I told them unless it comes with the same #TTS voice as #StevenHawking has I don't want them to ever be allowed to make any technical decision or suggestion in their life!
Those things are like #TexasInstruments #calculators: an absolute #ripoff given even the shittiest #Netbook with the abundant #Intel #Z3735F #SoC running #OS1337 is more versatile.
And I literally just started that distro.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cstross

chikorita157, 4 months ago to random

Wait, there was a new version of AIVoice, AIVoice 2 that I didn't know about, and it works on Mac now. Useful for creating audio for listening compression? Well…

https://amausaan.tokyo/aivoice-2-an-improved-japanese-tts-program-to-check-out/

#TTS #AIVoice #learningjapanese

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...