chikim, to random
@chikim@mastodon.social avatar

I created samples for all 58 voices for xtts-v2. Hopefully it makes it easier for someone to choose a speaker. https://we.tl/t-9vWd1gO3EN

VE3RWJ, to AdobePhotoshop
@VE3RWJ@mastodon.radio avatar

#ElevenLabs does not disappoint.

They've just released an #app that will read whatever document you throw at it.

You can find out more Here:

https://apps.apple.com/ca/app/elevenlabs-reader-ai-audio/id6479373050
#AI #A11Y #TTS #blind #dyslexia

jbzfn, to c64
@jbzfn@mastodon.social avatar

Making 80s Computers Talk | 1980s Commodore Speech Synthesizer | Kari

https://www.youtube.com/watch?v=1ip7K0CaC7Y

jedie, to fdroid German
@jedie@chaos.social avatar

Gibt es eine bessere Lösung für #TTS mit #calyxos #lineageos und co?

https://www.kuketz-blog.de/android-sprachausgabe-tts-engine-bei-custom-roms-aktivieren/

Listet #RHVoice per #fdroid auf,kann aber noch kein deutsch.

#eSpeak findet das berruntergeladene #mbrola model Datei nicht.

jedie, to android German
@jedie@chaos.social avatar

Gibt es was neues zu : (-Engine) bei Custom-ROMs aktivieren • Kuketz IT-Security Blog
https://www.kuketz-blog.de/android-sprachausgabe-tts-engine-bei-custom-roms-aktivieren/

bekomme ich nicht zum laufen, findet das runtergeladene Modell Datei nicht.

hat immer noch kein deutsch.

eeejay, to GNOME
@eeejay@mastodon.social avatar

A demo of a sample app, the voices used in order are: eSpeakNG's "Andy" variant, MBROLA US2, and Piper's Amy. You can observe the different features like word tracking and quality.

video/mp4

skinnylatte, to accessibility
@skinnylatte@hachyderm.io avatar

Opening soon: 2 remote jobs for accessibility lead with the federal government. Open to US citizens and nationals only, with background check.

https://join.tts.gsa.gov/join/Solutions-Accessibility-Lead-April2024/

Info on how to write a federal resume: https://handbook.tts.gsa.gov/hiring-staying-or-changing-jobs/resume

pixelate, to accessibility
@pixelate@tweesecake.social avatar

Honestly, since the fast variants of the voices are a thing, I think I could really switch to the Sonata Neural Voices in NVDA full time. Now remember folks, these are AI voices. Scary, untrustworthy, AI voices that will smear your reputation all over fedi for using these voices! See, they even react to exclamation marks! Isn't that scary? :) Nah, the worst that'll happen, mainly with the HFC male and female, is that big numbers are garbled together. But every other voice does fine. I use Amy for work, and HFC for reading because those are among the most lively voices I've ever heard. And amazingly enough, we can make our own new voices. So, some people, from the Github repo's readme, are building more professional voices. And there are already versions of old TTS engines from the past that have been brought back to some semblence of life with this tech.

ErikJonker, to ai Dutch
@ErikJonker@mastodon.social avatar

Dangerous technology, they decided not to release it yet, but it will be a matter of time before it's available.
https://www.bloomberg.com/news/articles/2024-03-29/openai-previews-new-audio-tool-that-can-read-text-mimic-voices

ppatel, to Dragonlance
@ppatel@mstdn.social avatar

OpenAI debuts Voice Engine, which lets users generate synthetic copy of a voice from a 15-second sample, available to around 100 partners, including HeyGe. In other words, it's not available to the public just yet.

https://techcrunch.com/2024/03/29/openai-custom-voice-engine-preview/

chikim, to ai
@chikim@mastodon.social avatar

Maybe we have an open source competitor for ElevenLabs? Check out their demo which they switch between original and synthesized. I can't tell. lol Apparently they're going to fully open source codebase and model weights. https://jasonppy.github.io/VoiceCraft_web/

KathyReid, to mastodon
@KathyReid@aus.social avatar

A warm welcome to #Mastodon to @thorstenvoice - one of the best communicators about #ASR #TTS and #STT in the world. His #OpenSource #German #Deutsche dataset is in use in many places.

Please make Thorsten welcome 👋

#Introduction

thorstenvoice,
@thorstenvoice@techhub.social avatar

Thanks @potungthul for your nice welcome 😊.

To clear up the hashtags a little bit:
Think of the components of a voice assistant / smartspeaker.

You need #stt (speech-to-text) or #asr (automatic speech recognition) on the "input" side of a user request and #tts (text-to-speech) on the "output" side.

To throw in another technology - #nlp (natural language processing) is used in the "middle" to really understand what the user request is all about.

cc: @KathyReid

ranfdev, to random
@ranfdev@linuxrocks.online avatar

We need a dbus interface to get a system-wide Text To Speech provider, and Flatpak apps should be able to register themselves as TTS providers.

In GNOME settings there should be an option to disable the current TTS provider, open its settings or switch to another one. Similarly to how android manages multiple keyboards, which you can install from the play store.

The same goes for Speech To Text. You should be able to install your favorite STT provider, with your preferred voice, from the store

ranfdev,
@ranfdev@linuxrocks.online avatar

This requires integration from multiple GNOME components. I don't even know where to start, but I'd like to help.

kaveinthran, to opensource

have anyone tried the new TTS metavoiceio/metavoice-1B-v0.1?
context, MetaVoice open sources a commercially permissive 1B base model for text-to-speech, supporting voice cloning and emotional speech synthesis
https://twitter.com/metavoiceio/status/1754983953193218193

niavy, to android French
@niavy@masto.bike avatar

BON. J'ai besoin d'aide…

La ROM de mon smartphone n'a pas de système de synthèse vocale (). Geovelo m'invite à télécharger... le système (Speech Recognition & Synthesis)

PAS ENVIE. Plus confiance.
Connaissez-vous un système TTS libre Android (pour LineageOS par exemple) qui supporte 🇨🇵🇬🇧 voire 🇪🇸🇵🇹 ?

Repouet TRÈS apprécié 🔄.

kaveinthran, to history
accessibleandroid, to android
@accessibleandroid@mastodon.social avatar

We've just updated the list of languages with available TTS engines on Android, bringing the total number of supported languages to 83 with the new voices provided by the RHVoice and CerePlay TTS engines https://accessibleandroid.com/list-of-languages-with-available-tts-engines-on-android/

jomo, to android German
@jomo@mstdn.io avatar

Ich habe neulich gelernt, dass OsmAnd eine "German (Casual)" TTS Option hat, die nicht so ausführlich ist, wie die normale.

"Im Kreisel erste raus." oder "In 100 Metern links." ist wesentlich angenehmer, als komplett zugetextet zu werden.

cantences,

@twomikecharlie
Sprichst du von einer vorinstallierten Engine?

britt, to disability
@britt@mstdn.games avatar

Hey Mastodon…

What text to speech readers do you find to be the most comprehensive and most worth your money?

I’ve used the free version of Speechify for years but I don’t use it all the time due to limits in reading time/speed and the voices.

I would like to use a text to speech reader in browser, on iOS, for books and textbooks.

modulux, to ai EN

Another interesting system. I need to look closer into it in order to see if it's a voice cloning approach or something else: https://github.com/yl4579/StyleTTS2

tkk13909, to RickAndMorty
@tkk13909@fosstodon.org avatar

@JoeRess I was NOT prepared to hear Morty start talking in your voice in a Code Bullet video! Timestamp: 10:02
https://youtu.be/g39AagVW0s0

KathyReid, to technology
@KathyReid@aus.social avatar

Here's a I wrote of Tobias Dengel and Karl Weber's "The Sound of the Future" - which claims that like , and are transformative, and that businesses should start to invest heavily in them.

While the book covers a lot of ground, it leaves many more critical questions unanswered in its unabashed techno-optimism.

https://blog.kathyreid.id.au/2024/01/04/book-review-the-sound-of-the-future-by-tobias-dengel-with-karl-weber/

cstross, to random
@cstross@wandering.shop avatar

https://www.theverge.com/2023/12/27/24016791/astrohaus-freewrite-alpha-digital-typewriter-e-ink

I have no idea how such a shitty product can exist. Same price buys you a Chromebook; there are plenty of software libre distraction-free writing apps out there (try opening a terminal and typing "vim"?).

Or you could chicken out and buy a Kindle Fire Max 11 with keyboard case for the same price.

Both of these let you type for more than a day on a charge: the only benefit of the freewrite alpha is an 80 hour battery, which is pointless with USB-C charging everywhere.

kkarhan,
@kkarhan@mstdn.social avatar

@cstross some even recommended this shit to me instead if a when I was in school.
I told them unless it comes with the same voice as has I don't want them to ever be allowed to make any technical decision or suggestion in their life!
Those things are like : an absolute given even the shittiest with the abundant running is more versatile.
And I literally just started that distro.

chikorita157, to random
@chikorita157@sakurajima.moe avatar

Wait, there was a new version of AIVoice, AIVoice 2 that I didn't know about, and it works on Mac now. Useful for creating audio for listening compression? Well…

https://amausaan.tokyo/aivoice-2-an-improved-japanese-tts-program-to-check-out/

#TTS #AIVoice #learningjapanese

  • All
  • Subscribed
  • Moderated
  • Favorites
  • provamag3
  • GTA5RPClips
  • rosin
  • Youngstown
  • everett
  • khanakhh
  • slotface
  • InstantRegret
  • Durango
  • ngwrru68w68
  • kavyap
  • modclub
  • DreamBathrooms
  • mdbf
  • JUstTest
  • magazineikmin
  • thenastyranch
  • cubers
  • cisconetworking
  • osvaldo12
  • ethstaker
  • normalnudes
  • Leos
  • tester
  • megavids
  • tacticalgear
  • anitta
  • lostlight
  • All magazines