SeamlessM4T: Multimodal Model for Speech Translation
Meta releases SeamlessM4T, a general multilingual speech/text model claimed to surpass OpenAI’s Whisper. It’s available on GitHub, and everything can be used for free in a non-commercial setting....