Chatterbox Turbo, a new open-source text-to-speech model from Resemble AI, clones voices from five seconds of audio and generates speech in under 150 milliseconds. The startup claims it outperforms Elevenlabs.<br /> The article Resemble AI drops Chatterbox Turbo, an open-source text-to-speech model that clones voices in five seconds appeared first on The Decoder. [...]
Chinese AI startup Z.ai, known for its powerful, open source GLM family of large language models (LLMs), has introduced GLM-5-Turbo, a new, proprietary variant of its open source GLM-5 model aimed at [...]
Hot on the heels of its new $140 million Series D fundraising round, the multi-modal enterprise AI media creation platform fal.ai, known simply as "fal" or "Fal" is back with a yea [...]
The enterprise voice AI market is in the middle of a land grab. ElevenLabs and IBM announced a collaboration just this week to bring premium voice capabilities into IBM's watsonx Orchestrate plat [...]
Meta has just released a new multilingual automatic speech recognition (ASR) system supporting 1,600+ languages — dwarfing OpenAI’s open source Whisper model, which supports just 99. Is architectu [...]
Is China picking back up the open source AI baton? Z.ai, also known as Zhupai AI, a Chinese AI startup best known for its powerful, open source GLM family of models, has unveiled GLM-5.1 today under a [...]
Voice AI is moving faster than the tools we use to measure it. Every major AI lab — OpenAI, Google DeepMind, Anthropic, xAI — is racing to ship voice models capable of natural, real-time conversat [...]