ElevenLabs, the AI-powered platform for creating synthetic voices, has officially launched its platform out of beta with support for over 30 languages. Using a new AI model developed in-house, ElevenLabs’ tools can now automatically identify languages such as Korean, Dutch, and Vietnamese, and generate “emotionally rich” speech in those languages. Customers can use the platform’s voice-cloning tool to speak across almost 30 languages without the need to type text.
ElevenLabs CEO and co-founder Mati Staniszewski stated that the company’s goal is to make all content universally accessible in any language and voice. The release of this new model brings the company one step closer to achieving that dream. Staniszewski believes that its text-to-speech generation tools help level the playing field and bring top-quality spoken audio capabilities to all creators.
ElevenLabs, founded by Staniszewski and his childhood friend Piotr Dabkowski, has drawn attention in recent months for both positive and negative reasons. Inspired by the subpar dubbing of American movies they experienced growing up in Poland, the duo set out to design a platform that could do better using AI. The platform gained popularity quickly due to the high quality of its generated voices and its generous free tier. However, bad actors exploited the platform, using it to share hateful messages mimicking celebrities like Emma Watson.
In response to these incidents, ElevenLabs plans to introduce new safeguards, such as limiting voice cloning to paid accounts and implementing an AI detection tool. However, the platform still faces controversy regarding its potential threat to the voice acting industry. Voice actors are increasingly being asked to sign away rights to their voices, allowing clients to use AI to generate synthetic versions that could potentially replace them. Activision Blizzard, one of the largest game publishers globally, is reportedly working on AI-assisted “voice cloning” tools.
Despite these concerns, ElevenLabs sees its work as the natural progression of the industry. The company has collaborated with publishers like Storytel, media platforms like TheSoul Publishing and MNTN, and game publishers like Embark Studios and Paradox Interactive. ElevenLabs claims to have over a million registered users across the creative, entertainment, and publishing spaces, who have created 10 years’ worth of audio content.
Having recently raised $19 million from investors, including Andreessen Horowitz and DeepMind co-founder Mustafa Suleyman, ElevenLabs plans to extend its AI models to voice dubbing. The company aims to build a foundation that can transfer emotions and intonation from one language to another. Additionally, ElevenLabs intends to introduce a mechanism for users to share voices on the platform, although specific details have not been disclosed.
How does ElevenLabs address the potential misuse of its AI-powered voice-cloning technology and ensure responsible use?
ElevenLabs has drawn attention for both positive and negative reasons. On the positive side, the platform’s AI-powered voice-cloning technology has impressed many with its ability to generate synthetic voices that are difficult to distinguish from real human voices. This has led to exciting possibilities in various industries, including entertainment, voiceover, and language learning.
However, the technology has also raised concerns about potential misuse and ethical implications. The ability to generate realistic voices has sparked worries about the spread of misinformation, deepfakes, and impersonation. These concerns highlight the need for responsible use of such technology and the development of appropriate safeguards.
Despite the controversy, ElevenLabs continues to focus on its mission of making content accessible in any language and voice. The launch of its platform out of beta with support for over 30 languages demonstrates the company’s commitment to expanding its reach and creating tools that empower creators.
With the advancements made in its new AI model, ElevenLabs can now automatically identify and generate speech in languages like Korean, Dutch, and Vietnamese. This expands the platform’s capabilities and allows customers to communicate in a more personalized and emotionally rich manner.
The voice-cloning tool provided by ElevenLabs is particularly noteworthy, as it allows users to speak across almost 30 languages without the need to type text. This not only saves time and effort but also enables individuals to communicate in their preferred language, regardless of their typing skills or limitations.
Mati Staniszewski, CEO and co-founder of ElevenLabs, expressed his excitement about the platform’s potential to level the playing field in terms of spoken audio capabilities. By providing top-quality synthetic voices to all creators, regardless of location or language proficiency, ElevenLabs is democratizing the voiceover industry and opening up new opportunities for content creators around the world.
ElevenLabs’ launch of its new AI model marks a significant milestone in its journey towards universal accessibility. While the company acknowledges the challenges associated with its technology, it remains committed to refining its tools and ensuring responsible use. As the platform continues to evolve, it will be interesting to see how ElevenLabs addresses the ethical concerns surrounding synthetic voices and contributes to the advancement of voice-related technologies.