Meta Releases New AI Model for Speech and Text Translation

UTC by Darya Rudz · 3 min read
Meta Releases New AI Model for Speech and Text Translation
Photo: Unsplash

Meta expects SeamlessM4T to be much more effective than its competitors since its single approach reduces errors and delays, increasing the efficiency and quality of the translation process.

Meta Platforms Inc (NASDAQ: META) has announced the launch of a new AI-powered model that allows users to communicate effortlessly through speech and text across different languages. Called SeamlessM4T, the multilingual model supports about 100 languages. It can perform speech-to-text, speech-to-speech, text-to-speech, and text-to-text translations.

SeamlessM4T has been released under a research license to allow researchers and developers to build on this work. Along with SeamlessM4T, Meta has also introduced the metadata of SeamlessAlign, the biggest open multimodal translation dataset to date that totals 270,000 hours of mined speech and text alignments.

Paco Guzmán, a research scientist manager at Meta, commented:

“It even supports changing languages on the fly. This allows [it] to better support populations who naturally mix languages in their day-to-day speech.”

Meta expects SeamlessM4T to be much more effective than its competitors since its single approach reduces errors and delays, increasing the efficiency and quality of the translation process. The company has developed the model based on the finding from its previous similar projects.

Last year, Meta rolled out No Language Left Behind (NLLB), a first-of-its-kind AI model that delivers translations directly between 200 languages. Notably, the model is free of charge, it gives users the opportunity to access web content in their native language and communicate with anyone, anywhere.

Another project developed by Meta was Universal Speech Translator which targeted Hokkien – a widely spoken language within the Chinese diaspora that does not have a standard written form. Out of the more than 7,000 living languages, more than 40% of languages are primarily oral and do not have a standard or widely known writing system. Therefore, Meta came up with a solution that can extend beyond Hokkien to other unwritten languages.

Earlier this year, Meta announced the Massively Multilingual Speech model that supports automatic speech recognition (ASR) and text-to-speech synthesis (TTS) in over 1,100 languages as well as language identification (LID) in over 4,000 languages.

AI-Driven Translation Industry

As a result of the AI boom, the translation industry has been revolutionized, with language barriers being eliminated and translation becoming easier. By utilizing neural networks, AI translation tools can detect patterns in data and accurately translate text from one language to another. AI provides extremely fast, high-volume translations that are increasingly accurate and precise, which allows humans speaking different languages to communicate effectively.

Among the most widely used AI-powered translation tools are Google Translate, Microsoft Translator, DeepL, Systran, Trados Enterprise, Sonix, Alexa Translations, and more.

As AI technology continues to evolve, we can expect to see significant advancements in the field of translation. In the future, AI translation tools are expected to become more personalized, allowing users to customize their translations to meet their needs. The global machine translation market size is expected to reach almost $4.1 billion in 2030 from $812.6 million in 2021.

Artificial Intelligence, News, Technology News
Related Articles