Skynet Report

Meta has launched Seamless, a new system for preserving expression and improving real-time translation using AI.

The system includes two new models. The first is SeamlessExpressive, which preserves expression in speech-to-speech translation, and the second is SeamlessStreaming, which delivers “state-of-the-art results with around two seconds of latency”.

The models are based on the latest version of the company’s foundational model, SeamlessM4T, and are designed to improve automatic speech recognition, speech-to-speech, speech-to-text and text-to-speech capabilities.

Alongside the models, Meta is releasing metadata, data and data alignment tools to help the research community to improve on the work.

Skynet Report

Seamless Communication by Meta