Skynet Report

Stability AI has introduced Stable Audio Open, an open-source text-to-audio model that generates up to 47 seconds of audio samples, sound effects, and production elements.

The model enables users to create drum beats, instrument riffs, ambient sounds, and foley recordings using text prompts. It also allows for audio variations and style transfer of audio samples.

Stable Audio Open is a more specialised model compared to Stability AI’s commercial product, which can produce full tracks up to three minutes long.

This open-source model is trained on audio data from Freesound and the Free Music Archive, respecting creator rights.

The model weights are available on Hugging Face, and users can download and explore its capabilities.