Stability AI has introduced Stable Audio 3.0, which promises better song structure, higher sound quality, and longer musical compositions. The release shows how generative AI is moving beyond text and visual art into music production. The launch comes at a time when the UAE holds a leading position in AI adoption.
The new family of audio models can generate songs up to six minutes long using licensed training data. According to TechCrunch, this extends the length of music that AI models can generate, with the company emphasizing its use of properly licensed content for training.
Six minutes represents a substantial increase over previous AI audio models that generated clips ranging from 30 seconds to 2 minutes. This duration allows for complete song structures with verses, choruses, and bridges rather than short loops.
According to Stability AI, the models were trained on licensed material, which makes Stable Audio 3.0 a more ethically defensible choice for commercial purposes. Creators can produce full background scores for their videos, podcasts, or other commercial works without stitching together multiple clips. Musicians will be able to create full demos and explore ideas without traditional recording equipment.
The licensed training data is expected to help creators use these tracks commercially with less legal uncertainty.
Stability AI has not yet announced pricing details or specific availability dates for Stable Audio 3.0. The company describes it as a 'family of audio models', which suggests that multiple tiers or specialized versions may be available.
Access methods and subscription options are still unclear, with more details expected in the coming weeks. Previous Stable Audio versions offered both free and paid tiers with different usage limits and capabilities. Stability AI's move reflects the industry's push towards longer, more sophisticated AI-generated content.