Stability AI releases a new audio model that can create 6-minute songs
Stability AI has released Stability Audio 3.0, a new family of audio models capable of generating professional-grade music. The top models can create compositions lasting over six minutes, a significant increase from previous versions. Stability AI emphasizes its use of fully licensed data and aims to support professional musicians with new tools.
Stability AI, the company renowned for Stable Diffusion, has introduced Stability Audio 3.0, a new family of audio models. These models are designed to generate professional-grade music, with the most advanced versions capable of producing compositions over six minutes long. This marks a substantial improvement, more than doubling the length achievable by its predecessor, Stable Audio 2.0.
The Stability Audio 3.0 family includes four models: small SFX, small, medium, and large. The small models are suited for on-device sound and music generation up to two minutes. The medium and large models can create full compositions of 6 minutes and 20 seconds, maintaining musical structure and melodic tone.
Stability AI is committed to open access for some of its models, with the small SFX, small, and medium models available with open weights for public use and modification. The large model, however, is accessible primarily through an API and paid self-hosting services, with enterprise licenses required for companies exceeding $1 million in revenue.
The company has strategically partnered with major music labels, including Warner Music Group and Universal Music Group, to ensure its models are built on fully licensed data. This approach addresses growing concerns in the industry regarding data licensing and intellectual property.
In a move to bolster its professional music offerings, Stability AI is developing a new suite of products for musicians. The company has also brought on Ethan Kaplan, former chief digital officer at Universal Audio and Fender, to lead these initiatives, further signaling its commitment to the professional music sector.
Related articles
The AI world is getting ‘loopy’
AI models are taking a significant leap forward with the adoption of "agentic loops," where AI agents continuously prompt each other to improve code and solve complex problems. This approach, though potentially resource-intensive, promises to unlock new levels of autonomous problem-solving and efficiency in AI applications.
Codex-maxxing for long-running work
Codex is increasingly being used by organizations to support long-running projects that go beyond a single prompt. This whitepaper by Jason Liu offers practical strategies for leveraging Codex as a persistent workspace, managing complex workflows and sustaining progress.
Nobel laureate John Jumper is leaving DeepMind for rival Anthropic
Nobel laureate John Jumper is departing Google DeepMind to join its competitor, Anthropic, after dedicating nearly nine years to DeepMind, where he led the AlphaFold team. Jumper, who shared a Nobel Prize for his work on AlphaFold, expressed gratitude for his time at DeepMind while looking forward to new endeavors.
