Tech

Stability AI releases a sound generator

Amjad Ali June 5, 2024

0 3 2 minutes read

Stability AI, the startup behind the AI-powered art generator Stable Diffusion, has released an open AI model for generating sounds and songs that it claims was trained exclusively on royalty-free recordings.

Called Stable Audio Open, the generative model takes a text description (e.g. “Rock beat played in a treated studio, session drumming on an acoustic kit”) and outputs a recording up to 47 seconds in length. The model was trained using around 486,000 samples from free music libraries FreeSound and the Free Music Archive.

Stability AI says that the model can be used to create drum beats, instrument riffs, ambient noises and “production elements” for videos, films and TV shows as well as to “edit” existing songs or apply the style of one song (e.g. smooth jazz) to another.

“A key benefit of this open source release is that users can fine-tune the model on their own custom audio data,” Stability AI wrote in a post on its corporate blog. “For example, a drummer could fine-tune on samples of their own drum recordings to generate new beats.”

Stable Audio Open has its limitations, however. It can’t produce full songs, melodies or vocals — at least not good ones. Stability AI says that it’s not optimized for this, and suggests that users looking for those capabilities opt for the company’s premium Stable Audio service.

Stable Audio Open also can’t be used commercially; its terms of service prohibit it. And it doesn’t perform equally well across musical styles and cultures or with description in languages other than English — biases Stability AI blames on the training data.

“The source of data is potentially lacking diversity and all cultures are not equally represented in the data set,” Stability AI writes in a description of the model. “The generated samples from the model will reflect the biases from the training data.”

Stability AI — which has long struggled to turn its flagging business around — became the subject of controversy recently after its VP of generative audio, Ed Newton-Rex, resigned over disagreement with the company’s stance that training generative AI models on copyrighted works constitutes “fair use.” Stable Audio Open would appear to be an attempt to turn that narrative around, while at the same time not-so-subtly advertising Stability AI’s paid products.

Source link

Amjad Ali June 5, 2024

0 3 2 minutes read

Foreperson (CDL Required) – ATE

Fleet Preventative Maintenance Technician

Groundperson (Driver License Required) – ATE

Bucket Operator (CDL Required) – ATE

Location Maintenance & Repair Technician – Minneapolis/St. Paul, MN area – Temporary Part Time

Groundperson (Driver License Required) – ATE

Account Executive, Insurance

Car Wash Attendant

Foreperson – Union – ATE

Bucket Operator (CDL Required) – ATE

Stability AI releases a sound generator

Amjad Ali

Leave a Reply Cancel reply

Entry Level – Work At Home (100% Remote) FocusGroup Panelist

Patient Care Technician – PCT CCHT – Dialysis

Customer Success Manager – SAP Academy for Customer Success – Pittsburgh (Hybrid)

Warehouse Associate – 3rd Shift with Differentials

Warehouse Associate Mid Shift

Spaghett – A Beautiful Mess

Rosé Spritz | Julie Blanner

7 of My Best Tips for Hosting a Dinner Party Everyone Will Enjoy | Wit & Delight

23 Apple Dessert Recipes Perfect For Fall

How to Host a Casual Dinner Party and 5 Tips for Easy Entertaining | Wit & Delight

How to Grow and Care for a Money Tree

With Product You Purchase

Subscribe to our mailing list to get the new updates!

Sales Manager - World Pakistan

Experienced Product Manager, User Rights- TikTok Shop - Seattle

Related Articles