Nvidia shows AI model that can modify voices, generate novel sounds
Nvidia on Monday showed a new artificial intelligence model for generating music and audio that can modify voices and generate novel sounds - technology aimed at the producers of music, films and video games. Nvidia, the world's biggest supplier of chips and software used to create AI systems, said it does not have immediate plans to publicly release the technology, which it calls Fugatto, short for Foundational Generative Audio Transformer Opus 1. Santa Clara, California-based Nvidia's version generates sound effects and music from a text description, including novel sounds such as making a trumpet bark like a dog. "If we think about synthetic audio over the past 50 years, music sounds different now because of computers, because of synthesizers," said Bryan Catanzaro, vice president of applied deep learning research at Nvidia.
Discover Related

Nvidia's new AI tool can create sounds never heard before, could revolutionise music

Nvidia's new AI tool can create sounds never heard before, could revolutionise music

Meta introduces Movie Gen text-to-video-and-sound generator

Nations building their own AI models add to Nvidia's growing chip demand
