Nvidia brings a new AI model to 'revolutionize' the audio industry: capable of creating music and modifying vocals
芊芊551
发表于 2024-12-8 18:59:55
204
0
0
According to reports, Nvidia has developed a new type of artificial intelligence (AI) model that can create sound effects, change people's pronunciation, and generate music using natural language prompts.
This model is named Fugatto, which stands for Founding Generative Audio Transformer Opus 1, and is a research project. Nvidia stated that it will not announce any plans to release this technology, but it may have a wide-ranging impact on industries ranging from music, entertainment to translation services.
Bryan Catanzaro, Vice President of Applied Deep Learning Research at NVIDIA, said in an interview, "The most exciting thing about Fugatto is that it has a model that you can ask it to make sound in some way, which really opens up your imagination of its application scope
He further explained that other models on the market, some can synthesize speech, some can add sound effects to music, but Fugatto can do all of them. Catanzaro said that it can be seen as a supplement to video and image generation models such as Stability AI's Stable Video Diffusion or OpenAI's Sora.
The most fundamental improvement here is... we are able to use language to synthesize audio, which I believe opens up new prospects for tools that people can use to create amazing audio, "he added.
According to Nvidia, Fugatto is the first basic model with emerging features, which means it can mix trained elements and follow "free-form instructions".
Specifically, the model can generate audio through standard text prompts and also handle the audio files you upload. So, if you have a document of someone speaking, you can translate that person's words into another language while making it sound like their voice. You can also choose a simple tune to make it sound like an orchestral performance, or add different beats to the music.
In addition, you can also upload a document for the model to read aloud in any voice you like. More importantly, you can instruct the model to produce sounds with emotional components.
However, Catanzaro also added that this model is not always perfect. Moreover, just like models that generate images and videos, Fugatto also raises concerns among artists, sound engineers, and professionals in related fields. But Catanzaro pointed out that his original intention was to hope that this technology could help musicians.
I hope this is a new tool for artists to explore. "" I think audio has always been a productive field of exploration. You know, when we acquire new audio tools, sometimes we acquire new forms of music, "he said.
LogoMoney.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表LogoMoney.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Thai Prime Minister meets with Nvidia CEO to strengthen cooperation in artificial intelligence
- Nvidia launches ExBody2 system to enhance bipedal robot balance and adaptability
- Elon Musk's AI becomes Silicon Valley darling, $6 billion financing luxury lineup revealed, "old friends" such as Nvidia, AMD added
- Attraction crushing wide base index! Retail investors net purchase $29.8 billion worth of Nvidia stocks in 2024
- Nvidia New Product Countdown: New 'Nuclear Bomb' RTX 5090 Coming Soon, B300 Coming Soon
- Over 210 billion yuan in explosive purchases! Retail investors' fierce pursuit 'of Nvidia investment bank, optimistic about next year's performance
- NVIDIA's new 'nuclear bomb' leaked!
- NVIDIA's latest statement! Robot 'ChatGPT Moment' is Coming, Bet on the Next Growth Driver
- Nvidia may launch robot 'brain' in the first half of next year, with the company's stock price increasing by over 176% since the beginning of this year
- Nvidia plans to release a new generation of humanoid robot computing platform in the first half of next year, supporting multimodal AI models
-
工信部党组书记李乐成会见德国汽车工业协会主席希尔德加德·穆勒 4月27日,工业和信息化部党组书记李乐成在北京会见德国汽车工业协会主席希尔德加德·穆勒,双方就深化中德汽车产业合作进行了交流。李乐成表 ...
- moonlightplay
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
美国总统特朗普近日在接受媒体采访时表示,他第二个任期不仅治理美国,也治理全世界。 特朗普于4月24日接受了《大西洋》(The Atlantic)月刊采访,这段专访于4月28日发布。 “第一次当总统时,我要做两 ...
- lfancn
- 昨天 12:10
- 支持
- 反对
- 回复
- 收藏
-
4月29日凌晨,阿里巴巴开源新一代通义千问模型Qwen3(千问3),参数量为DeepSeek-R1的三分之一,成本大幅下降。据称,该模型性能全面超越R1、OpenAI-o1等领先模型,登顶全球最强开源模型。 千问3是国内首个“ ...
- 风雨中行走
- 前天 10:32
- 支持
- 反对
- 回复
- 收藏
-
东风有限回应武汉工厂关停事宜 据第一财经,4月29日,东风汽车有限公司证实,该公司武汉工厂目前正常运行,后续也不会关停。东风有限称,该公司将在东风与日产母公司的支持下平稳有序发展,持续加速向新能源 ...
- king19831101
- 昨天 09:56
- 支持
- 反对
- 回复
- 收藏