论文标题

用甘斯感知音乐质量

Perceiving Music Quality with GANs

论文作者

Hilmkil, Agrin, Thomé, Carl, Arpteg, Anders

论文摘要

已经开发了几种方法来评估在变换下的音频感知质量,例如有损压缩。但是,他们需要未更改内容的配对参考信号,这限制了它们在不可用的应用程序中使用。这阻碍了音频产生和样式转移的进展,在这种情况下,一种无参考质量评估方法将允许在方法之间进行更多可重现的比较。我们建议在大型音乐库上培训gan,并将其歧视器作为对音乐质量的无参考质量评估度量。这种方法是无监督的,不需要访问降级的材料,并且可以为音乐的各个领域调整。在与448名人类受试者的听力测试中,参与者评级为专业产生的音乐曲目,并以不同级别和类型的信号降解(例如浪潮造成失真和低通滤波)进行降解,我们建立了人类评级材料的数据集。通过使用人类评分的数据集,我们表明歧视者得分与主观评分显着相关,这表明该方法可用于创建不引用的音乐音频质量评估措施。

Several methods have been developed to assess the perceptual quality of audio under transforms like lossy compression. However, they require paired reference signals of the unaltered content, limiting their use in applications where references are unavailable. This has hindered progress in audio generation and style transfer, where a no-reference quality assessment method would allow more reproducible comparisons across methods. We propose training a GAN on a large music library, and using its discriminator as a no-reference quality assessment measure of the perceived quality of music. This method is unsupervised, needs no access to degraded material and can be tuned for various domains of music. In a listening test with 448 human subjects, where participants rated professionally produced music tracks degraded with different levels and types of signal degradations such as waveshaping distortion and low-pass filtering, we establish a dataset of human rated material. By using the human rated dataset we show that the discriminator score correlates significantly with the subjective ratings, suggesting that the proposed method can be used to create a no-reference musical audio quality assessment measure.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源