Please use this identifier to cite or link to this item: http://10.10.120.238:8080/xmlui/handle/123456789/658
Title: Nonintrusive Perceptual Audio Quality Assessment for User-Generated Content Using Deep Learning
Authors: Mumtaz D.
Jakhetiya V.
Nathwani K.
Subudhi B.N.
Guntuku S.C.
Keywords: Audio quality assessment
deep learning
gated recurrent unit (GRU)
non-intrusive quality metric
user-generated multimedia (UGM)
Issue Date: 2022
Publisher: IEEE Computer Society
Abstract: With the boom of social media communication, teleconferencing, and online classes, audiovisual communication over bandwidth-strained networks has become an integral part of our lives. Consequently, the growing demand for the quality of experience necessitates developing algorithms to measure and enrich user experience. Prior studies have mainly focused on assessing speech quality and intelligibility with reference to audio quality assessment, while other categories in user-generated multimedia (UGM) are less explored. Moreover, frequency-domain properties of speech and UGM audio are significantly different from each other. Furthermore, there is a lack of a standard dataset for the quality assessment of UGM. Considering these limitations, in this article, we first develop the IIT-JMU-UGM audio dataset consisting of 1150 audio clips, with diverse context, content, and types of degradation commonly observed in real-world scenarios and annotated with the subjective quality scores. Finally, we propose a non-intrusive audio quality assessment metric using a stacked gated-recurrent-unit-based deep learning framework. The proposed model outperforms several baseline methods, including state-of-the-art non-intrusive and intrusive approaches. The resulting Pearson's correlation coefficient of 0.834 indicates that the proposed method efficiently mirrors human auditory perception. © 2005-2012 IEEE.
URI: https://dx.doi.org/10.1109/TII.2021.3139010
http://localhost:8080/xmlui/handle/123456789/658
ISSN: 1551-3203
Appears in Collections: Journal Article

Files in This Item:
There are no files associated with this item.