Skip navigation

Please use this identifier to cite or link to this item: http://10.10.120.238:8080/xmlui/handle/123456789/232
Title: Speech intelligibility enhancement using an optimal formant shifting approach
Authors: Nathwani K.
Hafiz F.
Swain A.
Biswas R.
Keywords: CLPSO
HINT
Intelligibility Enhancement
Optimal Formant Shifting
STOI
Issue Date: 2021
Publisher: IEEE Computer Society
Abstract: The present study proposes a novel delta function-based optimal shift in formants for enhancing the near-end speech intelligibility. The delta function being used here is trapezoidal in shape. The shaping parameters of this delta function are determined using comprehensive learning particle swarm optimization (CLPSO) which maximizes the short time objective intelligibility (STOI) of speech sequences. The proposed method does not require the knowledge of noise statistics in designing the delta function. Further, the proposed method does not require post-processing in terms of the computation of smoothing of the shifted formants. The performance of the proposed method is illustrated using speech signals from the Hearing In Noise Test (HINT) French database by including the engine noise from a car running at 130 km/h. The results of the investigation, at various SNRs, convincingly demonstrate that the optimal delta function (function with the optimized parameters) could significantly improve the speech intelligibility at very low SNRs while preserving the quality and naturalness of the sound. © 2021 IEEE.
URI: https://dx.doi.org/10.1109/ISPA52656.2021.9552080
http://localhost:8080/xmlui/handle/123456789/232
ISBN: 978-1665426398
ISSN: 1845-5921
Appears in Collections:Conference Paper

Files in This Item:
There are no files associated with this item.
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.