Optimal Speech Intelligibility Improvement for Varying Car Noise Characteristics

Biswas R.; Nathwani K.; Hafiz F.; Swain A.

Full metadata record

DC Field	Value	Language
dc.contributor.author	Biswas R.	en_US
dc.contributor.author	Nathwani K.	en_US
dc.contributor.author	Hafiz F.	en_US
dc.contributor.author	Swain A.	en_US
dc.date.accessioned	2023-11-30T08:33:16Z	-
dc.date.available	2023-11-30T08:33:16Z	-
dc.date.issued	2022	-
dc.identifier.issn	1939-8018	-
dc.identifier.other	EID(2-s2.0-85139705570)	-
dc.identifier.uri	https://dx.doi.org/10.1007/s11265-022-01815-x	-
dc.identifier.uri	http://localhost:8080/xmlui/handle/123456789/434	-
dc.description.abstract	The present study proposes a novel method for speech intelligibility improvement by optimally shifting the formants using a trapezoidal voice transformation function. The shaping parameters of this function are determined by maximizing various performance measures using a comprehensive learning particle swarm optimization (CLPSO) algorithm. These measures include the short time objective intelligibility (STOI), perceptual evaluation of speech quality (PESQ) and signal to distortion ration (SDR). The proposed method does not requires a priori knowledge about the noise statistics in designing the voice transformation function. Although, the shaping parameters are obtained at specific SNRs, a Gaussian process (GP) regression model is trained to compute these parameters for arbitrary SNRs. The performance of the proposed method is demonstrated on various databases which include Hearing In Noise Test (HINT) a French database, NOIZEUS (ENGLISH) and CHAINS (ENGLISH) databases at different levels of engine noises arising from a running car at various speeds. The results of the investigation convincingly demonstrate that the proposed approach could improve the speech intelligibility, while preserving the quality. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.	en_US
dc.language.iso	en	en_US
dc.publisher	Springer	en_US
dc.source	Journal of Signal Processing Systems	en_US
dc.subject	Gaussian process regression	en_US
dc.subject	Near-end	en_US
dc.subject	PESQ	en_US
dc.subject	SDR	en_US
dc.subject	Speech intelligibility	en_US
dc.subject	STOI	en_US
dc.title	Optimal Speech Intelligibility Improvement for Varying Car Noise Characteristics	en_US
dc.type	Journal Article	en_US
Appears in Collections:	Journal Article