Skip navigation

Please use this identifier to cite or link to this item: http://10.10.120.238:8080/xmlui/handle/123456789/434
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBiswas R.en_US
dc.contributor.authorNathwani K.en_US
dc.contributor.authorHafiz F.en_US
dc.contributor.authorSwain A.en_US
dc.date.accessioned2023-11-30T08:33:16Z-
dc.date.available2023-11-30T08:33:16Z-
dc.date.issued2022-
dc.identifier.issn1939-8018-
dc.identifier.otherEID(2-s2.0-85139705570)-
dc.identifier.urihttps://dx.doi.org/10.1007/s11265-022-01815-x-
dc.identifier.urihttp://localhost:8080/xmlui/handle/123456789/434-
dc.description.abstractThe present study proposes a novel method for speech intelligibility improvement by optimally shifting the formants using a trapezoidal voice transformation function. The shaping parameters of this function are determined by maximizing various performance measures using a comprehensive learning particle swarm optimization (CLPSO) algorithm. These measures include the short time objective intelligibility (STOI), perceptual evaluation of speech quality (PESQ) and signal to distortion ration (SDR). The proposed method does not requires a priori knowledge about the noise statistics in designing the voice transformation function. Although, the shaping parameters are obtained at specific SNRs, a Gaussian process (GP) regression model is trained to compute these parameters for arbitrary SNRs. The performance of the proposed method is demonstrated on various databases which include Hearing In Noise Test (HINT) a French database, NOIZEUS (ENGLISH) and CHAINS (ENGLISH) databases at different levels of engine noises arising from a running car at various speeds. The results of the investigation convincingly demonstrate that the proposed approach could improve the speech intelligibility, while preserving the quality. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.en_US
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.sourceJournal of Signal Processing Systemsen_US
dc.subjectGaussian process regressionen_US
dc.subjectNear-enden_US
dc.subjectPESQen_US
dc.subjectSDRen_US
dc.subjectSpeech intelligibilityen_US
dc.subjectSTOIen_US
dc.titleOptimal Speech Intelligibility Improvement for Varying Car Noise Characteristicsen_US
dc.typeJournal Articleen_US
Appears in Collections:Journal Article

Files in This Item:
There are no files associated with this item.
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.