Skip navigation

Please use this identifier to cite or link to this item: http://10.10.120.238:8080/xmlui/handle/123456789/433
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBiswas R.en_US
dc.contributor.authorNathwani K.en_US
dc.date.accessioned2023-11-30T08:33:16Z-
dc.date.available2023-11-30T08:33:16Z-
dc.date.issued2022-
dc.identifier.issn0278081X-
dc.identifier.otherEID(2-s2.0-85134676071)-
dc.identifier.urihttps://dx.doi.org/10.1007/s00034-022-02106-3-
dc.identifier.urihttp://localhost:8080/xmlui/handle/123456789/433-
dc.description.abstractThe proposed work attempts to improve the near-end intelligibility of speech at very low signal-to-noise ratios (SNRs). Additionally, the prerequisite of noise statistics that existing intelligibility improvement methods require is not a limitation of the proposed approach. To this end, the shaping parameters of the voice transformation function (VTF) are optimized. This optimization of the shaping parameters of the VTF corresponds to the combined modification that includes formant shifting, nonuniform time scaling, smoothing, and energy re-distributions in comprehensive learning particle swarm optimization (CLPSO) framework. The optimal parameters of the combined modifications are obtained by jointly maximizing the short time objective intelligibility, perceptual evaluation of speech quality and signal-to-distortion ratio metrics being used as the cost function in CLPSO. The outcome at the end is an improvement in intelligibility that is significantly higher than the ones obtained by applying these methods individually, while preserving the quality. As a side result, a Gaussian process regression is also employed to estimate the shaping parameters of VTF at arbitrary SNRs—other than the ones which were used during CLPSO training. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.en_US
dc.language.isoenen_US
dc.publisherBirkhauseren_US
dc.sourceCircuits, Systems, and Signal Processingen_US
dc.subjectCLPSOen_US
dc.subjectPESQen_US
dc.subjectSDRen_US
dc.subjectSpeech intelligibilityen_US
dc.subjectSTOIen_US
dc.titleOptimal Near-End Speech Intelligibility Improvement Using CLPSO-Based Voice Transformation in Realistic Noisy Environmentsen_US
dc.typeJournal Articleen_US
Appears in Collections:Journal Article

Files in This Item:
There are no files associated with this item.
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.