Skip navigation

Please use this identifier to cite or link to this item: http://10.10.120.238:8080/xmlui/handle/123456789/132
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBiswas R.en_US
dc.contributor.authorNathwani K.en_US
dc.contributor.authorAbrol V.en_US
dc.date.accessioned2023-11-30T07:33:27Z-
dc.date.available2023-11-30T07:33:27Z-
dc.date.issued2021-
dc.identifier.isbn978-1713836902-
dc.identifier.issn2308457X-
dc.identifier.otherEID(2-s2.0-85119206545)-
dc.identifier.urihttps://dx.doi.org/10.21437/Interspeech.2021-150-
dc.identifier.urihttp://localhost:8080/xmlui/handle/123456789/132-
dc.description.abstractIn a recent work [1], a novel Delta Function-based Formant Shifting approach was proposed for speech intelligibility improvement. The underlying principle is to dynamically relocate the formants based on their occurrence in the spectrum away from the region of noise. The manner in which the formants are shifted is decided by the parameters of the Delta Function, the optimal values of which are evaluated using Comprehensive Learning Particle Swarm Optimization (CLPSO). Although effective, CLPSO is computationally expensive to the extent that it overshadows its merits in intelligibility improvement. As a solution to this, the current work aims to improve the Short-Time Objective Intelligibility (STOI) of (target) speech using a Delta Function that has been generated using a different (source) language. This transfer learning is based upon the relative positioning of the formant frequencies and pitch values of the source & target language datasets. The proposed approach is demonstrated and validated by subjecting it to experimentation with three different languages under variable noisy conditions. Copyright © 2021 ISCA.en_US
dc.language.isoenen_US
dc.publisherInternational Speech Communication Associationen_US
dc.sourceProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECHen_US
dc.subjectFormant ratioen_US
dc.subjectFormant shiftingen_US
dc.subjectPitch ratioen_US
dc.subjectSpeech intelligibilityen_US
dc.subjectTransfer learningen_US
dc.titleTransfer learning for speech intelligibility improvement in noisy environmentsen_US
dc.typeConference Paperen_US
Appears in Collections:Conference Paper

Files in This Item:
There are no files associated with this item.
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.