http://10.10.120.238:8080/xmlui/handle/123456789/434
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Biswas R. | en_US |
dc.contributor.author | Nathwani K. | en_US |
dc.contributor.author | Hafiz F. | en_US |
dc.contributor.author | Swain A. | en_US |
dc.date.accessioned | 2023-11-30T08:33:16Z | - |
dc.date.available | 2023-11-30T08:33:16Z | - |
dc.date.issued | 2022 | - |
dc.identifier.issn | 1939-8018 | - |
dc.identifier.other | EID(2-s2.0-85139705570) | - |
dc.identifier.uri | https://dx.doi.org/10.1007/s11265-022-01815-x | - |
dc.identifier.uri | http://localhost:8080/xmlui/handle/123456789/434 | - |
dc.description.abstract | The present study proposes a novel method for speech intelligibility improvement by optimally shifting the formants using a trapezoidal voice transformation function. The shaping parameters of this function are determined by maximizing various performance measures using a comprehensive learning particle swarm optimization (CLPSO) algorithm. These measures include the short-time objective intelligibility (STOI), perceptual evaluation of speech quality (PESQ) and signal-to-distortion ratio (SDR). The proposed method does not require a priori knowledge of the noise statistics in designing the voice transformation function. Although the shaping parameters are obtained at specific SNRs, a Gaussian process (GP) regression model is trained to compute these parameters for arbitrary SNRs. The performance of the proposed method is demonstrated on several databases, namely the Hearing In Noise Test (HINT), a French database, and the English NOIZEUS and CHAINS databases, at different levels of engine noise arising from a car running at various speeds. The results of the investigation convincingly demonstrate that the proposed approach can improve speech intelligibility while preserving quality. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Springer | en_US |
dc.source | Journal of Signal Processing Systems | en_US |
dc.subject | Gaussian process regression | en_US |
dc.subject | Near-end | en_US |
dc.subject | PESQ | en_US |
dc.subject | SDR | en_US |
dc.subject | Speech intelligibility | en_US |
dc.subject | STOI | en_US |
dc.title | Optimal Speech Intelligibility Improvement for Varying Car Noise Characteristics | en_US |
dc.type | Journal Article | en_US |
Appears in Collections: | Journal Article |