http://10.10.120.238:8080/xmlui/handle/123456789/292
Title: | Detection of AI-Synthesized Speech Using Cepstral & Bispectral Statistics |
Authors: | Singh A.K. Singh P. |
Keywords: | AI-synthesized speech Bi-spectral Analysis Cepstral Analysis Higher Order Correlations MFCC Multimedia Forensics |
Issue Date: | 2021 |
Publisher: | Institute of Electrical and Electronics Engineers Inc. |
Abstract: | Digital technology has made possible unimaginable applications come true. It seems exciting to have a handful of tools for easy editing and manipulation, but it raises alarming concerns that can propagate as speech clones, duplicates, or maybe deep fakes. Validating the authenticity of a speech is one of the primary problems of digital audio forensics. We propose an approach to distinguish human speech from AI synthesized speech exploiting the Bi-spectral and Cepstral analysis. Higher-order statistics have less correlation for human speech in comparison to a synthesized speech. Also, Cepstral analysis revealed a durable power component in human speech that is missing for a synthesized speech. We integrate both these analyses and propose a model to detect AI synthesized speech. © 2021 IEEE. |
URI: | https://dx.doi.org/10.1109/MIPR51284.2021.00076 http://localhost:8080/xmlui/handle/123456789/292 |
ISBN: | 978-1665418652 |
Appears in Collections: | Conference Paper |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.