Skip navigation

Please use this identifier to cite or link to this item: http://10.10.120.238:8080/xmlui/handle/123456789/292
Title: Detection of AI-Synthesized Speech Using Cepstral & Bispectral Statistics
Authors: Singh A.K.
Singh P.
Keywords: AI-synthesized speech
Bi-spectral Analysis
Cepstral Analysis
Higher Order Correlations
MFCC
Multimedia Forensics
Issue Date: 2021
Publisher: Institute of Electrical and Electronics Engineers Inc.
Abstract: Digital technology has made possible unimaginable applications come true. It seems exciting to have a handful of tools for easy editing and manipulation, but it raises alarming concerns that can propagate as speech clones, duplicates, or maybe deep fakes. Validating the authenticity of a speech is one of the primary problems of digital audio forensics. We propose an approach to distinguish human speech from AI synthesized speech exploiting the Bi-spectral and Cepstral analysis. Higher-order statistics have less correlation for human speech in comparison to a synthesized speech. Also, Cepstral analysis revealed a durable power component in human speech that is missing for a synthesized speech. We integrate both these analyses and propose a model to detect AI synthesized speech. © 2021 IEEE.
URI: https://dx.doi.org/10.1109/MIPR51284.2021.00076
http://localhost:8080/xmlui/handle/123456789/292
ISBN: 978-1665418652
Appears in Collections:Conference Paper

Files in This Item:
There are no files associated with this item.
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.