Study for Speaker Identification Under Reverberation Effect

Document Type : Original Article

Authors

Communications and Electronics Department Faculty of Electronic Engineering,Manoufia University: Menouf, Egypt

Abstract

Speech is the way people interact with each other. With the development of computer technology, some researchers have sought to find techniques that allow the computer to understand natural speech. Smart systems are now trying to imitate the functions of the human brain by using neural networks. Reverberation represents interference noise on the original speech signal. The aim of the research is investigate of reverberation effect on speech signal and whether it affects the identification of the speaker or not to determine the best scenario for identifying the speaker in the presence of reverberation. This study is done using feature extraction with Mel-Frequency Capstral coefficients (MFCCs) and neural networks.

Keywords


[1]      Sreenivas SremathTirumala,Seyed RezaShahamiri ,Abhimanyu SinghGarhwal and RuiliWang “Speaker identification features extraction methods A systematic review” Expert Systems with Applications, Volume 90, 30 December 2017, p. 250-271.
[2]      Jeet Kumar , Om Prakash Prabhakar  and  Navneet Kumar Sahu  “Comparative Analysis of Different Feature Extraction and Classifier Techniques for Speaker Identification Systems: A Review,”, Vol. 2, Issue 1, January 2014.
[3]      Praveen G Nair, and Rohit Nair “Efcient Speaker Identifcation Using Artifcial Neural Network,”, IJECT Vo l. 6, Issue 1, Jan- March2015.
[5]      Naglaa F. Soliman , Zhraa Mostfa, Fathi E. Abd El‑Samie, Mahmoud I. Abdalla “Performance enhancement of speaker identification systems using speech encryption and cancelable features,” International Journal of Speech Technology, December 2017, Volume 20, Issue 4, pp 977–1004.
[6]      Wu Junqin and Yu Junjun “An Improved Arithmetic of MFCC in Speech Recognition System,”International Conference on Electronics, Communications and Control (ICECC) , 2011.
[7]      Jorge Martinez Hector Perez Enrique Escamilla and Masahisa Mabo Suzuki “A Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) Techniques,”  , 22nd International Conference on Electrical Communications and Computers, 2012.
[8]      Wu Ju, “Speaker Recognition System Based on MFCC and Schmm.” Symposium on ICT and Energy Efficiency and Workshop on Information Theory and Security (CIICT 2012), 2012,  pp. 88 – 92.
[9]      Hoang Trang, Tran Hoang Loc, Huynh Bui Hoang Nam “Proposed Combination of PCA and MFCC Feature Extraction in Speech Recognition System,” International Conference on Advanced Technologies for Communications (ATC 2014), 2014.
[10]   Wenbo Wang , Sichun Li , Jianshe Yang , Zhao Liu and  Weicun Zhou o, “ Feature Extraction of Underwater Target in Auditory Sensation Area Based on MFCC,” , IEEE/OES China Ocean Acoustics (COA), 2016.
[11]   Roland Maas , Emanu¨ el A.P. Habets, Armin Sehr , Walter Kellermann. “On the application of reverberation suppression to robust speech recognition,”,  IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012.
[12]   Sen M. Kuo , Bob. H. Lee and Wenshun Tain,  “Real-Time Digital Signal Processing Implementations and Applications” Second Edition.
[13]   S E. Fredrickson and L. Tarassenko "Text-independent speaker recognition using neural network techniques,”  4th International Conference on Artificial Neural Networks, 1995,  pp. 13 – 18.
[14]   Goutam Saha, Pankaj Kumar, and Sandipan Chakroborty “A Comparative Study of Feature Extraction Algorithms on ANN Based Speaker Model for Speaker Recognition Applications” International Conference on Neural Information Processing, ICONIP 2004: Neural Information Processing pp 1192-1197
 
Volume 28, ICEEM2019-Special Issue
ICEEM2019-Special Issue: 1st International Conference on Electronic Eng., Faculty of Electronic Eng., Menouf, Egypt, 7-8 Dec.
2019
Pages 1-7