JOURNAL ARTICLE

Speaker recognition using artificial neural networks

Abstract

We report on the application of a recurrent neural network (RNN) to an open-set, text-dependent speaker identification task. Mel-frequency cepstral coefficient (MFCC) features extracted from the speech utterance are fed to a neural-network-based classifier that identifies the speaker. We use a feedforward net architecture as proposed by A.J. Robinson (IEEE Trans. on Neural Networks, vol. 5, no. 2, 1994). We introduce a fully connected hidden layer between the input and state nodes and the output. We show that this hidden layer makes the learning of complex classification tasks more efficient. Training uses backpropagation through time. There is one output unit per speaker, with the training targets corresponding to speaker identity. For 10 male speakers, we obtain a true acceptance rate of 100% with a false acceptance rate of 10%. For 14 speakers, these figures are 94% and 12%, respectively. We also investigate how identification accuracy is affected by environmental factors (signal level, change of microphone), the choice of acoustic vectors (FFT or MFCC), the size of the training database, and the inclusion of fundamental frequency. MFCC features plus fundamental frequency give the best results.
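The data flow described in the abstract (input and state nodes feeding a fully connected hidden layer, which drives one output unit per speaker) can be illustrated with a minimal, untrained sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the class and function names, the layer sizes, the sigmoid activations, the omission of bias terms, the averaging of outputs over frames, the threshold-based open-set decision, and the definitions used for the true and false acceptance rates. Training with backpropagation through time is not shown.

```python
import math
import random


def make_matrix(rows, cols, rng, scale=0.1):
    """Small random weight matrix (hypothetical initialisation)."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class HiddenLayerRNN:
    """Input + state -> hidden layer -> speaker outputs + next state (no biases)."""

    def __init__(self, n_in, n_state, n_hidden, n_speakers, seed=0):
        rng = random.Random(seed)
        self.n_state = n_state
        self.w_hidden = make_matrix(n_hidden, n_in + n_state, rng)
        self.w_out = make_matrix(n_speakers, n_hidden, rng)
        self.w_state = make_matrix(n_state, n_hidden, rng)

    def score_utterance(self, frames):
        """Return one score per speaker, averaged over all frames (assumption)."""
        if not frames:
            raise ValueError("utterance has no frames")
        state = [0.0] * self.n_state
        totals = [0.0] * len(self.w_out)
        for frame in frames:
            # The hidden layer sees the current feature frame and the previous state.
            hidden = [sigmoid(a) for a in matvec(self.w_hidden, list(frame) + state)]
            outputs = [sigmoid(a) for a in matvec(self.w_out, hidden)]
            state = [sigmoid(a) for a in matvec(self.w_state, hidden)]
            totals = [t + o for t, o in zip(totals, outputs)]
        return [t / len(frames) for t in totals]


def identify(scores, threshold):
    """Open-set decision: best-scoring speaker index, or None to reject."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    return best if scores[best] >= threshold else None


def acceptance_rates(trials, threshold):
    """trials: list of (scores, true_speaker), with true_speaker None for impostors.

    Assumes at least one enrolled trial and at least one impostor trial.
    Returns (true acceptance rate, false acceptance rate) under assumed definitions.
    """
    enrolled = [(s, t) for s, t in trials if t is not None]
    impostors = [s for s, t in trials if t is None]
    true_accepts = sum(1 for s, t in enrolled if identify(s, threshold) == t)
    false_accepts = sum(1 for s in impostors if identify(s, threshold) is not None)
    return true_accepts / len(enrolled), false_accepts / len(impostors)
```

In this sketch, raising the threshold trades a lower false acceptance rate for a lower true acceptance rate, which is the trade-off behind pairs of figures such as those reported in the abstract.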

Keywords:
Mel-frequency cepstrum; Computer science; Speech recognition; Artificial neural network; Speaker recognition; Classifier (UML); Backpropagation; Microphone; Artificial intelligence; Time delay neural network; Utterance; Pattern recognition (psychology); Feature extraction

Metrics

Cited By: 11
FWCI (Field Weighted Citation Impact): 0.39
Refs: 14
Citation Normalized Percentile: 0.70

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

DISSERTATION

Multi-speaker isolated digit recognition using artificial neural networks

Danqing Zhang

University: ANU Open Research (Australian National University) Year: 1994

JOURNAL ARTICLE

Speaker Recognition Using LIRA Neural Networks

Nestor A. Garcia Fragoso, Tetyana Baydyk, Ernst Kussul

Journal: Zenodo (CERN European Organization for Nuclear Research) Year: 2020 Vol: 14 (1) Pages: 14-22

JOURNAL ARTICLE

A Survey of Automatic Speaker Recognition System Using Artificial Neural Networks

Kharibam Jilenkumari Devi, Khelchandra Thongam

Journal: Journal of Advanced Research in Dynamical and Control Systems Year: 2019 Vol: 11 (10-SPECIAL ISSUE) Pages: 453-456

JOURNAL ARTICLE

Assamese Speaker Recognition Using Artificial Neural Network

Bhargab Medhi, Prof. P.H. Talukdar

Journal: IJARCCE Year: 2015 Pages: 321-324