Speaker recognition using artificial neural networks

F. Mueen; Aftab Ahmed; Sanaullah Sanaullah; A. Gaba

doi:10.1109/iscon.2002.1215947

JOURNAL ARTICLE

Year: 2004 Vol: 1 Pages: 99-102

Abstract

We report on the application of a recurrent neural network (RNN) to an open-set, text-dependent speaker identification task. Mel-frequency cepstral coefficient (MFCC) features extracted from the speech utterance are fed to a neural-network-based classifier to identify the speakers. We use a feedforward net architecture as proposed by A.J. Robinson (IEEE Trans. on Neural Networks, vol. 5, no. 2, 1994). We introduce a fully connected hidden layer between the input and state nodes and the output. We show that this hidden layer makes the learning of complex classification tasks more efficient. Training uses backpropagation through time. There is one output unit per speaker, with the training targets corresponding to speaker identity. For 10 male speakers, we obtain a true acceptance rate of 100% with a false acceptance rate of 10%. For 14 speakers these figures are 94% and 12%, respectively. We also investigate how identification accuracy is affected by environmental factors (signal level, change of microphone), the choice of acoustic vectors (FFT or MFCC), the size of the training database, and the inclusion of fundamental frequency. MFCC features plus fundamental frequency give the best results.
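
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the hidden layer also produces the next state (the abstract does not say how the state is updated), that per-frame outputs are averaged over the utterance, and that the open-set decision is a simple threshold on the best score. The weights are random and untrained, so backpropagation through time is not shown. All names (`TinySpeakerRNN`, `identify`) and layer sizes are hypothetical.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def make_layer(n_in, n_out, rng):
    """Random weights: n_out rows of n_in weights plus a trailing bias."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
            for _ in range(n_out)]


def apply_layer(weights, inputs, activation):
    """Fully connected layer; the last entry of each row is the bias."""
    out = []
    for row in weights:
        total = row[-1] + sum(w * x for w, x in zip(row, inputs))
        out.append(activation(total))
    return out


class TinySpeakerRNN:
    """Hypothetical recurrent classifier with one output unit per speaker."""

    def __init__(self, n_features, n_state, n_hidden, n_speakers, seed=0):
        rng = random.Random(seed)
        self.n_state = n_state
        # Hidden layer sits between (input + state) and (output + next state).
        self.hidden = make_layer(n_features + n_state, n_hidden, rng)
        self.to_output = make_layer(n_hidden, n_speakers, rng)
        self.to_state = make_layer(n_hidden, n_state, rng)  # assumption

    def score_utterance(self, frames):
        """Run the net over all feature frames; return mean output per speaker."""
        if not frames:
            raise ValueError("utterance has no frames")
        state = [0.0] * self.n_state
        totals = [0.0] * len(self.to_output)
        for frame in frames:
            h = apply_layer(self.hidden, list(frame) + state, math.tanh)
            outputs = apply_layer(self.to_output, h, sigmoid)
            state = apply_layer(self.to_state, h, math.tanh)
            totals = [t + o for t, o in zip(totals, outputs)]
        return [t / len(frames) for t in totals]


def identify(scores, threshold):
    """Open-set decision: best speaker index, or None if below threshold."""
    best = max(range(len(scores)), key=scores.__getitem__)
    return best if scores[best] >= threshold else None


if __name__ == "__main__":
    rng = random.Random(1)
    # 20 frames of 13 placeholder values standing in for MFCC vectors.
    utterance = [[rng.gauss(0.0, 1.0) for _ in range(13)] for _ in range(20)]
    net = TinySpeakerRNN(n_features=13, n_state=8, n_hidden=16, n_speakers=10)
    print(identify(net.score_utterance(utterance), threshold=0.5))
```

In an open-set task the threshold trades true acceptances against false acceptances, which is why the abstract reports both rates. The value 0.5 used here is only a placeholder.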

Keywords:

Mel-frequency cepstrum; Computer science; Speech recognition; Artificial neural network; Speaker recognition; Classifier (UML); Backpropagation; Microphone; Artificial intelligence; Time delay neural network; Utterance; Pattern recognition (psychology); Feature extraction

Metrics

FWCI (Field Weighted Citation Impact): 0.39

Citation Normalized Percentile: 0.70

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Related Documents

Multi-speaker isolated digit recognition using artificial neural networks

Speaker recognition using artificial neural networks based on vowel phonemes

Speaker Recognition Using LIRA Neural Networks

A Survey of Automatic Speaker Recognition System Using Artificial Neural Networks

Assamese Speaker Recognition Using Artificial Neural Network