JOURNAL ARTICLE

Mel Frequency Cepstral Coefficients (MFCC) based speaker identification in noisy environment using wiener filter

Abstract

Speech processing is an emerging area of signal processing. Its research areas include speech recognition, speaker identification (SI), and speech synthesis. SI is an important research area within speech processing: it identifies a speaker from his or her spoken speech, recognizing the owner of an utterance from the speaker's speaking style. SI is mainly used in forensic analysis, home control systems, and database access services. SI requires two essential steps: feature extraction and feature matching. Feature extraction derives a small amount of information from the available audio waveform that can be used to represent a particular speaker. Many feature extraction techniques are used for SI, including LPC (Linear Predictive Coefficients), MFCC (Mel Frequency Cepstral Coefficients), and PLP (Perceptual Linear Predictive Coefficients). Of these, MFCC gives good (efficient) identification results. Factors affecting SI include noise, sampling rate, and the number of frames; among them, noise is the most critical. We found that MFCC is not very effective in noisy environments, especially when the noise conditions are mismatched. The identification rate degrades steadily as the noise level increases. To improve the performance of SI in real-world noisy environments, we propose a variant of MFCC. The proposed MFCC includes a Wiener filter, which is well suited to handling noise in speech. Based on our experiments, we suggest that the Wiener filter is more effective in the frequency domain than in the time domain. With the proposed technique, we obtained an average identification rate of 88.57% on the NOIZEUS database. In feature matching, the unknown speech is classified by a classifier; we used a neural network for this step.
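
The abstract does not give implementation details, so the following is only a minimal sketch of what an MFCC front end with a frequency-domain Wiener filter could look like, not the authors' method. Everything specific is an assumption made for illustration: the function names, the 8 kHz sample rate and frame sizes, the use of the first few frames as a noise-only estimate, the spectral-subtraction estimate of the a priori SNR, and the gain floor. The neural network classifier used for feature matching is not shown.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):          # rising edge
            fbank[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):         # falling edge
            fbank[i - 1, k] = (right - k) / max(right - center, 1)
    return fbank

def wiener_gain(noisy_power, noise_psd, floor=0.05):
    """Per-bin Wiener gain xi / (1 + xi).

    Assumption: the a priori SNR xi is estimated by power subtraction,
    and the gain is floored to limit musical-noise artifacts.
    """
    snr = np.maximum(noisy_power / (noise_psd + 1e-12) - 1.0, 0.0)
    return np.maximum(snr / (1.0 + snr), floor)

def mfcc_with_wiener(signal, sample_rate=8000, frame_len=200, hop=80,
                     n_fft=256, n_filters=20, n_ceps=13, noise_frames=5):
    """MFCCs with Wiener filtering applied to the power spectrum."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    power = np.abs(np.fft.rfft(frames, n_fft, axis=1)) ** 2

    # Assumption: the first few frames contain noise only.
    noise_psd = power[:noise_frames].mean(axis=0)

    # The gain acts on the magnitude, so it is squared for the power spectrum.
    cleaned = wiener_gain(power, noise_psd) ** 2 * power

    fbank = mel_filterbank(n_filters, n_fft, sample_rate)
    log_mel = np.log(cleaned @ fbank.T + 1e-10)

    # Unnormalized DCT-II over the filterbank axis; keep the first n_ceps.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.arange(n_ceps)[:, None] * (2 * n + 1)
                   / (2 * n_filters))
    return log_mel @ basis.T

# Synthetic demo: 0.1 s of noise, then a 440 Hz tone in the same noise.
rng = np.random.default_rng(0)
t = np.arange(8000) / 8000.0
tone = np.where(t >= 0.1, np.sin(2 * np.pi * 440.0 * t), 0.0)
noisy = tone + 0.3 * rng.standard_normal(t.size)
features = mfcc_with_wiener(noisy)
print(features.shape)  # (98, 13)
```

Each row of `features` is one frame's cepstral vector; in a pipeline like the one described, such vectors would be passed to the classifier. Filtering in the frequency domain fits naturally here because MFCC already computes a per-frame spectrum, so the gain can be applied before the mel filterbank at little extra cost.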

Keywords:
Mel-frequency cepstrum, Speech recognition, Computer science, Wiener filter, Speaker recognition, Noise (video), Feature extraction, Feature (linguistics), Filter (signal processing), Filter bank, Speech processing, Cepstrum, Linear prediction, Pattern recognition (psychology), Artificial intelligence, Computer vision

Metrics

Cited By: 45
FWCI (Field Weighted Citation Impact): 3.38
Refs: 20
Citation Normalized Percentile: 0.92

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Speaker Identification Using K-means Method Based on Mel Frequency Cepstral Coefficients(MFCC)

Dirman Hanafi, Abdul Syafiq Abdull Sukor

Journal: i-manager’s Journal on Embedded Systems Year: 2012 Vol: 1 (1) Pages: 19-28

JOURNAL ARTICLE

SPEAKER IDENTIFICATION USING MEL FREQUENCY CEPSTRAL COEFFICIENTS

Rashidul Hasan, S. M. Mahbubur Rahman

Journal: The Journal of Urology Year: 2004 Vol: 170 (1) Pages: 94-8

BOOK-CHAPTER

Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification

Sumit Srivastava, Mahesh Chandra, G. Sahoo

Advances in intelligent systems and computing Year: 2016 Pages: 309-316