Mel Frequency Cepstral Coefficients (MFCC) based speaker identification in noisy environment using wiener filter

Paresh M. Chauhan; Nikita Desai

doi:10.1109/icgccee.2014.6921394

JOURNAL ARTICLE

Year: 2014 Pages: 1-5

Abstract

Speech processing is an emerging area of signal processing. Its research topics include speech recognition, speaker identification (SI), and speech synthesis. SI, an important research area within speech processing, means identifying a speaker from his or her spoken speech; its main use is to recognize who is speaking based on the speaker's speaking style. SI is mainly used in forensic analysis, home control systems, database access services, and similar applications. SI has two essential stages: feature extraction and feature matching. Feature extraction derives a small amount of information from the available audio signal that can be used to represent a particular speaker. Many feature extraction techniques are used for SI, including LPC (Linear Predictive Coefficients), MFCC (Mel Frequency Cepstral Coefficients), and PLP (Perceptual Linear Predictive Coefficients). MFCC gives good (efficient) identification results. Factors affecting SI include noise, sampling rate, and number of frames; among them, noise is the most critical. We found that MFCC is not very effective in noisy environments, especially when the noise conditions are mismatched, and the identification rate degrades steadily as the noise level increases. To improve SI performance in real-world noisy environments, we propose a variant of MFCC that includes a Wiener filter, which is well suited to handling noise in speech. Based on our experiments, we suggest that the Wiener filter is more effective in the frequency domain than in the time domain. With the proposed technique we obtained an average identification rate of 88.57% on the NOIZEUS database. In feature matching, the unknown speech is classified by a classifier; we used a neural network for this purpose.
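
The idea of applying a Wiener filter in the frequency domain before MFCC extraction can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the frame length, the number of mel filters and cepstral coefficients, the 8 kHz sampling rate, and the noise estimate taken from the first few frames are all hypothetical choices made for the example.

```python
import numpy as np

def frame_signal(x, frame_len=256, hop=128):
    """Split a 1-D signal into overlapping Hamming-windowed frames."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx] * np.hamming(frame_len)

def wiener_gain(power, noise_power, floor=1e-3):
    """Per-bin Wiener gain SNR / (1 + SNR), with a crude SNR estimate
    max(P - N, 0) / N and a small floor to avoid zeroing bins entirely."""
    snr = np.maximum(power - noise_power, 0.0) / (noise_power + 1e-12)
    return np.maximum(snr / (1.0 + snr), floor)

def mel_filterbank(n_filters, n_fft, sr):
    """Triangular filters spaced evenly on the mel scale."""
    hz_to_mel = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
    mel_to_hz = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(n_filters):
        lo, mid, hi = bins[i], bins[i + 1], bins[i + 2]
        for k in range(lo, mid):          # rising edge of the triangle
            fb[i, k] = (k - lo) / max(mid - lo, 1)
        for k in range(mid, hi):          # falling edge of the triangle
            fb[i, k] = (hi - k) / max(hi - mid, 1)
    return fb

def mfcc_with_wiener(x, sr=8000, frame_len=256, hop=128,
                     n_filters=20, n_ceps=13, noise_frames=6):
    """MFCCs computed from a Wiener-filtered power spectrum."""
    frames = frame_signal(x, frame_len, hop)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # Assumption: the first few frames contain noise only.
    noise_power = power[:noise_frames].mean(axis=0)
    # The gain acts on the amplitude spectrum, so power scales by gain**2.
    clean_power = wiener_gain(power, noise_power) ** 2 * power
    fb = mel_filterbank(n_filters, frame_len, sr)
    log_mel = np.log(clean_power @ fb.T + 1e-10)
    # Unnormalized DCT-II written out as a matrix to stay NumPy-only.
    n = np.arange(n_filters)
    k = np.arange(n_ceps)[:, None]
    dct = np.cos(np.pi * k * (2 * n[None, :] + 1) / (2 * n_filters))
    return log_mel @ dct.T

# Synthetic demo: noise-only lead-in followed by a 440 Hz tone in noise.
rng = np.random.default_rng(0)
sr = 8000
t = np.arange(sr) / sr
x = 0.05 * rng.standard_normal(sr)
x[2000:] += np.sin(2 * np.pi * 440 * t[2000:])
feats = mfcc_with_wiener(x, sr)   # one row of coefficients per frame
```

Each row of `feats` is a feature vector for one frame; in a speaker identification system such vectors would then be passed to a classifier. A practical system would likely replace the fixed noise-only lead-in with a voice activity detector or a running noise estimate.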

Keywords:

Mel-frequency cepstrum; Speech recognition; Computer science; Wiener filter; Speaker recognition; Noise (video); Feature extraction; Feature (linguistics); Filter (signal processing); Filter bank; Speech processing; Cepstrum; Linear prediction; Pattern recognition (psychology); Artificial intelligence; Computer vision

Metrics

FWCI (Field Weighted Citation Impact): 3.38

Citation Normalized Percentile: 0.92

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Related Documents

Mel Frequency Cepstral Coefficients (MFCC) Based Speaker Identification In Noisy Environment Using LBG Vector Quantization

Speaker Identification Using K-means Method Based on Mel Frequency Cepstral Coefficients (MFCC)

SPEAKER IDENTIFICATION USING MEL FREQUENCY CEPSTRAL COEFFICIENTS

Identification of Language using Mel-Frequency Cepstral Coefficients (MFCC)

Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification