Hilbert-Huang Mel Frequency Cepstral Coefficient for Speech Stress Recognition System

Barlian Henryranu Prasetio; Dahnial Syauqy; Edita Rosana Widasari

doi:10.1109/icitacee55701.2022.9923955

ScienceGate Book Chapters

JOURNAL ARTICLE

Hilbert-Huang Mel Frequency Cepstral Coefficient for Speech Stress Recognition System

Barlian Henryranu Prasetio Dahnial Syauqy Edita Rosana Widasari

Year: 2022 Pages: 111-114

DOI: 10.1109/icitacee55701.2022.9923955

Get Full-Text PDF Get Analytical Report

Abstract

Today, acoustic and spectral characteristics are commonly utilized to determine stress levels, and they have a high degree of accuracy. Mel frequency cepstral coefficients are the most successful spectral characteristic (MFCCs). On MFCC framework, each window was determined using a Fourier transformation based on its resolution. The window's size, on the other hand, causes frequency resolution issues, particularly for under-stressed speech, since the frequency of each speech can change due to emotional conditions. Hence, it is necessary to analyze the frequency spectrum over time and has an adaptive window. To address this issue, we apply Hilbert-Huang transform into MFCC (called HFCC) feature extraction technique for more robust stress speech recognition system. The extracted features are then used as trained data using neural network (NN) to identify the emotional stress of speaker. We used actual speech stress data from SUSAS Database in all experiments. The experimental result shows that HFCC outperforms MFCC and the existing feature extraction techniques.

Keywords:

Mel-frequency cepstrum Computer science Speech recognition Feature extraction Window function Cepstrum Feature (linguistics) Stress (linguistics) Pattern recognition (psychology) Artificial intelligence Spectral density

Metrics

Cited By

0.78

FWCI (Field Weighted Citation Impact)

Refs

0.72

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Hilbert-Huang Mel Frequency Cepstral Coefficient for Speech Stress Recognition System

Abstract

Metrics

Citation History

Topics

Related Documents

Mel-frequency cepstral coefficient analysis in speech recognition

Extricate Features Utilizing Mel Frequency Cepstral Coefficient in Automatic Speech Recognition System

Modified Mel Frequency Cepstral Coefficient for Korean Children's Speech Recognition

Hilbert Huang transform based speech recognition

Parallel dual-accumulator based Mel frequency cepstral coefficient for speech recognition