P. Anil Kumar, A. Lakshmi Parvathi, S. Sruthi
This research investigates the effectiveness of different feature extraction techniques for Speech Emotion Recognition (SER) and explores the potential of machine learning to improve accuracy. While Gammatone Cepstral Coefficients (GTCC) are designed to capture auditory features aligned with human perception, they may not adequately capture subtle emotional cues. Mel-Frequency Cepstral Coefficients (MFCC), on the other hand, have proven effective at representing speech signals for emotion recognition. This work compares GTCC-based feature extraction with MFCC and employs a Cubic Support Vector Machine (SVM) classifier to improve the system's ability to learn and distinguish emotional states. Using the CREMA-D and SAVEE datasets, the research aims to advance SER systems with improved accuracy and sensitivity for applications in human-computer interaction. Major Findings: This study focuses on enhancing GTCC features for better emotion recognition in speech. While MFCC achieved higher accuracy, the proposed refinements to GTCC narrowed the performance gap. The results indicate that refined GTCC features, combined with a Cubic SVM, hold promise for effective SER.
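To make the MFCC side of the comparison concrete, the following is a minimal NumPy-only sketch of the standard MFCC pipeline the abstract refers to (framing, windowing, power spectrum, mel filterbank, log, DCT-II). All parameter values (sample rate, FFT size, filter and coefficient counts) are illustrative defaults, not the settings used in the paper; the function names are hypothetical.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    """Triangular filters spaced evenly on the mel scale."""
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bin_pts = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        lo, c, hi = bin_pts[i - 1], bin_pts[i], bin_pts[i + 1]
        for k in range(lo, c):           # rising slope of the triangle
            fb[i - 1, k] = (k - lo) / max(c - lo, 1)
        for k in range(c, hi):           # falling slope
            fb[i - 1, k] = (hi - k) / max(hi - c, 1)
    return fb

def mfcc(signal, sr=16000, n_fft=512, hop=256, n_filters=26, n_ceps=13):
    """Frame the signal, take log mel-filterbank energies, decorrelate with a DCT-II."""
    n_frames = 1 + (len(signal) - n_fft) // hop
    window = np.hamming(n_fft)
    fb = mel_filterbank(n_filters, n_fft, sr)
    # DCT-II basis: keeps the first n_ceps cepstral coefficients per frame
    n = np.arange(n_filters)
    dct = np.cos(np.pi * np.outer(np.arange(n_ceps), 2 * n + 1) / (2 * n_filters))
    feats = np.empty((n_frames, n_ceps))
    for t in range(n_frames):
        frame = signal[t * hop : t * hop + n_fft] * window
        power = np.abs(np.fft.rfft(frame)) ** 2 / n_fft
        feats[t] = dct @ np.log(fb @ power + 1e-10)
    return feats
```

Per-utterance statistics of these frame-level features would then feed the classifier; the "Cubic SVM" named in the abstract corresponds to an SVM with a degree-3 polynomial kernel (in scikit-learn terms, `SVC(kernel="poly", degree=3)`). GTCC follows the same pipeline with a gammatone filterbank on the ERB scale in place of the mel filterbank.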