Abstract

Over the last decade, emotion recognition has attracted considerable attention in human-computer interaction. Current recognition accuracy can still be improved, and further research into the fundamental temporal relations within speech waveforms is needed. A method for speech emotion recognition is proposed that exploits differences in emotional saturation between time frames, combining speech features with attention-based Long Short-Term Memory (LSTM) recurrent neural networks (RNNs). In place of standard statistical features, frame-level speech features were derived from the waveform to retain the original speech's temporal relations across the sequence of frames. Two LSTM enhancement algorithms based on the attention mechanism are presented to distinguish the emotional saturation of distinct frames. An Emotion Recognition in Conversation system capable of recognizing facial emotion in real time was also proposed.
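The core idea above, weighting frame-level LSTM outputs by a per-frame attention score so that emotionally saturated frames contribute more to the utterance representation, can be sketched as follows. This is a minimal illustration only: the attention vector `w`, the plain softmax pooling, and the use of NumPy in place of a trained LSTM are all assumptions for demonstration, not the paper's exact architecture.

```python
import numpy as np

def attention_pool(frame_outputs, w):
    """Pool frame-level LSTM outputs into one utterance vector.

    frame_outputs: (T, D) array, one hidden state per time frame
    w:             (D,) hypothetical learned attention vector
    Returns the softmax attention weights (T,) and the weighted sum (D,).
    """
    scores = frame_outputs @ w                       # one score per frame
    scores = scores - scores.max()                   # numerical stability
    alphas = np.exp(scores) / np.exp(scores).sum()   # softmax over frames
    return alphas, frame_outputs.T @ alphas          # attention-weighted pool

# Stand-in data: 50 frames of 128-dim hidden states (no real LSTM here).
rng = np.random.default_rng(0)
T, D = 50, 128
H = rng.standard_normal((T, D))   # placeholder for LSTM frame outputs
w = rng.standard_normal(D)        # placeholder for a learned attention vector
alphas, utterance_vec = attention_pool(H, w)
```

The pooled `utterance_vec` would then feed a classifier; frames the attention scores rate as more emotionally salient dominate the sum, while low-scoring frames are effectively suppressed.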

Keywords:
Computer science, Speech recognition, Artificial intelligence

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.22

Topics

Emotion and Mood Recognition
Social Sciences → Psychology → Experimental and Cognitive Psychology
© 2026 ScienceGate Book Chapters — All rights reserved.