JOURNAL ARTICLE

RECOGNITION OF AUDIO-VISUAL EMOTIONS USING VIDEO CLIPS

Pragya Singh Tomar Brahma Datta Shukla

Year: 2018 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

This research describes a multimodal emotion identification system that uses auditory and visual inputs to recognize emotions. Mel-Frequency Cepstral Coefficients, Filter Bank Energies, and prosodic characteristics are retrieved from the audio channel. Two techniques are being investigated for the visual element. First, the geometric relationships between face landmarks, such as distances and angles, are calculated. Second, we condense each emotional movie into a smaller collection of key-frames that may be used to visually distinguish between different emotions. To accomplish so, key-frame summary films are fed into a convolutional neural network. Finally, in a late fusion/stacking approach, the confidence outputs of all the classifiers from all the modalities are utilized to build a new feature space to be trained for final emotion label prediction. Experiments on the SAVEE, eNTERFACE'05, and RML databases reveal that our proposed solution performs significantly better than current options, defining the current state-of-the-art in all three databases.

Keywords:
CLIPS Audio visual Computer science Multimedia Speech recognition Artificial intelligence

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.31
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

RECOGNITION OF AUDIO-VISUAL EMOTIONS USING VIDEO CLIPS

Tomar*, Pragya Singh

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2018
JOURNAL ARTICLE

Audio-Visual Emotion Recognition in Video Clips

Fatemeh NorooziMarina MarjanovićAngelina NjegušSérgio EscaleraGholamreza Anbarjafari

Journal:   IEEE Transactions on Affective Computing Year: 2017 Vol: 10 (1)Pages: 60-75
BOOK-CHAPTER

Audio/Video Clips

Year: 2013 Pages: 5-5
BOOK-CHAPTER

Audio/video clips

Frank RennieKeith Smyth

Year: 2019 Pages: 16-17
JOURNAL ARTICLE

Multiple video clips preservation using folded back audio-visual cryptography scheme

Imon MukherjeeRitam Ganguly

Journal:   Multimedia Tools and Applications Year: 2017 Vol: 77 (5)Pages: 5281-5301
© 2026 ScienceGate Book Chapters — All rights reserved.