Abstract

This paper presents our methods for the Audio-Video Based Emotion Recognition subtask of the 2017 Emotion Recognition in the Wild (EmotiW) Challenge. The task is to predict one of seven basic emotions for short video segments. We extract different features from the audio and facial-expression modalities, and we explore a temporal LSTM model that takes frame-level facial features as input, which improves on the non-temporal model. Fusing the different modality features with the temporal model yields 58.5% accuracy on the test set, demonstrating the effectiveness of our methods.
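The abstract describes fusing scores from audio and facial-expression models over seven emotion classes. As a minimal sketch of how such score-level fusion could look (the weights, emotion labels, and function names here are illustrative assumptions, not the authors' implementation):

```python
# Hypothetical sketch of score-level fusion over the seven basic emotions.
# The label set and weighting scheme are assumptions for illustration only.
EMOTIONS = ["angry", "disgust", "fear", "happy", "neutral", "sad", "surprise"]

def fuse_scores(modality_scores, weights):
    """Weighted average of per-modality class-score vectors.

    modality_scores: list of score vectors (one per modality, length 7 each).
    weights: one non-negative weight per modality.
    """
    assert len(modality_scores) == len(weights)
    num_classes = len(modality_scores[0])
    fused = [0.0] * num_classes
    total = sum(weights)
    for scores, w in zip(modality_scores, weights):
        for i, s in enumerate(scores):
            fused[i] += w * s
    return [f / total for f in fused]

def predict(modality_scores, weights):
    """Return the emotion label with the highest fused score."""
    fused = fuse_scores(modality_scores, weights)
    return EMOTIONS[max(range(len(fused)), key=fused.__getitem__)]
```

For example, if both the audio and video models assign their highest score to the same class, the fused prediction keeps that class; when they disagree, the weights decide which modality dominates.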

Keywords:
Emotion recognition; Facial expression; Speech recognition; Modality (human–computer interaction); Pattern recognition; Artificial intelligence; Computer science

Metrics

Cited By: 13
Refs: 24
FWCI (Field-Weighted Citation Impact): 0.77
Citation Normalized Percentile: 0.74

Topics

Emotion and Mood Recognition (Social Sciences → Psychology → Experimental and Cognitive Psychology)
Speech and Audio Processing (Physical Sciences → Computer Science → Signal Processing)
Music and Audio Processing (Physical Sciences → Computer Science → Signal Processing)