BOOK-CHAPTER

Audiovisual Facial Action Unit Recognition Using Feature Level Fusion

Abstract

Recognizing facial actions is challenging, especially when they are accompanied with speech. Instead of employing information solely from the visual channel, this work aims to exploit information from both visual and audio channels in recognizing speech-related facial action units (AUs). In this work, two feature-level fusion methods are proposed. The first method is based on a kind of human-crafted visual feature. The other method utilizes visual features learned by a deep convolutional neural network (CNN). For both methods, features are independently extracted from visual and audio channels and aligned to handle the difference in time scales and the time shift between the two signals. These temporally aligned features are integrated via feature-level fusion for AU recognition. Experimental results on a new audiovisual AU-coded dataset have demonstrated that both fusion methods outperform their visual counterparts in recognizing speech-related AUs. The improvement is more impressive with occlusions on the facial images, which would not affect the audio channel.

Keywords:
Computer science Convolutional neural network Feature (linguistics) Artificial intelligence Speech recognition Pattern recognition (psychology) Channel (broadcasting) Feature extraction Exploit Fusion Computer vision

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
49
Refs
0.38
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face recognition and analysis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Audiovisual Facial Action Unit Recognition using Feature Level Fusion

Zibo MengShizhong HanMin ChenYan Tong

Journal:   International Journal of Multimedia Data Engineering and Management Year: 2016 Vol: 7 (1)Pages: 60-76
JOURNAL ARTICLE

Improving Speech Related Facial Action Unit Recognition by Audiovisual Information Fusion

Zibo MengShizhong HanPing LiuYan Tong

Journal:   IEEE Transactions on Cybernetics Year: 2018 Vol: 49 (9)Pages: 3293-3306
JOURNAL ARTICLE

Facial expression recognition using feature level fusion

Vanita JainPuneet Singh LambaBhanu Pratap SinghNarayanan NamboothiriShafali Dhall

Journal:   Journal of Discrete Mathematical Sciences and Cryptography Year: 2019 Vol: 22 (2)Pages: 337-350
JOURNAL ARTICLE

Action recognition based on feature-level fusion

Enqing ChenWanli Cheng

Year: 2018 Vol: 23 Pages: 42-42
© 2026 ScienceGate Book Chapters — All rights reserved.