Neural Architecture Search for Speech Emotion Recognition

Xixin Wu; Shoukang Hu; Zhiyong Wu; Xunying Liu; Helen Meng

doi:10.1109/icassp43922.2022.9746155

ScienceGate Book Chapters

JOURNAL ARTICLE

Neural Architecture Search for Speech Emotion Recognition

Xixin Wu Shoukang Hu Zhiyong Wu Xunying Liu Helen Meng

Year: 2022 Journal: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pages: 6902-6906

DOI: 10.1109/icassp43922.2022.9746155

Get Full-Text PDF Get Analytical Report

Abstract

Deep neural networks have brought significant advancements to speech emotion recognition (SER). However, the architecture design in SER is mainly based on expert knowledge and empirical (trial-and-error) evaluations, which is time-consuming and resource intensive. In this paper, we propose to apply neural architecture search (NAS) techniques to automatically configure the SER models. To accelerate the candidate architecture optimization, we propose a uniform path dropout strategy to encourage all candidate architecture operations to be equally optimized. Experimental results of two different neural structures on IEMOCAP show that NAS can improve SER performance (54.89% to 56.28%) while maintaining model parameter sizes. The proposed dropout strategy also shows superiority over the previous approaches.

Keywords:

Dropout (neural networks) Computer science Architecture Artificial neural network Artificial intelligence Deep neural networks Machine learning Speech recognition

Metrics

Cited By

2.00

FWCI (Field Weighted Citation Impact)

Refs

0.87

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Neural Architecture Search for Speech Emotion Recognition

Abstract

Metrics

Citation History

Topics

Related Documents

EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition

Efficient neural architecture search for emotion recognition

Multilingual Speech Emotion Recognition with Multi-Gating Mechanism and Neural Architecture Search

EEG-Based Emotion Recognition via Neural Architecture Search

EEG-based Emotion Recognition via Transformer Neural Architecture Search