JOURNAL ARTICLE

Speech Emotion Recognition Based on Swin-Transformer

Zirou LiaoShaoping Shen

Year: 2023 Journal:   Journal of Physics Conference Series Vol: 2508 (1)Pages: 012056-012056   Publisher: IOP Publishing

Abstract

Abstract The ability of machines to understand human subjective emotions is an essential link to realize artificial intelligence. How to extract and utilize information from audio signals is still a challenging task. By transforming acoustic signals into time-domain information represented by spectrograms, advanced algorithms in the field of computer vision can be applied to the field of acoustics. In this paper, we propose a Speech Emotion Recognition(SER) system based on Swin-Transformer(Swin). In addition to verifying the feasibility of Swin in SER task, we also compared the effectiveness of various spectrum maps under the same model parameters. Our model is validated on the IEMOCAP dataset and achieves competitive performance.

Keywords:
Spectrogram Computer science Speech recognition Transformer Emotion recognition Field (mathematics) Artificial intelligence Task (project management) Engineering

Metrics

7
Cited By
1.88
FWCI (Field Weighted Citation Impact)
7
Refs
0.83
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
© 2026 ScienceGate Book Chapters — All rights reserved.