JOURNAL ARTICLE

Emotional Climate Recognition in Speech‐Based Conversations: Leveraging Deep Bispectral Image Analysis and Affect Dynamics

Abstract

ABSTRACT The growing availability of conversational data across multiple platforms has intensified interest in dynamic emotion recognition. Speech plays a pivotal role in shaping the emotional climate (EC) of peer conversations. We propose DeepBispec, the first framework to integrate deep bispectral image analysis with affect dynamics (AD) for speech‐based EC recognition. Bispectrum representations capture nonlinear and non‐Gaussian speech characteristics, while AD descriptors model temporal emotion fluctuations. Evaluated on K‐EmoCon, IEMOCAP and SEWA datasets, DeepBispec consistently improved EC classification performance. For example, on K‐EmoCon, arousal accuracy increased from 79.0% (bispectrum only) to 81.4% (with AD), while valence accuracy improved from 76.8% to 77.5%; similar trends were observed for IEMOCAP and SEWA. DeepBispec outperformed strong CNN, LSTM, and Transformer baselines, demonstrating robust cross‐lingual performance across seven languages. These findings highlight its potential for real‐world applications such as mental health monitoring, affect‐aware learning platforms and empathetic dialogue systems.

Keywords:

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
42
Refs
0.37
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Emotions and Moral Behavior
Social Sciences →  Psychology →  Social Psychology
© 2026 ScienceGate Book Chapters — All rights reserved.