Both spatial and tempo-spectral information are essential for multi-channel speech enhancement, a field that has attracted growing interest in recent years. While many studies improve feature extraction through specialized network architectures, they often prioritize raw feature learning without fully addressing how the extracted features can be used effectively. In this work, we focus on the features after extraction and introduce a Channel-Time-Frequency Attention (CTFA) module that allocates weights to the extracted features, aiming to enhance feature utilization and enable the model to focus more effectively on informative features. The CTFA module consists of three parallel attention branches (channel, time, and frequency) that jointly refine both spatial and tempo-spectral features. By assigning greater weight to effective features, it facilitates feature reuse and improves the model's robustness. We incorporate the CTFA module into our previously proposed model and conduct an ablation study to evaluate its contribution. Extensive experimental results confirm the efficacy of the CTFA module, with the proposed method outperforming state-of-the-art baselines.
Shiyun Xu, Yunhe Cao, Zehua Zhang, Mingjiang Wang
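As a rough illustration of the three-branch design described in the abstract, the following is a minimal PyTorch sketch of a channel-time-frequency attention block. The tensor layout (batch, channel, time, frequency), the squeeze-excitation-style gating inside each branch, and the averaging of the three branch outputs are assumptions made for illustration only; the abstract does not specify the module's internals.

```python
# A minimal sketch of a channel-time-frequency attention block, assuming a
# (batch, channel, time, frequency) feature tensor and squeeze-excitation-style
# gating per branch. The actual CTFA internals are not given in the abstract.
import torch
import torch.nn as nn


class AxisAttention(nn.Module):
    """Gates features along one axis (channel=1, time=2, or frequency=3)."""

    def __init__(self, size: int, axis: int, reduction: int = 4):
        super().__init__()
        self.axis = axis
        hidden = max(size // reduction, 1)
        self.mlp = nn.Sequential(
            nn.Linear(size, hidden),
            nn.ReLU(inplace=True),
            nn.Linear(hidden, size),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Average over every axis except batch and the attended one.
        dims = [d for d in (1, 2, 3) if d != self.axis]
        pooled = x.mean(dim=dims)            # (B, size)
        weights = self.mlp(pooled)           # (B, size), values in (0, 1)
        shape = [1, 1, 1, 1]
        shape[0], shape[self.axis] = x.size(0), x.size(self.axis)
        return x * weights.view(*shape)      # broadcast re-weighting


class CTFA(nn.Module):
    """Three parallel attention branches whose re-weighted outputs are averaged."""

    def __init__(self, channels: int, time_steps: int, freq_bins: int):
        super().__init__()
        self.channel_branch = AxisAttention(channels, axis=1)
        self.time_branch = AxisAttention(time_steps, axis=2)
        self.freq_branch = AxisAttention(freq_bins, axis=3)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return (self.channel_branch(x) + self.time_branch(x) + self.freq_branch(x)) / 3


# Example: refine a (batch=2, channels=16, frames=100, bins=257) feature map.
feats = torch.randn(2, 16, 100, 257)
refined = CTFA(channels=16, time_steps=100, freq_bins=257)(feats)
print(refined.shape)  # torch.Size([2, 16, 100, 257])
```

In this sketch each branch pools over the two non-attended axes, so only weights along the attended dimension are learned; the time and frequency branches as written assume fixed frame and bin counts.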