Three-stage hybrid neural beamformer for multi-channel speech enhancement

Kelan Kuang; Feiran Yang; Junfeng Li; Jun Yang

doi:10.1121/10.0019802

ScienceGate Book Chapters

JOURNAL ARTICLE

Three-stage hybrid neural beamformer for multi-channel speech enhancement

Kelan Kuang Feiran Yang Junfeng Li Jun Yang

Year: 2023 Journal: The Journal of the Acoustical Society of America Vol: 153 (6)Pages: 3378-3378 Publisher: Acoustical Society of America

DOI: 10.1121/10.0019802

Get Full-Text PDF Get Analytical Report

Abstract

This paper proposes a hybrid neural beamformer for multi-channel speech enhancement, which comprises three stages, i.e., beamforming, post-filtering, and distortion compensation, called TriU-Net. The TriU-Net first estimates a set of masks to be used within a minimum variance distortionless response beamformer. A deep neural network (DNN)-based post-filter is then utilized to suppress the residual noise. Finally, a DNN-based distortion compensator is followed to further improve speech quality. To characterize the long-range temporal dependencies more efficiently, a network topology, gated convolutional attention network, is proposed and utilized in the TriU-Net. The advantage of the proposed model is that the speech distortion compensation is explicitly considered, yielding higher speech quality and intelligibility. The proposed model achieved an average 2.854 wb-PESQ score and 92.57% ESTOI on the CHiME-3 dataset. In addition, extensive experiments conducted on the synthetic data and real recordings confirm the effectiveness of the proposed method in noisy reverberant environments.

Keywords:

PESQ Computer science Speech enhancement Speech recognition Residual Intelligibility (philosophy) Distortion (music) Beamforming Artificial neural network Pattern recognition (psychology) Artificial intelligence Noise reduction Algorithm Bandwidth (computing) Telecommunications

Metrics

Cited By

2.68

FWCI (Field Weighted Citation Impact)

Refs

0.88

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Advanced Adaptive Filtering Techniques

Physical Sciences → Engineering → Computational Mechanics

Hearing Loss and Rehabilitation

Life Sciences → Neuroscience → Cognitive Neuroscience

Three-stage hybrid neural beamformer for multi-channel speech enhancement

Abstract

Metrics

Citation History

Topics

Related Documents

Attention-Based Beamformer For Multi-Channel Speech Enhancement

A New Neural Beamformer for Multi-channel Speech Separation

TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor’s Approximation Theory

All-Neural Multi-Channel Speech Enhancement

Three-stage hybrid spiking neural networks fine-tuning for speech enhancement