Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network

Ashutosh Pandey; Buye Xu; Anurag Kumar; Jacob Donley; Paul Calamia; DeLiang Wang

doi:10.21437/interspeech.2022-11215

ScienceGate Book Chapters

JOURNAL ARTICLE

Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network

Ashutosh Pandey Buye Xu Anurag Kumar Jacob Donley Paul Calamia DeLiang Wang

Year: 2022 Journal: Interspeech 2022 Pages: 729-733

DOI: 10.21437/interspeech.2022-11215

Get Full-Text PDF Get Analytical Report

Abstract

Deep neural networks (DNNs) are very effective for multichannel speech enhancement with fixed array geometries.However, it is not trivial to use DNNs for ad-hoc arrays with unknown order and placement of microphones.We propose a novel triplepath network for ad-hoc array processing in the time domain.The key idea in the network design is to divide the overall processing into spatial processing and temporal processing and use self-attention for spatial processing.Using self-attention for spatial processing makes the network invariant to the order and the number of microphones.The temporal processing is done independently for all channels using a recently proposed dual-path attentive recurrent network.The proposed network is a multiple-input multiple-output architecture that can simultaneously enhance signals at all microphones.Experimental results demonstrate the excellent performance of the proposed approach.Further, we present analysis to demonstrate the effectiveness of the proposed network in utilizing multichannel information even from microphones at far locations.

Keywords:

Computer science Wireless ad hoc network Mobile ad hoc network Computer network Telecommunications Wireless

Metrics

Cited By

1.26

FWCI (Field Weighted Citation Impact)

Refs

0.80

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Advanced Adaptive Filtering Techniques

Physical Sciences → Engineering → Computational Mechanics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network

Abstract

Metrics

Citation History

Topics

Related Documents

TPARN: Triple-Path Attentive Recurrent Network for Time-Domain Multichannel Speech Enhancement

Speech enhancement with ad-hoc microphone array using single source activity

Triple-Path RNN Network: A Time-and-Frequency Joint Domain Speech Separation Model

Rural communication enhancement using mobile ad-hoc network

Path availability in ad hoc network