JOURNAL ARTICLE

SPADE: Self-Supervised Pretraining for Acoustic Disentanglement

Abstract

Self-supervised representation learning approaches have grown in popularity due to the ability to train models on large amounts of unlabeled data and have demonstrated success in diverse fields such as natural language processing, computer vision, and speech. Previous self-supervised work in the speech domain has disentangled multiple attributes of speech such as linguistic content, speaker identity, and rhythm. In this work, we introduce a self-supervised approach to disentangle room acoustics from speech and use the acoustic representation on the downstream task of device arbitration. Our results demonstrate that our proposed approach significantly improves performance over a baseline when labeled training data is scarce, indicating that our pretraining scheme learns to encode room acoustic information while remaining invariant to other attributes of the speech signal.

Keywords:
Computer science Speech recognition ENCODE Labeled data Artificial intelligence Task (project management) Representation (politics) Speech processing Scheme (mathematics) Natural language processing

Metrics

1
Cited By
0.26
FWCI (Field Weighted Citation Impact)
25
Refs
0.53
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Supervised and Self-Supervised Pretraining Based Covid-19 Detection Using Acoustic Breathing/Cough/Speech Signals

Xingyu ChenQiushi ZhuJie ZhangLi-Rong Dai

Journal:   ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Year: 2022 Pages: 561-565
JOURNAL ARTICLE

FeDi: Feature disentanglement for self-supervised learning

Jeong Ryong LeeGeonhui SonDosik Hwang

Journal:   Pattern Recognition Year: 2025 Vol: 172 Pages: 112619-112619
BOOK-CHAPTER

Self-Supervised Pretraining for Cortical Surface Analysis

Dániel UnyiBálint Gyires-Tóth

Lecture notes in computer science Year: 2024 Pages: 96-108
© 2026 ScienceGate Book Chapters — All rights reserved.