Audio Barlow Twins: Self-Supervised Audio Representation Learning

Jonah Anton; Harry Coppock; Pancham Shukla; Björn W. Schuller

doi:10.1109/icassp49357.2023.10095041

JOURNAL ARTICLE

Year: 2023 Pages: 1-5

Abstract

The Barlow Twins self-supervised learning objective requires neither negative samples nor asymmetric learning updates, achieving results on a par with the current state-of-the-art within Computer Vision. As such, we present Audio Barlow Twins, a novel self-supervised audio representation learning approach, adapting Barlow Twins to the audio domain. We pre-train on the large-scale audio dataset AudioSet, and evaluate the quality of the learnt representations on 18 tasks from the HEAR 2021 Challenge, achieving results which outperform, or otherwise are on a par with, the current state-of-the-art for instance discrimination self-supervised learning approaches to audio representation learning. Code at https://github.com/jonahanton/SSL_audio.
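
For orientation, the Barlow Twins objective mentioned in the abstract can be sketched as below. This is a minimal NumPy illustration of the general objective, not the authors' audio implementation (see the linked repository for that). It computes the cross-correlation matrix between the embeddings of two augmented views and pushes it toward the identity matrix. The function name `barlow_twins_loss`, the weight `lambd=5e-3`, and the random toy embeddings are assumptions made for this example only.

```python
import numpy as np


def barlow_twins_loss(z_a, z_b, lambd=5e-3, eps=1e-12):
    """Barlow Twins loss for two batches of embeddings of shape (N, D).

    z_a, z_b: embeddings of two augmented views of the same N inputs.
    lambd:    weight of the redundancy-reduction (off-diagonal) term;
              the value here is an assumed default, not taken from this page.
    """
    n, _ = z_a.shape

    # Standardise each feature dimension across the batch.
    z_a = (z_a - z_a.mean(axis=0)) / (z_a.std(axis=0) + eps)
    z_b = (z_b - z_b.mean(axis=0)) / (z_b.std(axis=0) + eps)

    # Cross-correlation matrix between the two views, shape (D, D).
    c = z_a.T @ z_b / n

    # Invariance term: diagonal entries should be close to 1.
    diag = np.diag(c)
    on_diag = np.sum((1.0 - diag) ** 2)

    # Redundancy-reduction term: off-diagonal entries should be close to 0.
    off_diag = np.sum(c ** 2) - np.sum(diag ** 2)

    return on_diag + lambd * off_diag


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    view_a = rng.normal(size=(256, 8))                   # toy embeddings
    view_b = view_a + 0.1 * rng.normal(size=(256, 8))    # slightly perturbed view
    unrelated = rng.normal(size=(256, 8))                # embeddings of other inputs

    print("matched views:  ", barlow_twins_loss(view_a, view_b))
    print("unrelated views:", barlow_twins_loss(view_a, unrelated))
```

Because the loss is defined only on the correlation statistics of positive pairs, it needs no negative samples and treats both branches symmetrically, which is the property the abstract highlights.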

Keywords:

Computer science; Representation (politics); Feature learning; Code (set theory); Artificial intelligence; Speech recognition; Scale (ratio); Sound quality; Domain (mathematical analysis); Machine learning; Set (abstract data type)

Metrics

FWCI (Field Weighted Citation Impact): 0.54

Citation Normalized Percentile: 0.55

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Graph Barlow Twins: A self-supervised representation learning framework for graphs

Audio–visual self-supervised representation learning: A survey

Audio Albert: A Lite Bert for Self-Supervised Learning of Audio Representation

Comparing Learning Methodologies for Self-Supervised Audio-Visual Representation Learning

Audio-Visual Predictive Coding for Self-Supervised Visual Representation Learning