Automated detection of wolf howls using audio spectrogram transformers

Nikolai Makarov; А. А. Савченко; Iuliia Zemtsova; Maxim Novopoltsev; Andrey D. Poyarkov; Anastasia O. Viricheva; М. Д. Чистополова; А. А. Никольский; J. A. Hernandez-Blanco

doi:10.1038/s41598-025-11413-z

JOURNAL ARTICLE

Year: 2025; Journal: Scientific Reports; Vol: 15 (1); Pages: 26641; Publisher: Nature Portfolio

Abstract

The grey wolf (Canis lupus) is a pivotal species for ecological studies. As a key participant in ecosystem processes, it also serves as a model for investigating social structure formation and ecological adaptation. However, the species' complex social behavior, spatial dynamics, and expansive habitats make monitoring and population assessments across large areas particularly challenging. In recent years, audio traps have been used to collect extensive datasets of wolf vocalizations, particularly howls. Yet, manually detecting wolf howls in lengthy recordings remains a labor-intensive and inefficient task. We propose an approach leveraging modern machine-learning techniques to address this challenge. Following a comprehensive analysis of sound processing methods, we developed two state-of-the-art deep learning models based on the Audio Spectrogram Transformer architecture. The first model classifies audio for the presence of animal sounds with a precision of 98.3% and a recall of 99.3%. The second model distinguishes wolf howls from other animal sounds with a precision of 89.6% and a recall of 93.4%. These models significantly enhance the efficiency and accuracy of wolf vocalization detection, supporting ecological monitoring and research efforts.
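The two models described in the abstract can be read as a two-stage filter over audio clips. The sketch below is a minimal illustration under that assumption; the abstract does not state how the models are chained. The names `detect_howls`, `has_animal_sound`, and `is_wolf_howl` are hypothetical placeholders for the trained classifiers, not the authors' code.

```python
from typing import Callable, Iterable, List, Sequence

# One audio segment as raw samples (hypothetical representation).
Clip = Sequence[float]


def detect_howls(
    clips: Iterable[Clip],
    has_animal_sound: Callable[[Clip], bool],
    is_wolf_howl: Callable[[Clip], bool],
) -> List[int]:
    """Return the indices of clips accepted by both stages.

    Stage 1 (has_animal_sound) discards clips with no animal sound.
    Stage 2 (is_wolf_howl) runs only on the clips that stage 1 kept.
    Both callables are stand-ins for trained models.
    """
    hits: List[int] = []
    for index, clip in enumerate(clips):
        if not has_animal_sound(clip):
            continue  # skip the second model for silent or noise-only clips
        if is_wolf_howl(clip):
            hits.append(index)
    return hits
```

With this arrangement the second classifier is evaluated only on clips that pass the first, which would reduce the work needed on long recordings that are mostly free of animal sounds.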

Keywords: Spectrogram; Computer science; Transformer; Speech recognition; Engineering; Electrical engineering

Metrics

FWCI (Field Weighted Citation Impact): 5.33

Citation Normalized Percentile: 0.88

Topics

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Animal Vocal Communication and Behavior

Life Sciences → Biochemistry, Genetics and Molecular Biology → Developmental Biology

Video Analysis and Summarization

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Spectrogram Transformers for Audio Classification

MAST: Multiscale Audio Spectrogram Transformers

Vocal Biomarkers for Parkinson’s Disease Classification Using Audio Spectrogram Transformers

Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers

Emergency sirens and wolf howls