JOURNAL ARTICLE

CNN-based Text-independent Automatic Speaker Identification Using Short Utterances

Mandana FasounakiEmirhan Burak YüceSerkan OnculGökhan İnce

Year: 2021 Journal:   2021 6th International Conference on Computer Science and Engineering (UBMK) Pages: 413-418

Abstract

With the widespread use of voice-controlling services and devices, the research for developing robust and fast systems for automatic speaker identification had accelerated. In this paper, we present a Convolutional Neural Network (CNN) architecture for text-independent automatic speaker identification. The primary purpose is to identify a speaker, among many others, using a short speech segment. Most of the current researches focus on deep CNNs, which were initially designed for computer vision tasks. Besides, most of the existing speaker identification methods require audio samples longer than 3 seconds in the query phase for achieving a high accuracy. We created a CNN architecture appropriate for voice and speech-related classification tasks. We propose an optimum model that achieves 99.5% accuracy on LibriSpeech and 90% accuracy on VoxCeleb 1 dataset using only 1-second test utterances in our experiments.

Keywords:
Computer science Convolutional neural network Speech recognition Focus (optics) Identification (biology) Speaker recognition Speaker identification Speaker diarisation Artificial intelligence

Metrics

16
Cited By
1.35
FWCI (Field Weighted Citation Impact)
40
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Text-independent speaker identification from short utterances based on piecewise discriminant analysis

Hiroshi Matsumoto

Journal:   Computer Speech & Language Year: 1989 Vol: 3 (2)Pages: 133-150
JOURNAL ARTICLE

Text-independent speaker recognition with short utterances

Kairui LiEdwin H. Wrench

Journal:   The Journal of the Acoustical Society of America Year: 1982 Vol: 72 (S1)Pages: S29-S30
JOURNAL ARTICLE

Text independent speaker identification using automatic acoustic segmentation

Richard C. RoseD.A. Reynolds

Journal:   International Conference on Acoustics, Speech, and Signal Processing Year: 2002 Pages: 293-296
© 2026 ScienceGate Book Chapters — All rights reserved.