Fast speaker change detection for broadcast news transcription and indexing

Daben Liu; Francis Kubala

doi:10.21437/eurospeech.1999-167

ScienceGate Book Chapters

JOURNAL ARTICLE

Fast speaker change detection for broadcast news transcription and indexing

Daben Liu Francis Kubala

Year: 1999 Pages: 1031-1034

DOI: 10.21437/eurospeech.1999-167

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we describe a new speaker change detection algorithm designed for fast transcription and audio indexing of spoken broadcast news. We have designed a two-stage algorithm that begins with a gender-independent phone-class recognition pass. We collapse the phoneme inventory to only 4 broad classes and include 4 different models for non-speech, resulting in a small fast decoder that runs in less than 0.1 times real-time. The second stage of the SCD algorithm hypothesizes a speaker change boundary between every phone in the labeled input. The phone level time resolution in our approach permits the algorithm to run quickly while maintaining the same accuracy as a frame level approach. Applying the new algorithms to a large sample of broadcast news programs resulted in improvements in speaker change detection accuracy, speech recognition accuracy, and speed.

Keywords:

Computer science Search engine indexing Phone Speech recognition Speaker diarisation Transcription (linguistics) Voice activity detection Frame (networking) Change detection Speaker recognition Artificial intelligence Speech processing Telecommunications

Metrics

Cited By

7.86

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Fast speaker change detection for broadcast news transcription and indexing

Abstract

Metrics

Citation History

Topics

Related Documents

Fast speaker change detection for broadcast news transcription and indexing

Unsupervised Speaker Change Detection For Broadcast News Segmentation

Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition

Pre-training of Speaker Embeddings for Low-latency Speaker Change Detection in Broadcast News

On-line incremental speaker adaptation for broadcast news transcription