JOURNAL ARTICLE

Automated transcription and topic segmentation of large spoken archives

Abstract

Digital archives have emerged as the pre-eminent method for capturing the human experience. Before such archives can be used efficiently, their contents must be described. The scale of such archives, along with the associated content mark-up cost, makes it impractical to provide access via purely manual means, but automatic technologies for search in spoken materials still have relatively limited capabilities. The NSF-funded MALACH project will use the world's largest digital archive of video oral histories, collected by the Survivors of the Shoah Visual History Foundation (VHF), to make a quantum leap in the ability to access such archives by advancing the state of the art in Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and related technologies [1, 2]. This corpus consists of over 115,000 hours of unconstrained, natural speech from 52,000 speakers in 32 different languages, filled with disfluencies, heavy accents, age-related coarticulations, and un-cued speaker and language switching. This paper discusses some of the ASR and NLP tools and technologies that we have been building for the English speech in the MALACH corpus. We also discuss this new test bed while emphasizing the unique characteristics of this corpus.

Keywords:
Computer science; Cued speech; Transcription (linguistics); Natural language processing; Segmentation; Natural language; Spoken language; Speech processing; Artificial intelligence; Speech recognition; World Wide Web; Linguistics

Metrics

Cited by: 19
FWCI (Field-Weighted Citation Impact): 3.83
References: 5
Citation Normalized Percentile: 0.94 (in top 10%)

Topics

Speech Recognition and Synthesis (Physical Sciences → Computer Science → Artificial Intelligence)
Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)
Speech and Audio Processing (Physical Sciences → Computer Science → Signal Processing)