Towards automatic transcription of large spoken archives - English ASR for the MALACH project

Bhuvana Ramabhadran; Jing Huang; Michael Picheny

doi:10.1109/icassp.2003.1198756

ScienceGate Book Chapters

JOURNAL ARTICLE

Towards automatic transcription of large spoken archives - English ASR for the MALACH project

Bhuvana Ramabhadran Jing Huang Michael Picheny

Year: 2003 Journal: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). Vol: 1 Pages: I-216

DOI: 10.1109/icassp.2003.1198756

Get Full-Text PDF Get Analytical Report

Abstract

Digital archives have emerged as the pre-eminent method for capturing the human experience. Before such archives can be used efficiently, their contents must be described. The NSF-funded MALACH project aims to provide improved access to large spoken archives by advancing the state-of-the-art in automated speech recognition (ASR), Information Retrieval (IR) and related technologies [1,2] for multiple languages. This paper describes the ASR research for the English speech in the MALACH corpus. The MALACH corpus consists of unconstrained, natural speech filled with disfluencies, heavy accents, age-related coarticulation, uncued speaker and language switching, and emotional speech collected in the form of interviews from over 52000 speakers in 32 languages. In this paper, we describe this new testbed for developing speech recognition algorithms and report on the performance of well-known techniques for building better acoustic models for the speaking styles seen in this corpus. The best English ASR system to date has a word error rate of 43.8% on this corpus.

Keywords:

Computer science Coarticulation Transcription (linguistics) Natural language processing Speech recognition Speech corpus Speech technology Artificial intelligence Speech processing Speech synthesis Linguistics

Metrics

Cited By

3.92

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Towards automatic transcription of large spoken archives - English ASR for the MALACH project

Abstract

Metrics

Citation History

Topics

Related Documents

Towards Automatic Transcription of Large Spoken Archives in Agglutinating Languages – Hungarian ASR for the MALACH Project

Towards Automatic Transcription of Spontaneous Czech Speech in the MALACH Project

Automated transcription and topic segmentation of large spoken archives

Automatic transcription of Czech, Russian, and Slovak spontaneous speech in the MALACH project

Automatic Transcription of Czech Language Oral History in the MALACH Project: Resources and Initial Experiments