Chin-Hui Lee, Jean-Luc Gauvain, Roberto Pieraccini, Lawrence R. Rabiner
During the past several years, research in large-vocabulary speech recognition has been intensively carried out worldwide, encouraged by advances in algorithms, architectures, and hardware. In the United States, the Defense Advanced Research Projects Agency (DARPA) spoken-language-processing community has focused its efforts on several tasks. These include the 991-word naval resource management (RM) speech-recognition task, the open-vocabulary, spontaneous-speech air-travel information system (ATIS) speech-understanding task, and the 20,000-word Wall Street Journal (WSJ) dictation task. Although researchers have learned a great deal about how to build and efficiently implement large-vocabulary speech-recognition systems, many fundamental questions remain for which there are no definitive answers. This paper focuses on the basic structure of a large-vocabulary speech-recognition system, considerations in choosing a set of subword units, methods of training, integration of a language model, and implementation of a complete system. The paper also reports on some recent results, obtained at AT&T Bell Laboratories, on the DARPA RM task.
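The integration of a language model mentioned above is conventionally expressed through the Bayes decision rule: the recognizer selects the word sequence W maximizing log P(O|W) + λ·log P(W), where the first term is the acoustic score and the second the language-model score. A minimal sketch, with invented candidate sentences and made-up log-probabilities (the function name and weights are illustrative assumptions, not part of the paper):

```python
# Toy illustration of combining acoustic and language-model scores
# via the Bayes decision rule. All scores are invented for illustration.

def best_hypothesis(hypotheses, lm_weight=1.0):
    """Pick the hypothesis maximizing acoustic + lm_weight * language score.

    hypotheses: list of (words, acoustic_logprob, lm_logprob) tuples.
    """
    return max(
        hypotheses,
        key=lambda h: h[1] + lm_weight * h[2],
    )[0]

# Hypothetical competing transcriptions with made-up log-probabilities:
# the second is acoustically slightly better but far less likely as text.
candidates = [
    ("show all ships", -120.0, -6.2),
    ("show all chips", -118.5, -9.7),
]

print(best_hypothesis(candidates, lm_weight=1.0))  # -> show all ships
```

Setting `lm_weight=0` reduces the decision to acoustic evidence alone, which here would prefer the implausible "show all chips"; the language-model term corrects it.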