Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training

Sheng Li; Yuya Akita; Tatsuya Kawahara

doi:10.1587/transinf.2015edp7047

ScienceGate Book Chapters

JOURNAL ARTICLE

Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training

Sheng Li Yuya Akita Tatsuya Kawahara

Year: 2015 Journal: IEICE Transactions on Information and Systems Vol: E98.D (8)Pages: 1545-1552 Publisher: Institute of Electronics, Information and Communication Engineers

DOI: 10.1587/transinf.2015edp7047

Get Full-Text PDF Get Analytical Report

Abstract

The paper addresses a scheme of lightly supervised training of an acoustic model, which exploits a large amount of data with closed caption texts but not faithful transcripts. In the proposed scheme, a sequence of the closed caption text and that of the ASR hypothesis by the baseline system are aligned. Then, a set of dedicated classifiers is designed and trained to select the correct one among them or reject both. It is demonstrated that the classifiers can effectively filter the usable data for acoustic model training. The scheme realizes automatic training of the acoustic model with an increased amount of data. A significant improvement in the ASR accuracy is achieved from the baseline system and also in comparison with the conventional method of lightly supervised training based on simple matching.

Keywords:

Computer science Discriminative model USable Training set Scheme (mathematics) Labeled data Artificial intelligence Speech recognition Exploit Transcription (linguistics) Acoustic model Machine learning Pattern recognition (psychology) Speech processing Multimedia

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.03

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training

Abstract

Metrics

Citation History

Topics

Related Documents

Discriminative data selection for lightly supervised training of acoustic model using closed caption texts

Improving broadcast news transcription by lightly supervised discriminative training

Investigating lightly supervised acoustic model training

Lightly supervised and unsupervised acoustic model training

Lightly supervised training for risk-based discriminative language models