Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions

Quốc Bảo Nguyễn; Van Tuan; Quang Trung Le; Ba Quyen Dam; Van Hai

doi:10.1145/3287921.3287938

ScienceGate Book Chapters

JOURNAL ARTICLE

Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions

Quốc Bảo Nguyễn Van Tuan Quang Trung Le Ba Quyen Dam Van Hai

Year: 2018 Pages: 222-226

DOI: 10.1145/3287921.3287938

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we first present our effort to collect a 500-hour corpus for Vietnamese read speech. After that, various techniques such as data augmentation, recurrent neural network language model rescoring, language model adaptation, bottleneck feature, system combination are applied to build the speech recognition system. Our final system achieves a low word error rate at 6.9% on the noisy test set.

Keywords:

Computer science Vietnamese Speech recognition Word error rate Bottleneck Vocabulary Language model Artificial intelligence Adaptation (eye) Hidden Markov model Set (abstract data type) Feature (linguistics) Test set Natural language processing Word (group theory) Recurrent neural network Artificial neural network

Metrics

Cited By

0.79

FWCI (Field Weighted Citation Impact)

Refs

0.78

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Phonetics and Phonology Research

Social Sciences → Psychology → Experimental and Cognitive Psychology

Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions

Abstract

Metrics

Citation History

Topics

Related Documents

Vietnamese large vocabulary continuous speech recognition

Vietnamese large vocabulary continuous speech recognition

Large Vocabulary Mandarin Continuous Speech Recognition under Noisy Environment

Large Vocabulary Continuous Speech Recognition System for Marathi

The RWTH large vocabulary continuous speech recognition system