Cross-Lingual Language Modeling for Low-Resource Speech Recognition

Ping Xu; Pascale Fung

doi:10.1109/tasl.2013.2244088

ScienceGate Book Chapters

JOURNAL ARTICLE

Cross-Lingual Language Modeling for Low-Resource Speech Recognition

Ping Xu Pascale Fung

Year: 2013 Journal: IEEE Transactions on Audio Speech and Language Processing Vol: 21 (6)Pages: 1134-1144 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tasl.2013.2244088

Get Full-Text PDF Get Analytical Report

Abstract

This paper proposes using cross-lingual language modeling with syntactic information for low-resource speech recognition. We propose phrase-level transduction and syntactic reordering for transcribing a resource-poor language and translating it into a resource-rich language, if necessary. The phrase-level transduction is capable of performing n -m cross-lingual transduction. The syntactic reordering serves to model the syntactic discrepancies between the source and target languages. Our purpose is to leverage the statistics in a resource-rich language model to improve the language model of a resource-poor language and at the same time to improve low-resource speech recognition performance. We implement our cross-lingual language model using weighted finite-state transducers (WFSTs), and integrate it into a WFST-based speech recognition search space to output the transcriptions of both resource-poor and resource-rich languages. This creates an integrated speech transcription and translation framework. Evaluations on Cantonese speech transcription and Cantonese to standard Chinese translation tasks show that our proposed approach improves the system performance significantly, with up to 12.5% relative character error rate (CER) reduction over baseline language model interpolation, 6.6% relative CER reduction and 18.5% relative BLEU score improvement, compared to the best word-level transduction approach. © 2013 IEEE.

Keywords:

Computer science Natural language processing Leverage (statistics) Artificial intelligence Word error rate Transduction (biophysics) Transcription (linguistics) Phrase Language model Machine translation Mandarin Chinese Speech recognition Linguistics

Metrics

Cited By

0.94

FWCI (Field Weighted Citation Impact)

Refs

0.83

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Cross-Lingual Language Modeling for Low-Resource Speech Recognition

Abstract

Metrics

Citation History

Topics

Related Documents

Cross-lingual language modeling for low-resource speech recognition

CAM: A cross-lingual adaptation framework for low-resource language speech recognition

Exploiting Adapters for Cross-Lingual Low-Resource Speech Recognition

Cross-Lingual Word Embeddings for Low-Resource Language Modeling

Cross-Lingual Cross-Age Adaptation for Low-Resource Elderly Speech Emotion Recognition