Grapheme-to-phoneme (G2P) conversion is an important problem for many speech and language processing applications. G2P models are particularly useful for low-resource languages that lack well-developed pronunciation lexicons. Prominent G2P paradigms are based on initial alignments between grapheme and phoneme sequences. In this work, we devise new alignment strategies that work effectively with recurrent neural network (RNN)-based models when only a small number of pronunciations are available to train the models. In a small-data setting, we build G2P models for Pashto, Tagalog, and Lithuanian that significantly outperform a joint sequence model and a baseline RNN-based model, giving up to 14% and 9% relative reductions in phone and word error rates, respectively, when trained on a dataset of 250 words.
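The abstract does not spell out the alignment strategies themselves, but as background, the sketch below illustrates the standard idea behind alignment-based G2P: pair each grapheme with zero, one, or two phonemes by dynamic programming, so that G2P reduces to per-character sequence labeling that an RNN tagger can learn from. This is a minimal illustrative toy, not the paper's method; the cost weights, the "_" empty label, and the "|" joiner for two-phoneme labels are all assumptions made here for the example.

```python
def align_graphemes_to_phonemes(graphemes, phonemes):
    """Align graphemes to phonemes with edit-distance-style DP so that
    each grapheme receives zero, one, or two phonemes as its label."""
    G, P = len(graphemes), len(phonemes)
    INF = float("inf")
    # dp[i][j] = min cost of aligning graphemes[:i] with phonemes[:j]
    dp = [[INF] * (P + 1) for _ in range(G + 1)]
    back = [[None] * (P + 1) for _ in range(G + 1)]
    dp[0][0] = 0
    for i in range(G + 1):
        for j in range(P + 1):
            if dp[i][j] == INF:
                continue
            # one-to-one: grapheme i emits phoneme j (cost 0, preferred)
            if i < G and j < P and dp[i][j] < dp[i + 1][j + 1]:
                dp[i + 1][j + 1] = dp[i][j]
                back[i + 1][j + 1] = ("one", i, j)
            # silent grapheme: emits no phoneme (cost 1)
            if i < G and dp[i][j] + 1 < dp[i + 1][j]:
                dp[i + 1][j] = dp[i][j] + 1
                back[i + 1][j] = ("eps", i, j)
            # one-to-two: grapheme i emits phonemes j and j+1 (cost 1),
            # e.g. "x" -> "k s"
            if i < G and j + 1 < P and dp[i][j] + 1 < dp[i + 1][j + 2]:
                dp[i + 1][j + 2] = dp[i][j] + 1
                back[i + 1][j + 2] = ("two", i, j)

    assert dp[G][P] != INF, "no alignment under these operations"
    # Walk the back pointers to assign one label per grapheme. Ties among
    # minimum-cost alignments are broken arbitrarily here; practical
    # aligners learn grapheme-phoneme association weights (e.g. via EM).
    labels = ["_"] * G  # "_" marks a silent grapheme
    i, j = G, P
    while (i, j) != (0, 0):
        op, pi, pj = back[i][j]
        if op == "one":
            labels[pi] = phonemes[pj]
        elif op == "two":
            labels[pi] = phonemes[pj] + "|" + phonemes[pj + 1]
        i, j = pi, pj
    return list(zip(graphemes, labels))

# "box" -> /b aa k s/: the letter "x" maps to the two-phoneme label "k|s".
print(align_graphemes_to_phonemes(list("box"), ["b", "aa", "k", "s"]))
# [('b', 'b'), ('o', 'aa'), ('x', 'k|s')]
```

Once every word in the lexicon is labeled this way, an RNN can be trained to predict one (possibly empty or compound) phoneme label per input character, which is one common way such alignments feed neural G2P models.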