JOURNAL ARTICLE

Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

Abstract

In the EMIME project, we are developing a mobile device that performs personalized speech-to-speech translation: a user's spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user's voice. We integrate two techniques into a single architecture: unsupervised adaptation for HMM-based TTS using a word-based large-vocabulary continuous speech recognizer, and cross-lingual speaker adaptation for HMM-based TTS. The result is an unsupervised cross-lingual speaker adaptation system. Listening tests show very promising results, demonstrating that adapted voices sound similar to the target speaker and that the differences between supervised and unsupervised cross-lingual speaker adaptation are small.
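The pipeline the abstract describes can be sketched at a very high level: a recognizer transcribes the user's speech without supervision, a speaker transform is estimated against the input-language model, and that transform is carried over to the output-language model through a state mapping. The following toy Python sketch illustrates only that flow; the function names, the one-dimensional "models," and the global mean-bias update (a stand-in for an MLLR-style transform) are illustrative assumptions, not the paper's actual method.

```python
# Toy sketch of unsupervised cross-lingual speaker adaptation for
# HMM-based TTS. Models are dicts mapping state names to scalar means;
# a real system would use full HMMs and an LVCSR front end.

def recognize(frames, input_model):
    # Unsupervised step: stand-in for the word-based large-vocabulary
    # recognizer. Labels each input frame with the closest model state.
    return [min(input_model, key=lambda s: abs(f - input_model[s]))
            for f in frames]

def estimate_bias(frames, labels, input_model):
    # Estimate a global mean shift from the speaker's frames against the
    # input-language average-voice model (MLLR-style bias, illustrative).
    diffs = [f - input_model[s] for f, s in zip(frames, labels)]
    return sum(diffs) / len(diffs)

def cross_lingual_adapt(output_model, state_map, bias):
    # Cross-lingual step: apply the transform estimated on the input
    # language to the output-language model via a state mapping.
    return {s: m + bias for s, m in output_model.items() if s in state_map}

# Usage with made-up numbers:
input_model = {"a": 1.0, "b": 3.0}          # input-language average voice
frames = [1.5, 3.5]                          # user's (unlabelled) speech
labels = recognize(frames, input_model)      # unsupervised transcription
bias = estimate_bias(frames, labels, input_model)
adapted = cross_lingual_adapt({"x": 2.0, "y": 4.0},
                              {"x": "a", "y": "b"}, bias)
```

The key design point, as in the abstract, is that the same transform serves both languages, so no speech in the output language is needed from the user.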

Keywords:
Speech synthesis; Speech recognition; Hidden Markov model; Speaker adaptation; Cross-lingual adaptation; Natural language processing

Metrics

Cited by: 16
FWCI (Field-Weighted Citation Impact): 3.61
References: 23
Citation Normalized Percentile: 0.94 (in top 10%)

Topics

Speech Recognition and Synthesis (Physical Sciences → Computer Science → Artificial Intelligence)
Speech and dialogue systems (Physical Sciences → Computer Science → Artificial Intelligence)
Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)