Prediction of abstract prosodic labels for speech synthesis

Kenneth N. Ross; Mari Ostendorf

doi:10.1006/csla.1996.0010

ScienceGate Book Chapters

JOURNAL ARTICLE

Prediction of abstract prosodic labels for speech synthesis

Kenneth N. Ross Mari Ostendorf

Year: 1996 Journal: Computer Speech & Language Vol: 10 (3)Pages: 155-185 Publisher: Elsevier BV

DOI: 10.1006/csla.1996.0010

Get Full-Text PDF Get Analytical Report

Abstract

Higher quality speech synthesis is required to make text-to-speech technologies useful in more applications, and prosody is one component of synthesis technology with the greatest need for improvement. This paper describes computational models for the prediction of abstract prosodic labels for synthesis—accent location, symbolic tones and relative prominence level—from text that is tagged with part-of-speech labels and marked for prosodic constituent structure. Specifically, the model uses multiple levels of a prosodic hierarchy and at each level combines decision tree probability functions with Markov sequence assumptions. An advantage of decision trees is the ability to incorporate linguistic knowledge in an automatic training framework, which is needed for building systems that reflect particular speaking styles. Studies of accent and tone variability across speakers are reported and used to motivate new evaluation metrics. Prediction experiments show an improvement in accuracy of prominence location prediction over simple decision trees, with accuracy similar to the level of variability observed across speakers.

Keywords:

Computer science Prosody Speech synthesis Hidden Markov model Speech recognition Hierarchy Decision tree Stress (linguistics) Natural language processing Artificial intelligence Tone (literature) Linguistics

Metrics

Cited By

7.43

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Phonetics and Phonology Research

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech and dialogue systems

Physical Sciences → Computer Science → Artificial Intelligence

Prediction of abstract prosodic labels for speech synthesis

Abstract

Metrics

Citation History

Topics

Related Documents

Prosodic Boundary Prediction for Greek Speech Synthesis

All-Prosodic Speech Synthesis

Prosodic Prediction in Brazilian Portuguese: A Contribution to Speech Synthesis

Self-attention Based Prosodic Boundary Prediction for Chinese Speech Synthesis

Generalizing prosodic prediction of speech recognition errors