JOURNAL ARTICLE

Prediction of abstract prosodic labels for speech synthesis

Kenneth N. Ross, Mari Ostendorf

Year: 1996, Journal: Computer Speech & Language, Vol: 10 (3), Pages: 155-185, Publisher: Elsevier BV

Abstract

Higher quality speech synthesis is required to make text-to-speech technologies useful in more applications, and prosody is one component of synthesis technology with the greatest need for improvement. This paper describes computational models for the prediction of abstract prosodic labels for synthesis—accent location, symbolic tones and relative prominence level—from text that is tagged with part-of-speech labels and marked for prosodic constituent structure. Specifically, the model uses multiple levels of a prosodic hierarchy and at each level combines decision tree probability functions with Markov sequence assumptions. An advantage of decision trees is the ability to incorporate linguistic knowledge in an automatic training framework, which is needed for building systems that reflect particular speaking styles. Studies of accent and tone variability across speakers are reported and used to motivate new evaluation metrics. Prediction experiments show an improvement in accuracy of prominence location prediction over simple decision trees, with accuracy similar to the level of variability observed across speakers.
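
The combination of decision tree probability functions with a Markov sequence assumption can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the features, tree structure, or probabilities, so the single feature `content_word`, the two labels, the hand-written stand-in tree `tree_prob`, and every number below are hypothetical. The sketch only shows how a tree that returns P(label | previous label, features) can be decoded over a word sequence with the Viterbi algorithm.

```python
import math

# Hypothetical label set: the paper predicts richer labels (tones, prominence levels).
LABELS = ("none", "accent")


def tree_prob(label, prev_label, features):
    """Toy stand-in for a decision tree leaf: P(label | prev_label, features).

    All probabilities are invented for illustration only.
    """
    if features["content_word"]:
        # Content words are usually accented, less so right after an accent.
        p_accent = 0.4 if prev_label == "accent" else 0.8
    else:
        # Function words are rarely accented.
        p_accent = 0.05 if prev_label == "accent" else 0.15
    return p_accent if label == "accent" else 1.0 - p_accent


def viterbi(words):
    """Return the most likely label sequence under a first-order Markov assumption."""
    # Assumption: the position before the first word counts as unaccented.
    scores = {lab: math.log(tree_prob(lab, "none", words[0])) for lab in LABELS}
    backpointers = []
    for feats in words[1:]:
        new_scores, pointers = {}, {}
        for lab in LABELS:
            # Pick the best predecessor label for this label at this word.
            best_prev = max(
                LABELS,
                key=lambda p: scores[p] + math.log(tree_prob(lab, p, feats)),
            )
            new_scores[lab] = scores[best_prev] + math.log(
                tree_prob(lab, best_prev, feats)
            )
            pointers[lab] = best_prev
        backpointers.append(pointers)
        scores = new_scores
    # Trace back from the best final label.
    path = [max(LABELS, key=lambda lab: scores[lab])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


words = [
    {"content_word": True},
    {"content_word": False},
    {"content_word": True},
    {"content_word": True},
]
print(viterbi(words))
```

With these invented numbers, the last content word is left unaccented because the preceding word already carries an accent, which is the kind of sequence effect that a tree applied to each word in isolation would not capture.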

Keywords:
Computer science, Prosody, Speech synthesis, Hidden Markov model, Speech recognition, Hierarchy, Decision tree, Stress (linguistics), Natural language processing, Artificial intelligence, Tone (literature), Linguistics

Metrics

Cited By: 98
FWCI (Field Weighted Citation Impact): 7.43
Refs: 0
Citation Normalized Percentile: 0.97

Topics

Speech Recognition and Synthesis
Physical Sciences → Computer Science → Artificial Intelligence
Phonetics and Phonology Research
Social Sciences → Psychology → Experimental and Cognitive Psychology
Speech and dialogue systems
Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Prosodic Boundary Prediction for Greek Speech Synthesis

Panagiotis Zervas

Journal: Journal of Computer Sciences and Applications, Year: 2013, Vol: 1 (4), Pages: 61-74

BOOK-CHAPTER

All-Prosodic Speech Synthesis

Arthur Dirksen, John Coleman

Year: 1997, Pages: 91-108

BOOK-CHAPTER

Prosodic Prediction in Brazilian Portuguese: A Contribution to Speech Synthesis

Cirineu Cecote Stein

Lecture Notes in Computer Science, Year: 2010, Pages: 152-161