JOURNAL ARTICLE

DNN Based Expressive Text-to-Speech with Limited Training Data

Abstract

Modern text-to-speech synthesis systems should deliver speech which is not just intelligible, but whose style corresponds to the domain in which synthesized speech is used. In this paper three approaches based on deep neural networks aimed at synthesis of expressive speech are presented: style code, model re-training and an architecture using shared hidden layers. Their usability is tested on a speech corpus with a limited amount of expressive speech data. A new architecture for transplanting speech styles is also presented and compared with a referent approach from literature.

Keywords:
Computer science Speech synthesis Speech recognition Referent Natural language processing Usability Artificial intelligence Architecture Style (visual arts) Domain (mathematical analysis) Speech corpus Artificial neural network Linguistics Human–computer interaction

Metrics

3
Cited By
0.15
FWCI (Field Weighted Citation Impact)
52
Refs
0.60
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Accented Text-to-Speech Synthesis With Limited Data

Xuehao ZhouMingyang ZhangYi ZhouZhizheng WuHaizhou Li

Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Year: 2024 Vol: 32 Pages: 1699-1711
JOURNAL ARTICLE

Expressive Text-To-Speech Approaches

Kanellos, IoannisSuciu, IoanaMoudenc, Thierry

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2007
JOURNAL ARTICLE

Expressive Text-To-Speech Approaches

Kanellos, IoannisSuciu, IoanaMoudenc, Thierry

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2007
JOURNAL ARTICLE

Sentence-Based Sentiment Analysis for Expressive Text-to-Speech

Alexandre TrillaFrancesc Álías

Journal:   IEEE Transactions on Audio Speech and Language Processing Year: 2012 Vol: 21 (2)Pages: 223-233
© 2026 ScienceGate Book Chapters — All rights reserved.