JOURNAL ARTICLE

A hybrid method oriented to concatenative text-to-speech synthesis

Abstract

In this paper we present a speech synthesis method for diphonebased text-to-speech systems. Its main goal is to achieve\nprosodic modifications that result in more natural-sounding synthetic speech. This improvement is especially useful for emotional speech synthesis, which requires high-quality prosodic modification. We present a hybrid method based on TD-PSOLA and the harmonic plus noise model, which incorporates a novel method to jointly modify pitch and time-scale. Preliminary results show an improvement in the synthetic speech quality when high pitch modification is required.

Keywords:
Computer science Speech synthesis Speech recognition Natural language processing Artificial intelligence

Metrics

1
Cited By
0.38
FWCI (Field Weighted Citation Impact)
7
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

An Efficient Unit-selection Method for Concatenative Text-to-speech Synthesis Systems

Jerneja Žganec GrosMario Žganec

Journal:   Journal of Computing and Information Technology Year: 2007 Vol: 16 (1)Pages: 69-69
JOURNAL ARTICLE

A Hybrid Text-to-Speech System That Combines Concatenative and Statistical Synthesis Units

Stas TiomkinD. MalahSlava ShechtmanZvi Kons

Journal:   IEEE Transactions on Audio Speech and Language Processing Year: 2010 Vol: 19 (5)Pages: 1278-1288
JOURNAL ARTICLE

INDONESIAN TEXT-TO-SPEECH SYSTEM USING DIPHONE CONCATENATIVE SYNTHESIS

Sutarman Sutarman

Journal:   International Journal of Computer Systems & Software Engineering Year: 2015 Vol: 1 (1)Pages: 85-93
© 2026 ScienceGate Book Chapters — All rights reserved.