JOURNAL ARTICLE

Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization

Abstract

This paper examines two issues of a statistical speech synthesis approach based Gaussian process (GP) regression. Although GP-based speech synthesis can give higher performance in generating spectral parameters than the HMM-based one, a number of issues still remain. In this paper, we incorporate global variance (GV) feature to overcome over-smoothing problem into the parameter generation. Furthermore, in order to utilize an appropriate kernel function in accordance with actual data, we propose an EM-based kernel hyperparameter optimization technique. Objective and subjective evaluation results show that using GV and hyperparameter estimation enhanced the performance in spectral feature generation.

Keywords:
Hyperparameter Gaussian process Computer science Smoothing Kriging Kernel (algebra) Artificial intelligence Parametric statistics Feature (linguistics) Variance (accounting) Pattern recognition (psychology) Regression Kernel smoother Machine learning Gaussian Algorithm Kernel method Mathematics Statistics Support vector machine Radial basis function kernel

Metrics

8
Cited By
3.38
FWCI (Field Weighted Citation Impact)
23
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gaussian Processes and Bayesian Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.