Eigenvoice-based methods have been shown to be effective for fast speaker adaptation when only a small amount of adaptation data is available. Conventionally, these methods employ linear principal component analysis (PCA) to find the most important eigenvoices. Recently, in what we called kernel eigenvoice (KEV) speaker adaptation, we suggested the use of kernel PCA to compute the eigenvoices so as to exploit possible nonlinearity in the data. The major challenge is that, unlike the standard eigenvoice (EV) method, an adapted speaker model found by KEV adaptation resides in the high-dimensional kernel-induced feature space; it is not clear how to obtain the constituent Gaussians of the adapted model that are needed for the computation of state observation likelihoods during the estimation of eigenvoice weights and subsequent decoding. Our solution is the use of composite kernels in such a way that state observation likelihoods can be computed using only kernel functions. In an evaluation on the TIDIGITS task using less than 10 s of adaptation speech, KEV speaker adaptation with composite Gaussian kernels is found to outperform a speaker-independent model as well as adapted models found by EV, MAP, or MLLR adaptation using 2.1 s and 4.1 s of speech.
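The kernel PCA computation the abstract refers to can be sketched as follows. This is a minimal illustration of kernel PCA with a Gaussian kernel, not the paper's actual KEV procedure: the choice of kernel width `gamma`, the toy "supervectors", and the function names are assumptions for the example only.

```python
import numpy as np

def gaussian_kernel_matrix(X, gamma=0.1):
    # Pairwise Gaussian (RBF) kernel: k(x, y) = exp(-gamma * ||x - y||^2).
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * d2)

def kernel_pca(X, n_components=2, gamma=0.1):
    # Kernel PCA: eigendecompose the double-centered kernel matrix and
    # project the training points onto the leading kernel eigenvectors.
    n = X.shape[0]
    K = gaussian_kernel_matrix(X, gamma)
    one = np.ones((n, n)) / n
    Kc = K - one @ K - K @ one + one @ K @ one   # center K in feature space
    vals, vecs = np.linalg.eigh(Kc)              # eigh returns ascending order
    vals, vecs = vals[::-1], vecs[:, ::-1]       # reorder to descending
    # Scale eigenvectors so the feature-space principal axes have unit norm.
    alphas = vecs[:, :n_components] / np.sqrt(np.maximum(vals[:n_components], 1e-12))
    return Kc @ alphas                           # projected coordinates

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(20, 5))   # 20 toy "supervectors" of dimension 5
    Z = kernel_pca(X, n_components=3)
    print(Z.shape)
```

In the KEV setting, each row of `X` would correspond to a speaker supervector, and the leading kernel principal components would play the role of the (kernel) eigenvoices; an adaptation step would then estimate a new speaker's weights over these components.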
Brian Mak, James T. Kwok, Simon Ho