JOURNAL ARTICLE

Using kernel PCA to improve eigenvoice speaker adaptation

Abstract

Eigenvoice-basedmethods have been shown to be effective for fast speaker adaptation when only a'small amount of adaptation data is available. Conventionally, these methods employ linear principal component analysis (PCA) to find the most important eigenvoices. Recently, in what we called kernel eigenvoice (KEV) speaker adaptation, we suggested the use of kernel PCA to compute the eigenvoices so as to exploit possible nonlinearity in the data. The major challenge is that unlike the standard eigenvoice (EV) method, an adapted speaker model found by KEV adaptation resides in the high-dimensional kernel-induced feature space; it is not clear how to obtain the constituent Gaussians of the adapted model that are needed for the computation of state observation likelihoods during the estimation of eigenvoice weights and subsequent decoding. Our solution is the use of composite kernels in such a way that state observation likelihoods can be computed using only kernel functions. In an evaluation on the TIDIGITS task using less than 10s of adaptation speech, it is found that KEV speaker adaptation using composite Gaussian kernels outperforms a speaker-independent model and adapted models found by EV, MAP, or MLLR adaptation using 2.1s and 4.1s of speech.

Keywords:
Computer science Speech recognition Kernel (algebra) Pattern recognition (psychology) Kernel principal component analysis Artificial intelligence Adaptation (eye) Principal component analysis Speaker recognition Mixture model Kernel method Mathematics Support vector machine

Metrics

2
Cited By
0.00
FWCI (Field Weighted Citation Impact)
15
Refs
0.18
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Kernel eigenvoice speaker adaptation

Brian MakJames T. KwokSimon Ho

Journal:   IEEE Transactions on Speech and Audio Processing Year: 2005 Vol: 13 (5)Pages: 984-992
JOURNAL ARTICLE

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting

Brian MakRoger HsiaoSimon HoJames T. Kwok

Journal:   IEEE Transactions on Audio Speech and Language Processing Year: 2006 Vol: 14 (4)Pages: 1267-1280
JOURNAL ARTICLE

Evolutionary eigenvoice MLLR speaker adaptation

Reza SahraeianMehdi MohammadiAhmad AkbariAhmad Ayatollahi

Journal:   Procedia Computer Science Year: 2011 Vol: 3 Pages: 992-997
© 2026 ScienceGate Book Chapters — All rights reserved.