This paper describes a new formulation of a polynomial sequence kernel based on dynamic time warping (DTW) for support vector machine (SVM) classification of isolated words given very sparse training data. The words are uttered by dysarthric speakers who suffer from debilitating neurological conditions that make the collection of speech samples a timeconsuming and low-yield process. Data for building dysarthric speech recognition engines are therefore limited. Simulations show that the SVM based approach is significantly better than standard DTW and hidden Markov model (HMM) approaches when given sparse training data. In conditions where the models were constructed from three examples of each word, the SVM approach recorded a 45% lower error rate (relative) than the DTW approach and a 35% lower error rate than the HMM approach.
Phil GreenJames CarmichaelAthanassios HatzisPam EnderbyMark HawleyMark Parker
Iván Mejía-GuevaraÁngel Kuri-Morales
Hiroshi ShimodairaKen-ichi NomaMitsuru NakaiShigeki Sagayama
Balwant A. SonkambleDharmpal D. Doye