Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data

Vincent Wan; James Carmichael

doi:10.21437/interspeech.2005-853

ScienceGate Book Chapters

JOURNAL ARTICLE

Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data

Vincent Wan James Carmichael

Year: 2005 Pages: 3321-3324

DOI: 10.21437/interspeech.2005-853

Get Full-Text PDF Get Analytical Report

Abstract

This paper describes a new formulation of a polynomial sequence kernel based on dynamic time warping (DTW) for support vector machine (SVM) classification of isolated words given very sparse training data. The words are uttered by dysarthric speakers who suffer from debilitating neurological conditions that make the collection of speech samples a timeconsuming and low-yield process. Data for building dysarthric speech recognition engines are therefore limited. Simulations show that the SVM based approach is significantly better than standard DTW and hidden Markov model (HMM) approaches when given sparse training data. In conditions where the models were constructed from three examples of each word, the SVM approach recorded a 45% lower error rate (relative) than the DTW approach and a 35% lower error rate than the HMM approach.

Keywords:

Dynamic time warping Hidden Markov model Support vector machine Speech recognition Computer science Word error rate Pattern recognition (psychology) Kernel (algebra) Artificial intelligence Polynomial kernel Polynomial Kernel method Mathematics

Metrics

Cited By

2.26

FWCI (Field Weighted Citation Impact)

Refs

0.87

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data

Abstract

Metrics

Citation History

Topics

Related Documents

Automatic speech recognition with sparse training data for dysarthric speakers

MP-Polynomial Kernel for Training Support Vector Machines

Support vector machine with dynamic time-alignment kernel for speech recognition

A Novel Linear-Polynomial Kernel to Construct Support Vector Machines for Speech Recognition

Use of Support Vector Machines through Linear-Polynomial (LP) Kernel for Speech Recognition