We describe an unusual ASR application: recognition of command words from severely dysarthric speakers, who have poor control of their articulators. The goal is to allow these clients to control assistive technology by voice. While this is a small-vocabulary, speaker-dependent, isolated-word application, the speech material is more variable than normal, and only a small amount of data is available for training. After training a CDHMM recogniser, it is necessary to predict its likely performance without using an independent test set, so that confusable words can be replaced by alternatives. We present a battery of measures of consistency and confusability, based on forced alignment, which can be used to predict recogniser performance. We show how these measures perform, and how they are presented to the clinicians who are the users of the system.
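To make the idea concrete, here is a minimal illustrative sketch (not the paper's implementation) of how forced-alignment scores might be turned into consistency and confusability measures. It assumes we already have, for each training token of a word, an average per-frame alignment log-likelihood under the word's own model and under a competitor's model; the function names and the specific measures (score spread for consistency, own-vs-cross score margin for confusability) are hypothetical.

```python
# Illustrative sketch: predicting recogniser behaviour from
# forced-alignment log-likelihood scores, without a test set.
from statistics import mean, pstdev

def consistency(own_scores):
    """Spread of a word's alignment scores under its own model.
    Lower spread suggests the speaker produces the word consistently.
    (Hypothetical measure: population standard deviation.)"""
    return pstdev(own_scores)

def confusability(own_scores, cross_scores):
    """Margin between a word's mean score under its own model and
    under a competitor word's model. A small (or negative) margin
    suggests the two words are confusable and one should be replaced."""
    return mean(own_scores) - mean(cross_scores)

# Hypothetical average per-frame log-likelihoods for three tokens
# of word A, aligned against model A and against competitor model B.
word_a_vs_model_a = [-61.2, -60.8, -61.5]
word_a_vs_model_b = [-61.9, -62.1, -61.7]

print(round(consistency(word_a_vs_model_a), 3))
print(round(confusability(word_a_vs_model_a, word_a_vs_model_b), 3))
```

A clinician-facing tool could rank vocabulary words by such measures and flag pairs with small margins as candidates for replacement.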