Speech recognition using time-warping neural networks

Kazuhiko Aikawa

doi:10.1109/nnsp.1991.239508

JOURNAL ARTICLE

Year: 2002 Vol: 1 Pages: 337-346

Abstract

The author proposes a time-warping neural network (TWNN) for phoneme-based speech recognition. The TWNN accepts phonemes of arbitrary duration, whereas conventional phoneme recognition networks have a fixed-length input window. The network is intended to cope not only with variability in phoneme duration but also with time warping within a phoneme. It is composed of several time-warping units, each of which has its own time-warping function; these functions are embedded between the input layer and the first hidden layer. The proposed network achieves higher phoneme recognition accuracy than a baseline recognizer based on conventional feedforward neural networks and linear time alignment. Its accuracy is also higher than that achieved with discrete hidden Markov models.
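The idea of placing a time-warping function between the input layer and the first hidden layer can be illustrated with a minimal sketch. It is not the author's implementation: the abstract does not specify the warping functions, so the power-law family `power_warp`, the helper names `warp_to_fixed_length` and `twnn_first_layer_inputs`, and the use of linear interpolation are all assumptions made for illustration. With the identity warp, the same routine reduces to the linear time alignment used by the baseline.

```python
from typing import Callable, List, Sequence

Frame = Sequence[float]          # one feature vector per time frame
Warp = Callable[[float], float]  # monotonic map from [0, 1] to [0, 1]


def warp_to_fixed_length(frames: Sequence[Frame], n_out: int,
                         warp: Warp = lambda u: u) -> List[List[float]]:
    """Resample a variable-length frame sequence to n_out frames.

    Output frame k reads the input at normalized time warp(k / (n_out - 1)),
    using linear interpolation between neighbouring input frames.
    The default identity warp gives plain linear time alignment.
    """
    if not frames:
        raise ValueError("frames must be non-empty")
    if n_out < 1:
        raise ValueError("n_out must be at least 1")
    last = len(frames) - 1
    out = []
    for k in range(n_out):
        u = k / (n_out - 1) if n_out > 1 else 0.0
        # Clamp so a slightly out-of-range warp cannot index past the ends.
        pos = min(max(warp(u) * last, 0.0), float(last))
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append([(1.0 - frac) * a + frac * b
                    for a, b in zip(frames[lo], frames[hi])])
    return out


def power_warp(gamma: float) -> Warp:
    """Hypothetical warp family (an assumption): w(u) = u ** gamma."""
    return lambda u: u ** gamma


def twnn_first_layer_inputs(frames: Sequence[Frame], n_out: int,
                            warps: Sequence[Warp]) -> List[List[List[float]]]:
    """One fixed-length view of the phoneme per time-warping unit."""
    return [warp_to_fixed_length(frames, n_out, w) for w in warps]


# A five-frame "phoneme" with one feature per frame.
segment = [[0.0], [1.0], [2.0], [3.0], [4.0]]
linear = warp_to_fixed_length(segment, 3)                   # [[0.0], [2.0], [4.0]]
warped = warp_to_fixed_length(segment, 3, power_warp(2.0))  # [[0.0], [1.0], [4.0]]
```

Each warp yields a fixed-length window regardless of the segment's duration, so a conventional feedforward network could consume the result. How the units' outputs are combined and how the warps are chosen or trained is not stated in the abstract.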

Keywords:

Dynamic time warping; Image warping; Speech recognition; Computer science; Artificial neural network; Hidden Markov model; Time delay neural network; Pattern recognition (psychology); Feedforward neural network; Artificial intelligence

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.06

Citation Normalized Percentile

Topics

Neural Networks and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Time Series Analysis and Forecasting

Physical Sciences → Computer Science → Signal Processing

Related Documents

Phoneme recognition using time-warping neural networks.

Speech Recognition using Dynamic Time Warping

Speech recognition using time-delay neural networks

Human-like dynamic programming neural networks for dynamic time warping speech recognition

Speech recognition using dynamic time warping with neural network trained templates