This paper presents a human-like dynamic programming neural network method for speech recognition using dynamic time warping. The networks are configured, much like human’s, such that the minimum states of the network’s energy function represent the near-best correlation between test and reference patterns. The dynamics and properties of the neural networks are analytically explained. Simulations for classifying speaker-dependent isolated words, consisting of 0 to 9 and A to Z, show that the method is better than conventional methods. The hardware implementation of this method is also presented.
Talal Bin AminIftekhar Mahmood
Yurika PermanasariErwin HarahapErwin Prayoga Ali
Chaug-Ching HuangJhing-Fa WangChung‐Hsien WuJau‐Yien Lee