Alan Tan Wee ChiatChockalingam Aravind VaithilingamHou Kit MunPraveen Edward James
The purpose of this paper is to design an efficient recurrent neural network (RNN)-based speech recognition system using software with long short-term memory (LSTM). The design process involves speech acquisition, pre-processing, feature extraction, training and pattern recognition tasks for a spoken sentence recognition system using LSTM-RNN. There are five layers namely, an input layer, a fully connected layer, a hidden LSTM layer, SoftMax layer and a sequential output layer. A vocabulary of 80 words which constitute 20 sentences is used. The depth of the layer is chosen as 20, 42 and 60 and the accuracy of each system is determined. The results reveal that the maximum accuracy of 89% is achieved when the depth of the hidden layer is 42. Since the depth of the hidden layer is fixed for a task, increased performance can be achieved by increasing the number of hidden layers.
Praveen Edward JamesHou Kit MunChockalingam Aravind VaithilingamAlan Tan Wee Chiat
Siddhant C. JoshiDR. A.N. CHEERAN
Yeh-Huann GohKai-Xian LauYoon-Ket Lee