JOURNAL ARTICLE

Convolutional Neural Networks-based continuous speech recognition using raw speech signal

Abstract

State-of-the-art automatic speech recognition systems model the relationship between acoustic speech signal and phone classes in two stages, namely, extraction of spectral-based features based on prior knowledge followed by training of acoustic model, typically an artificial neural network (ANN). In our recent work, it was shown that Convolutional Neural Networks (CNNs) can model phone classes from raw acoustic speech signal, reaching performance on par with other existing feature-based approaches. This paper extends the CNN-based approach to large vocabulary speech recognition task. More precisely, we compare the CNN-based approach against the conventional ANN-based approach on Wall Street Journal corpus. Our studies show that the CNN-based approach achieves better performance than the conventional ANN-based approach with as many parameters. We also show that the features learned from raw speech by the CNN-based approach could generalize across different databases.

Keywords:
Computer science Convolutional neural network Speech recognition Acoustic model Vocabulary Phone Feature extraction Artificial neural network SIGNAL (programming language) Artificial intelligence Hidden Markov model Feature (linguistics) Speech processing Task (project management) Pattern recognition (psychology)

Metrics

176
Cited By
16.03
FWCI (Field Weighted Citation Impact)
36
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Continuous speech recognition by convolutional neural networks

ZHANG Qing-qingLIU YongPAN Jie-linYAN Yong-hong

Journal:   DOAJ (DOAJ: Directory of Open Access Journals) Year: 2015
JOURNAL ARTICLE

Speech Recognition Using Convolutional Neural Networks

D. NagajyothiP. Siddaiah

Journal:   International Journal of Engineering & Technology Year: 2018 Vol: 7 (4.6)Pages: 133-137
JOURNAL ARTICLE

Speech Emotion Recognition Using Convolutional Neural Networks

Narsi Reddy

Journal:   International Journal for Research in Applied Science and Engineering Technology Year: 2024 Vol: 12 (8)Pages: 30-36
BOOK-CHAPTER

Speech Emotion Recognition Using Convolutional Neural Networks

Anunya SharmaKiran MalikPoonam Bansal

Communications in computer and information science Year: 2024 Pages: 90-101
© 2026 ScienceGate Book Chapters — All rights reserved.