JOURNAL ARTICLE

Optimizing deep bottleneck feature extraction

Abstract

We investigate several optimizations to a recently published architecture for extracting bottleneck features for large-vocabulary speech recognition with deep neural networks. We are able to improve recognition performance of first-pass systems from a 12% relative word error rate reduction reported previously to 21%, compared to MFCC baselines on a Tagalog conversational telephone speech corpus. This is achieved by using different input features, training the network to predict context-dependent targets, employing an efficient learning rate schedule and varying several architectural details. Evaluations on two larger German and French speech transcription tasks show that the optimizations proposed are universally applicable and yield comparable gains on other corpora (19.9% and 22.8%, respectively).
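The gains quoted in the abstract are relative word error rate (WER) reductions against an MFCC baseline. As a minimal sketch of that arithmetic, the snippet below computes a relative reduction from two absolute WER values. The function names and the example WER values are hypothetical and chosen only for illustration; the abstract reports relative figures only, not the underlying absolute error rates.

```python
def relative_wer_reduction(baseline_wer: float, system_wer: float) -> float:
    """Return the relative WER reduction in percent, measured against the baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - system_wer) / baseline_wer


def wer_after_reduction(baseline_wer: float, relative_reduction: float) -> float:
    """Return the absolute WER implied by a relative reduction given in percent."""
    return baseline_wer * (1.0 - relative_reduction / 100.0)


# Hypothetical absolute WERs (NOT taken from the paper), chosen so that the
# relative reduction matches the 21% figure quoted in the abstract.
baseline = 60.0  # assumed MFCC baseline WER in percent
improved = 47.4  # assumed bottleneck-feature system WER in percent

print(round(relative_wer_reduction(baseline, improved), 1))
print(round(wer_after_reduction(baseline, 21.0), 1))
```

A relative reduction is always tied to its baseline: the same percentage corresponds to a different absolute improvement on each corpus, so the Tagalog, German and French figures are not directly comparable as absolute gains.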

Keywords:
Computer science, Bottleneck, Word error rate, Speech recognition, Vocabulary, Artificial intelligence, Artificial neural network, Telephony, Feature extraction, Schedule, Context (archaeology), Deep learning, Reduction (mathematics), Natural language processing

Metrics

Cited By: 11
FWCI (Field Weighted Citation Impact): 4.24
Refs: 24
Citation Normalized Percentile: 0.95

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

BOOK-CHAPTER

Bottleneck Feature Extraction for Gene Expression Using Deep Learning

Tanima Thakur, Isha Batra, Arun Malik

Advances in human and social aspects of technology book series Year: 2024 Pages: 311-332
JOURNAL ARTICLE

Optimizing Cyber Threat Detection Through Bottleneck Feature Extraction and Adaptive Boosting

B. Menaka, S. Arulselvarani

Journal: Indian Journal of Science and Technology Year: 2025 Vol: 18 (28) Pages: 2246-2256
BOOK-CHAPTER

Bottleneck Feature Extraction-Based Deep Neural Network Model for Facial Emotion Recognition

Tian Ma, Kavuma Benon, Bamweyana Arnold, Keping Yu, Yang Yan, Qiaozhi Hua, Zheng Wen, Anup Kumar Paul

Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering Year: 2020 Pages: 30-46