Quốc Bảo Nguyễn, Jonas Gehring, Kevin Kilgour, Alex Waibel
We investigate several optimizations to a recently published architecture for extracting bottleneck features for large-vocabulary speech recognition with deep neural networks. Compared to MFCC baselines on a Tagalog conversational telephone speech corpus, we improve the relative word error rate reduction of first-pass systems from the previously reported 12% to 21%. This is achieved by using different input features, training the network to predict context-dependent targets, employing an efficient learning rate schedule, and varying several architectural details. Evaluations on two larger German and French speech transcription tasks show that the proposed optimizations are universally applicable and yield comparable gains on those corpora (19.9% and 22.8% relative, respectively).
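The abstract mentions an efficient learning rate schedule as one ingredient of the improvement but does not spell it out. A common choice in DNN acoustic-model training of this era is a "newbob"-style schedule, which holds the rate constant until validation improvement stalls and then decays it every epoch. The sketch below is an illustrative assumption, not the paper's actual schedule; the function name, thresholds, and decay factor are all hypothetical.

```python
def newbob_schedule(val_errors, initial_lr=1.0, decay=0.5, threshold=0.005):
    """Return the learning rate used at each epoch (illustrative sketch).

    The rate stays constant until the relative improvement in validation
    error falls below `threshold`; from then on it is multiplied by
    `decay` after every epoch (exponential ramp-down).
    """
    lrs = []
    lr = initial_lr
    ramping = False
    prev = None
    for err in val_errors:
        lrs.append(lr)  # rate used for this epoch
        if prev is not None:
            improvement = (prev - err) / prev
            if ramping or improvement < threshold:
                ramping = True   # once ramping starts, keep decaying
                lr *= decay
        prev = err
    return lrs
```

For example, with validation errors of 0.50, 0.45, 0.449, 0.448, the improvement between the second and third epochs falls below the threshold, so the rate is halved from the fourth epoch onward.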