JOURNAL ARTICLE

Hidden factor dynamic Bayesian networks for speech recognition

Abstract

This paper presents a novel approach to modeling speech data with Dynamic Bayesian Networks. Instead of defining a specific set of factors that affect speech signals, the factors are modeled implicitly by clustering the speech data. Different data clusters correspond to different subsets of the factor values, and these subsets are represented by corresponding factor states. The factor states, together with the phone states, form two hidden layers in the Hidden Factor Dynamic Bayesian Network (HFDBN). In this study we show that HFDBNs provide better speech recognition performance than HMMs of equal complexity. Speech recognition experiments conducted on speech data recorded in a moving car demonstrated the advantage of HFDBN over HMM for recognition of both clean and noisy speech.
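The two-hidden-layer structure described in the abstract can be illustrated with a small sketch. The code below is not the authors' implementation. It assumes, purely for illustration, that the phone-state and factor-state chains have independent transitions, that observations are discrete symbols, and that the emission probability depends on both hidden layers. The function name `forward_loglik` and all parameter names are hypothetical. The sketch computes the log-likelihood of an observation sequence with a scaled forward pass over the joint (phone state, factor state) space.

```python
import math
from itertools import product


def forward_loglik(obs, pi_q, pi_f, A_q, A_f, B):
    """Log-likelihood of `obs` under a toy two-hidden-layer model.

    Assumptions (illustrative, not taken from the paper):
      obs  : list of discrete observation symbols (ints)
      pi_q : initial distribution over phone states
      pi_f : initial distribution over factor states
      A_q  : A_q[i][j] = P(q_t = j | q_{t-1} = i)
      A_f  : A_f[k][l] = P(f_t = l | f_{t-1} = k)
      B    : B[q][f][o] = P(o_t = o | q_t = q, f_t = f)
    """
    nq, nf = len(pi_q), len(pi_f)
    states = list(product(range(nq), range(nf)))

    # Initial forward variables over the joint (phone, factor) state.
    alpha = {(q, f): pi_q[q] * pi_f[f] * B[q][f][obs[0]] for q, f in states}

    loglik = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through both chains, then weight by the emission.
            new_alpha = {}
            for q2, f2 in states:
                total = sum(
                    alpha[(q1, f1)] * A_q[q1][q2] * A_f[f1][f2]
                    for q1, f1 in states
                )
                new_alpha[(q2, f2)] = total * B[q2][f2][obs[t]]
            alpha = new_alpha

        # Rescale to avoid underflow; the log of each scale factor
        # accumulates into the sequence log-likelihood.
        norm = sum(alpha.values())
        loglik += math.log(norm)
        alpha = {s: a / norm for s, a in alpha.items()}

    return loglik
```

With a single factor state the joint space collapses to the phone states alone, so the same routine reduces to the ordinary HMM forward algorithm. That gives a simple way to compare the two model families under these assumptions.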

Keywords:
Hidden Markov model; Computer science; Dynamic Bayesian network; Speech recognition; Factor (programming language); Cluster analysis; Bayesian probability; Phone; Set (abstract data type); Bayesian network; Artificial intelligence; Dynamic factor; Pattern recognition (psychology); Mathematics

Metrics

Cited By: 5
FWCI (Field Weighted Citation Impact): 0.77
Refs: 5
Citation Normalized Percentile: 0.76

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing
Bayesian Modeling and Causal Inference
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Dynamic Bayesian networks for automatic speech recognition

Murat Deviren

Journal: National Conference on Artificial Intelligence; Year: 2002; Pages: 981-981

JOURNAL ARTICLE

Dynamic Bayesian Networks for Audio-Visual Speech Recognition

Ara Nefian, Luhong Liang, Xiaobo Pi, Xiaoxing Liu, Kevin J. Murphy

Journal: EURASIP Journal on Advances in Signal Processing; Year: 2002; Vol: 2002 (11)

JOURNAL ARTICLE

Dynamic Bayesian networks for multi-band automatic speech recognition

Khalid Daoudi, Dominique Fohr, Antoine Christophe

Journal: Computer Speech & Language; Year: 2003; Vol: 17 (2-3); Pages: 263-285