Chieko Furuichi, Katsura Aizawa, Kazuhiko Inoue
This paper discusses speech recognition based on a new statistical phoneme segment model, trained on phoneme parameters derived from automatically extracted phoneme segments. The proposed system operates as follows. In preprocessing before recognition, the phoneme boundaries are detected by segmentation. The phonemes are then discriminated using a stochastic phoneme segment model, and a phoneme segment lattice with scores is constructed. Next, speech recognition is performed by matching symbol sequences to dictionary items. The segmentation system that is employed can infer phoneme boundaries with high accuracy. This helps to eliminate unnecessary parameters, leaving only the feature parameters that are effective in separating phonemes. In other words, the phoneme recognition problem in continuous speech can be reduced to a discrimination problem, so that a speaker-independent model can be constructed from a relatively small amount of training data. The stochastic phoneme segment model is trained with samples extracted from a phoneme-balanced word set of 4920 words uttered by 10 speakers. In a recognition experiment with 6709 words uttered by 63 nontraining speakers, a recognition rate of 92.6% was obtained as the average over all speakers, using a word dictionary of 212 words. © 2000 Scripta Technica, Syst Comp Jpn, 31(10): 89–98, 2000
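The pipeline described in the abstract (segment the input, score phoneme candidates per segment into a lattice, then match candidate symbol sequences against a word dictionary) can be sketched roughly as follows. This is a minimal illustrative sketch, not the paper's implementation: the lattice representation, scores, and the `best_word` helper are all assumptions introduced here for illustration.

```python
def best_word(lattice, dictionary):
    """Pick the dictionary word whose phoneme sequence has the
    highest total score over the segment lattice.

    lattice: list with one {phoneme: score} dict per detected segment
    dictionary: {word: [phoneme, ...]} mapping
    """
    best, best_score = None, float("-inf")
    for word, phonemes in dictionary.items():
        if len(phonemes) != len(lattice):
            continue  # require one phoneme per detected segment
        score, ok = 0.0, True
        for seg, ph in zip(lattice, phonemes):
            if ph not in seg:
                ok = False  # this word's phoneme was not hypothesized here
                break
            score += seg[ph]  # accumulate log-likelihood-style scores
        if ok and score > best_score:
            best, best_score = word, score
    return best

# Toy lattice: one candidate-score dict per detected segment.
lattice = [{"k": -0.2, "g": -1.1}, {"a": -0.1}, {"t": -0.3, "d": -0.9}]
dictionary = {"kat": ["k", "a", "t"], "gad": ["g", "a", "d"]}
print(best_word(lattice, dictionary))  # -> kat
```

In the paper's full system the per-segment scores would come from the stochastic phoneme segment model rather than being fixed as above, and matching would tolerate segmentation errors rather than requiring an exact one-phoneme-per-segment alignment.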