JOURNAL ARTICLE

Phoneme recognition by phoneme filter neural networks

Abstract

A phoneme filter neural network (PFN) approach to vowel recognition is described. The PFN is a multilayer neural network with fewer hidden units than input units prepared for each of the phoneme categories. Each network is trained as identity mapping by speech data belonging to one phoneme category. In the recognition process, the similarity between the input data and output data is computed for each network. The results of an experiment involving the Japanese vowel recognition task showed that the PFN recognition rates for the top two or more choices are higher than those of a conventional three-layer neural network and the PFN outputs represented candidate likelihoods. It was also confirmed that the PFN has a mapping ability and recognition performance superior to those of the linear K-L transformation method because of the nonlinearity of the PFN.< >

Keywords:
Artificial neural network Speech recognition Computer science Similarity (geometry) Filter (signal processing) Vowel Transformation (genetics) Task (project management) Pattern recognition (psychology) Artificial intelligence Identity (music) Process (computing) Time delay neural network Natural language processing Image (mathematics) Engineering Computer vision Acoustics

Metrics

5
Cited By
0.44
FWCI (Field Weighted Citation Impact)
18
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

A new approach to phoneme recognition by phoneme filter neural networks

Masami NakamuraKazuhiko TsudaJun‐ichi Aoe

Journal:   Information Sciences Year: 1996 Vol: 90 (1-4)Pages: 109-119
JOURNAL ARTICLE

Recurrent neural networks for phoneme recognition

Takuya KoizumiMikio MoriShuji TaniguchiMitsutoshi Maruya

Journal:   4th International Conference on Spoken Language Processing (ICSLP 1996) Year: 1996 Pages: 326-329
© 2026 ScienceGate Book Chapters — All rights reserved.