JOURNAL ARTICLE

Robust Speech Recognition Parameters for Emotional Variation

Weon-Goo Kim

Year: 2005 Journal:   Journal of Korean institute of intelligent systems Vol: 15 (6)Pages: 655-660   Publisher: Korean Institute of Intelligent Systems

Abstract

본 논문에서는 인간의 감정 변화에 강인한 음성 인식 기술 개발을 목표로 하여 감정 변화의 영향을 적게 받는 음성 인식시스템의 특징 파라메터에 관한 연구를 수행하였다. 이를 위하여 우선 다양한 감정이 포함된 음성 데이터베이스를 사용하여 감정 변화가 음성 인식 시스템의 성능에 미치는 영향에 관한 연구와 감정 변화의 영향을 적게 받는 음성 인식 시스템의 특징 파라메터에 관한 연구를 수행하였다. 본 연구에서는 LPC 켑스트럼 계수, 멜 켑스트럼 계수, 루트 켑스트럼 계수, PLP 계수와 RASTA 처리를 한 멜 켑스트럼 계수와 음성의 에너지를 사용하였다 또한 음성에 포함된 편의(bias)를 제거하는 방법으로 CMS와 SBR 방법을 사용하여 그 성능을 비교하였다. 실험 결과에서 RASTA 멜 켑스트럼과 델타 켑스트럼을 사용하고 신초편의 제거 방법으로 CMS를 사용한 경우에 HMM 기반의 화자독립 단어 인식기의 오차가 $7.05\%$로 가장 우수한 성능을 나타내었다. 이러한 것은 멜 켑스트럼을 사용한 기준시스템과 비교하여 $59\%$정도 오차가 감소된 것이다. This paper studied the feature parameters less affected by the emotional variation for the development of the robust speech recognition technologies. For this purpose, the effect of emotional variation on the speech recognition system and robust feature parameters of speech recognition system were studied using speech database containing various emotions. In this study, LPC cepstral coefficient, met-cepstral coefficient, root-cepstral coefficient, PLP coefficient, RASTA met-cepstral coefficient were used as a feature parameters. And CMS and SBR method were used as a signal bias removal techniques. Experimental results showed that the HMM based speaker independent word recognizer using RASTA met-cepstral coefficient :md its derivatives and CMS as a signal bias removal showed the best performance of $7.05\%$ word error rate. This corresponds to about a $52\%$ word error reduction as compare to the performance of baseline system using met - cepstral coefficient.

Keywords:
Speech recognition Cepstrum Mel-frequency cepstrum Hidden Markov model Word (group theory) Computer science Feature (linguistics) Pattern recognition (psychology) Variation (astronomy) Word error rate SIGNAL (programming language) Coefficient of variation Feature extraction Artificial intelligence Mathematics Statistics

Metrics

2
Cited By
0.00
FWCI (Field Weighted Citation Impact)
5
Refs
0.03
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Internet of Things and Social Network Interactions
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Robust Speech Parameters for the Emotional Speech Recognition

Guehyun LeeWeon-Goo Kim

Journal:   Journal of Korean institute of intelligent systems Year: 2012 Vol: 22 (6)Pages: 681-686
JOURNAL ARTICLE

Speech Parameters for the Robust Emotional Speech Recognition

Weon-Goo Kim

Journal:   Journal of Institute of Control Robotics and Systems Year: 2010 Vol: 16 (12)Pages: 1137-1142
JOURNAL ARTICLE

Robust Speech Recognition using Vocal Tract Normalization for Emotional Variation

Weon-Goo KimHyun-Jin Bang

Journal:   Journal of Korean institute of intelligent systems Year: 2009 Vol: 19 (6)Pages: 773-778
JOURNAL ARTICLE

Speech Features Extraction Techniques for Robust Emotional Speech Analysis/Recognition

K. M. Shiva PrasadG. N. Kodanda RamaiahM. Manjunatha

Journal:   Indian Journal of Science and Technology Year: 2017 Vol: 10 (3)
© 2026 ScienceGate Book Chapters — All rights reserved.