JOURNAL ARTICLE

Formant frequency estimation of high-pitched speech by homomorphic prediction

Mohammad Shahidur RahmanTetsuya Shimamura

Year: 2005 Journal:   Nippon Onkyo Gakkaishi/Acoustical science and technology/Nihon Onkyo Gakkaishi Vol: 26 (6)Pages: 502-510   Publisher: Acoustical Society of Japan

Abstract

The conventional model of the linear prediction analysis suffers from difficulties in estimating vocal tract characteristics of high-pitched speakers. This is because the autocorrelation function used by the autocorrelation method of linear prediction for estimating autoregressive coefficients is actually an “aliased” version of that of the vocal tract impulse response. This “aliasing” occurs due to the periodic nature of voiced speech. Generally it is accepted that homomorphic filtering can be used to obtain an estimate of vocal tract impulse response which is free from periodicity. Thus linear prediction of the resulting vocal tract impulse response (referred to as homomorphic prediction) is expected to be free from variations of fundamental frequencies. To our knowledge any experimental study, however, has not yet appeared on the suitability of this method for analyzing high-pitched speech. This paper presents a detail study on the prospects of homomorphic prediction as a formant tracking tool especially for high-pitched speech where linear prediction fails to obtain accurate estimation. The formant frequencies estimated using the proposed method are found to be accurate by more than an order of magnitude compared to the conventional procedure. The accuracy of formant estimation is verified on synthetic vowels for a wide range of pitch periods covering typical male and high-pitched female speakers. The validity of the proposed method is also examined by inspecting the spectral envelopes of natural speech spoken by high-pitched female speakers. We noticed that almost all the previous methods dealing with this limitation of linear prediction are based on the covariance technique where the obtained AR filter can be unstable. The solutions obtained by the current method are guaranteed to be stable which makes it superior for many speech analysis applications.

Keywords:
Formant Speech recognition Estimation Computer science Homomorphic encryption Engineering Vowel

Metrics

13
Cited By
0.32
FWCI (Field Weighted Citation Impact)
18
Refs
0.55
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Advanced Adaptive Filtering Techniques
Physical Sciences →  Engineering →  Computational Mechanics
Acoustic Wave Phenomena Research
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Formant frequency estimation of high-pitched vowels using weighted linear prediction

Paavo AlkuJouni PohjalainenMartti VainioAnne-Maria LaukkanenBrad H. Story

Journal:   The Journal of the Acoustical Society of America Year: 2013 Vol: 134 (2)Pages: 1295-1313
JOURNAL ARTICLE

Formant estimation of high-pitched noisy speech using homomorphic deconvolution of higher-order group delay spectrum

Husne Ara ChowdhuryMohammad Shahidur Rahman

Journal:   Nippon Onkyo Gakkaishi/Acoustical science and technology/Nihon Onkyo Gakkaishi Year: 2023 Vol: 44 (2)Pages: 84-92
JOURNAL ARTICLE

Formant estimation of high fundamental frequency speech

Corine Bickley

Journal:   The Journal of the Acoustical Society of America Year: 1986 Vol: 79 (S1)Pages: S38-S38
JOURNAL ARTICLE

Group Delay Moment of Cepstrum for Formant Estimation of High-Pitched Noisy Speech

Husne Ara Chowdhury

Journal:   SUST Journal of Science and Technology (SUST JST) Year: 2025 Vol: 33 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.