JOURNAL ARTICLE

Maximum-likelihood harmonic matching for fundamental frequency estimation

Xavier RodetBoris Doval

Year: 1992 Journal:   The Journal of the Acoustical Society of America Vol: 92 (4_Supplement)Pages: 2428-2429   Publisher: Acoustical Society of America

Abstract

This presentation deals with the estimation of fundamental frequency (f0) of pseudoperiodic sound signals with important results for polyphonic frequency tracking, and voice separation. Given a set of candidate partials in the signal, the estimation of f0 is taken in the sense of finding the optimal period duration(s) according to a criterion of maximum-likelihood harmonic matching. Excellent results have been obtained on large databases of speech (40 mn) and music [B. Doval and X. Rodet, Proc. IEEE-ICASSP, Toronto, May (1991)]. The algorithm has been implemented at IRCAM to run in real time for live performance frequency tracking. Developments are in several directions. A combined estimation of f0 and of a spectral envelope improves both estimations. Most important is the estimation of the ‘‘apriori’’ distributions of the different random variables on a learning set. Finally, a hidden Markov model tracks f0 trajectories between adjacent frames. The first experiments of polyphonic frequency tracking and voice separation are very promising. The model can be transposed directly to the maximum-likelihood estimation of several harmonic sounds since it already considers more than one f0 value. a)Presently on sabbatical at Ctr. for New Music and Audio Technol. (CNMAT), Univ. of California at Berkeley, 1750 Arch St., Berkeley, CA 94709.

Keywords:
Polyphony Harmonic Fundamental frequency Computer science Set (abstract data type) Maximum likelihood Hidden Markov model Envelope (radar) SIGNAL (programming language) Speech recognition Spectral envelope A priori and a posteriori Matching (statistics) Algorithm Acoustics Tracking (education) Mathematics Statistics Telecommunications Physics

Metrics

3
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.11
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music Technology and Sound Studies
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.