JOURNAL ARTICLE

Predicting F0 and voicing from NAM-captured whispered speech

Abstract

The NAM-to-speech conversion proposed by Toda and colleagues which converts Non-Audible Murmur (NAM) to audible speech by statistical mapping trained using aligned corpora is a very promising technique, but its performance is still insufficient, mainly due to the difficulty in estimating F 0 of the transformed voice from unvoiced speech.In this paper, we propose a method to improve F 0 estimation and voicing decision in a NAM-to-speech conversion system based on Gaussian Mixture Models (GMM) applied to whispered speech.Instead of combining voicing decision and F 0 estimation in a single GMM, a simple feed-forward neural network is used to detect voiced segments in the whisper while a GMM estimates a continuous melodic contour based on training voiced segments.The error rate for the voiced/unvoiced decision of the network is 6.8% compared to 9.2% with the original system.Our proposal benefits also to F 0 estimation error.

Keywords:
Voice Speech recognition Computer science

Metrics

10
Cited By
2.00
FWCI (Field Weighted Citation Impact)
17
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

DISSERTATION

Artificial voicing of whispered speech

Patrícia Cristina Ramalho de Oliveira

University:   Open Repository of the University of Porto (University of Porto) Year: 2015
JOURNAL ARTICLE

Perception of final-consonant “voicing” in whispered speech.

Yana D. GilichinskayaWinifred Strange

Journal:   The Journal of the Acoustical Society of America Year: 2011 Vol: 129 (4_Supplement)Pages: 2420-2420
DISSERTATION

Context Analysis for Voicing Decision in Whispered Speech

Tavares, Gonçalo Amaral

University:   Open Repository of the University of Porto (University of Porto) Year: 2024
JOURNAL ARTICLE

Final consonant voicing and vowel height contrasts in whispered speech.

Yana D. GilichinskayaWinifred Strange

Journal:   The Journal of the Acoustical Society of America Year: 2008 Vol: 124 (4_Supplement)Pages: 2558-2558
© 2026 ScienceGate Book Chapters — All rights reserved.