Abstract

This paper presents results concerning the exploitation of visual cues in the perception of Mandarin tones. The lower part of a female speaker's face was recorded on digital video as she uttered 25 sets of syllabic tokens covering the four different tones of Mandarin. Then in a perception study the audio sound track alone, as well an audio plus video condition were presented to native Mandarin speakers who were required to decide which tone they perceived. Audio was presented in various conditions: clear, babble-noise masked at different SNR levels, as well as devoiced and amplitudemodulated noise conditions using LPC resynthesis. In the devoiced and the clear audio conditions, there is little augmentation of audio alone due to the addition of video. However, the addition of visual information did significantly improve perception in the babble-noise masked condition, and this effect increased with decreasing SNR. This outcome suggests that the improvement in noise-masked conditions is not due to additional information in the video per se, but rather to an effect of early integration of acoustic and visual cues facilitating auditory-visual speech perception.

Keywords:
Mandarin Chinese Speech recognition Perception Noise (video) Tone (literature) Syllabic verse Computer science Sensory cue Speech perception Psychology Computer vision Artificial intelligence Linguistics

Metrics

31
Cited By
1.23
FWCI (Field Weighted Citation Impact)
8
Refs
0.81
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multisensory perception and integration
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Hearing Loss and Rehabilitation
Life Sciences →  Neuroscience →  Cognitive Neuroscience

Related Documents

JOURNAL ARTICLE

Incongruent visual cues affect the perception of Mandarin vowel but not tone

Shanhu HongRui WangBiao Zeng

Journal:   Frontiers in Psychology Year: 2023 Vol: 13 Pages: 971979-971979
JOURNAL ARTICLE

Modelling Mandarin tone perception-production link through critical perceptual cues

Keith K. W. LeungYue Wang

Journal:   The Journal of the Acoustical Society of America Year: 2024 Vol: 155 (2)Pages: 1451-1468
JOURNAL ARTICLE

Audio-visual perception of mandarin tone in clear speech

Yuyu ZengKeith K. W. LeungYue WangAllard JongmanJoan A. Sereno

Journal:   The Journal of the Acoustical Society of America Year: 2017 Vol: 142 (4_Supplement)Pages: 2727-2727
JOURNAL ARTICLE

Auditory-visual perception of Mandarin lexical tone using 3D display

Dyball, Alyssa

Journal:   Macquarie University Year: 2022
© 2026 ScienceGate Book Chapters — All rights reserved.