JOURNAL ARTICLE

AUDIOVISUAL SPEECH PERCEPTION IN DIOTIC AND DICHOTIC LISTENING CONDITIONS

Abstract

Background Speech perception is multisensory, relying on auditory as well as visual information from the articulators. Watching articulatory gestures which are either congruent or incongruent with the speech audio can change the auditory percept, indicating that there is a complex integration of auditory and visual stimuli. A speech segment is comprised of distinctive features, notably voice onset time (VOT) and place of articulation (POA). Understanding the importance of each of these features for audiovisual (AV) speech perception is critical. The present study investigated the perception of AV consonant-vowel (CV) syllables with various VOTs and POAs under two conditions: diotic incongruent and dichotic congruent. Material and methods AV stimuli comprised diotic and dichotic CV syllables with stop consonants (bilabial /pa/ and /ba/; alveolar /ta/ and /da/; and velar /ka/ and /ɡa/) presented with congruent and incongruent video CV syllables with stop consonants. There were 40 righthanded normal hearing young adults (20 females, mean age 23 years, <i>SD</i> = 2.4 years) and 20 males (mean age 24 years, <i>SD</i> = 2.1 years) who participated in the experiment. Results In the diotic incongruent AV condition, short VOT (voiced CV syllables) of the visual segments were identified when auditory segments had a CV syllable with long VOT (unvoiced CV syllables). In the dichotic congruent AV condition, there was an increase in identification of the audio segment when the subject was presented with a video segment congruent to either ear, in this way overriding the otherwise presented ear advantage in dichotic listening. Distinct visual salience of bilabial stop syllables had greater visual influence (observed as greater identification scores) than velar stop syllables and thus overrode the acoustic dominance of velar syllables. Conclusions The findings of the present study have important implications for understanding the perception of diotic incongruent and dichotic congruent audiovisual CV syllables in which the stop consonants have different VOT and POA combinations. Earlier findings on the effect of VOT on dichotic listening can be extended to AV speech having dichotic auditory segments.

Keywords:
Dichotic listening Audiology Psychology Syllable Voice-onset time Perception Percept Consonant Vowel Speech recognition Computer science Medicine

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
59
Refs
0.08
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Multisensory perception and integration
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Hearing Loss and Rehabilitation
Life Sciences →  Neuroscience →  Cognitive Neuroscience

Related Documents

JOURNAL ARTICLE

Frequency selectivity under diotic and dichotic listening conditions

Mark F. YamsDonald E. Robinson

Journal:   The Journal of the Acoustical Society of America Year: 1978 Vol: 64 (S1)Pages: S36-S36
JOURNAL ARTICLE

Differences between psychophysical ‘‘suppression effects’’ under diotic and dichotic listening conditions

Mark F. Yama

Journal:   The Journal of the Acoustical Society of America Year: 1982 Vol: 72 (5)Pages: 1380-1383
JOURNAL ARTICLE

Auditory Spectral Integration in Nontraditional Speech Cues in Diotic and Dichotic Listening

Robert A. FoxEwa Jacewicz

Journal:   Perceptual and Motor Skills Year: 2010 Vol: 111 (2)Pages: 543-558
© 2026 ScienceGate Book Chapters — All rights reserved.