JOURNAL ARTICLE

Speech recognition and intelligent translation under multimodal human–computer interaction system

Danhua HuangShuaiqiu Xiang

Year: 2024 Journal:   Journal of Intelligent Systems Vol: 33 (1)   Publisher: IlmuKomputer.Com

Abstract

Abstract The traditional translation robot is limited to the translation of single-mode text images and text videos, which has the problem of low translation accuracy. Therefore, speech recognition and intelligent translation in multimodal human–computer interaction (HCI) system are proposed. First, the network structure of speech recognition model in multi-channel HCI system is established, and the multi-head self-attention mechanism is constructed. Then, the artificial intelligence voice wake-up function is designed, and a multimodal machine translation model is constructed. On this basis, selective attention is added to obtain visual recognition of perceived text, and the decoder is used for multimodal gating fusion to realize the output of encoder translation results. Experimental results show that this method has high BLUE value and high translation accuracy.

Keywords:
Computer science Speech translation Translation (biology) Speech recognition Natural language processing Artificial intelligence Human–computer interaction Translation system Machine translation

Metrics

4
Cited By
2.56
FWCI (Field Weighted Citation Impact)
25
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence
Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction
Subtitles and Audiovisual Media
Social Sciences →  Arts and Humanities →  Language and Linguistics

Related Documents

JOURNAL ARTICLE

Intelligent System of Interaction & Recognition for Computer-Human Interfaces

Siddhant Santosh Toggi

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2024
JOURNAL ARTICLE

Intelligent System of Interaction & Recognition for Computer-Human Interfaces

Siddhant Santosh Toggi

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2024
© 2026 ScienceGate Book Chapters — All rights reserved.