JOURNAL ARTICLE

Task-Oriented Multi-User Semantic Communications

Huiqiang XieZhijin QinXiaoming TaoKhaled B. Letaief

Year: 2022 Journal:   IEEE Journal on Selected Areas in Communications Vol: 40 (9)Pages: 2584-2597   Publisher: Institute of Electrical and Electronics Engineers

Abstract

While semantic communications have shown the potential in the case of single-modal single-users, its applications to the multi-user scenario remain limited. In this paper, we investigate deep learning (DL) based multi-user semantic communication systems for transmitting single-modal data and multimodal data, respectively. We adopt three intelligent tasks, including, image retrieval, machine translation, and visual question answering (VQA) as the transmission goal of semantic communication systems. We propose a Transformer based framework to unify the structure of transmitters for different tasks. For the single-modal multi-user system, we propose two Transformer based models, named, DeepSC-IR and DeepSC-MT, to perform image retrieval and machine translation, respectively. In this case, DeepSC-IR is trained to optimize the distance in embedding space between images and DeepSC-MT is trained to minimize the semantic errors by recovering the semantic meaning of sentences. For the multimodal multi-user system, we develop a Transformer enabled model, named, DeepSC-VQA, for the VQA task by extracting text-image information at the transmitters and fusing it at the receiver. In particular, a novel layer-wise Transformer is designed to help fuse multimodal data by adding connection between each of the encoder and decoder layers. Numerical results show that the proposed models are superior to traditional communications in terms of the robustness to channels, computational complexity, transmission delay, and the task-execution performance at various task-specific metrics.

Keywords:
Computer science Transformer Robustness (evolution) Encoder Machine translation Artificial intelligence Embedding Natural language processing Speech recognition Machine learning

Metrics

341
Cited By
42.09
FWCI (Field Weighted Citation Impact)
60
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Wireless Signal Modulation Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Task-Oriented Multi-User Semantic Communications for VQA

Huiqiang XieZhijin QinGeoffrey Ye Li

Journal:   IEEE Wireless Communications Letters Year: 2021 Vol: 11 (3)Pages: 553-557
JOURNAL ARTICLE

Task-Oriented Explainable Semantic Communications

Shuai MaWeining QiaoYoulong WuHang LiGuangming ShiDahua GaoYuanming ShiShiyin LiNaofal Al‐Dhahir

Journal:   IEEE Transactions on Wireless Communications Year: 2023 Vol: 22 (12)Pages: 9248-9262
JOURNAL ARTICLE

Task-Oriented Integrated Sensing and Semantic Communications for Multi-Device Video Analytics

Yinghui HeXin LiJun Luo

Journal:   IEEE Transactions on Mobile Computing Year: 2025 Pages: 1-14
JOURNAL ARTICLE

Stacked Intelligent Metasurfaces for Task-Oriented Semantic Communications

Guojun HuangJiancheng AnZhaohui YangLu GanMehdi BennisMérouane Debbah

Journal:   IEEE Wireless Communications Letters Year: 2024 Vol: 14 (2)Pages: 310-314
© 2026 ScienceGate Book Chapters — All rights reserved.