JOURNAL ARTICLE

Goal-Oriented Visual Semantic Navigation Using Semantic Knowledge Graph and Transformer

Zhongli WangGuohui Tian

Year: 2024 Journal:   IEEE Transactions on Automation Science and Engineering Vol: 22 Pages: 1647-1657   Publisher: Institute of Electrical and Electronics Engineers

Abstract

When determining navigation actions, it is important to design effective visual and semantic representations of the observation scenes and robust navigation strategies. The paper proposes a goal-oriented visual semantic navigation method using semantic knowledge graph and transformer. Two kinds of knowledge graphs representing the location relationship between objects are constructed, namely current knowledge graph and prior knowledge graph. The pre-constructed prior knowledge graph is periodically updated by the current knowledge graph obtained in real time, and embedded into the semantic feature vector through graph convolutional network (GCN). The semantic features and extracted scene features are jointly embedded and stored, they are jointly fed into the transformer module to explore the spatio-temporal dependencies between objects in the environment. The navigation strategy is obtained from the Asynchronous Advantage Actor-Critic (A3C) model composed of Long-Short Term Memory (LSTM) and Multi-Layer Perception (MLP). Experiments show that the knowledge graph can significantly improve the navigation performance. More importantly, our experimental results show that our method can improve the generalization ability of navigation across novel scenes and novel objects. Video can be available at https://youtu.be/ZMjNvoK2rbY. Note to Practitioners — The motivation of this work is to develop an efficient visual semantic navigation method. Conventional navigation algorithms lack semantic information and learning ability, and can not adapt to the complex unknown environments. When semantic information is included in navigation, the location relationship between objects can be obtained as a prior knowledge, which can be combined with reinforcement learning to achieve autonomous navigation of agents. In this article, a knowledge graph representing the location relationships between objects has been constructed and regularly updated in real-time. The proposed visual semantic navigation method further improves the generalization ability of navigation. This navigation method can be applied to mobile robots and deployed in many scenarios such as home, restaurant, hospitals, and even factories.

Keywords:
Computer science Transformer Semantic computing Graph Artificial intelligence Natural language processing Computer vision Semantic Web Theoretical computer science Engineering Electrical engineering

Metrics

6
Cited By
19.86
FWCI (Field Weighted Citation Impact)
36
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Geographic Information Systems Studies
Social Sciences →  Social Sciences →  Geography, Planning and Development
Semantic Web and Ontologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Robotics and Automated Systems
Physical Sciences →  Engineering →  Control and Systems Engineering

Related Documents

JOURNAL ARTICLE

Neurally-Guided Semantic Navigation in Knowledge Graph

Liang HeBin ShaoYanghua XiaoYatao LiTie‐Yan LiuEnhong ChenHuanhuan Xia

Journal:   IEEE Transactions on Big Data Year: 2018 Vol: 8 (3)Pages: 607-615
JOURNAL ARTICLE

Double Graph Attention Networks for Visual Semantic Navigation

Yunlian LyuMohammad Sadegh Talebi

Journal:   Neural Processing Letters Year: 2023 Vol: 55 (7)Pages: 9019-9040
JOURNAL ARTICLE

Goal-Oriented Semantic Communication for Wireless Visual Question Answering

Sige LiuNan LiYansha DengTony Q. S. Quek

Journal:   IEEE Journal on Selected Areas in Communications Year: 2025 Pages: 1-1
© 2026 ScienceGate Book Chapters — All rights reserved.