JOURNAL ARTICLE

Panoramic Video Salient Object Detection with Ambisonic Audio Guidance

Xiang LiHaoyuan CaoShijie ZhaoJunlin LiLi ZhangBhiksha Raj

Year: 2023 Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Vol: 37 (2)Pages: 1424-1432   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Video salient object detection (VSOD), as a fundamental computer vision problem, has been extensively discussed in the last decade. However, all existing works focus on addressing the VSOD problem in 2D scenarios. With the rapid development of VR devices, panoramic videos have been a promising alternative to 2D videos to provide immersive feelings of the real world. In this paper, we aim to tackle the video salient object detection problem for panoramic videos, with their corresponding ambisonic audios. A multimodal fusion module equipped with two pseudo-siamese audio-visual context fusion (ACF) blocks is proposed to effectively conduct audio-visual interaction. The ACF block equipped with spherical positional encoding enables the fusion in the 3D context to capture the spatial correspondence between pixels and sound sources from the equirectangular frames and ambisonic audios. Experimental results verify the effectiveness of our proposed components and demonstrate that our method achieves state-of-the-art performance on the ASOD60K dataset.

Keywords:
Ambisonics Computer science Computer vision Salient Artificial intelligence Context (archaeology) Focus (optics) Object (grammar) Block (permutation group theory) Object detection Pixel Image fusion Spatial contextual awareness Computer graphics (images) Pattern recognition (psychology) Image (mathematics) Engineering Geography

Metrics

9
Cited By
0.72
FWCI (Field Weighted Citation Impact)
93
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multisensory perception and integration
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Olfactory and Sensory Function Studies
Life Sciences →  Neuroscience →  Sensory Systems

Related Documents

BOOK-CHAPTER

Audio-Visual Salient Object Detection

Shuaiyang ChengLiang SongJingjing TangShihui Guo

Lecture notes in computer science Year: 2021 Pages: 510-521
JOURNAL ARTICLE

Mutual-Guidance Transformer-Embedding Network for Video Salient Object Detection

Dingyao MinChao ZhangYukang LuKeren FuQijun Zhao

Journal:   IEEE Signal Processing Letters Year: 2022 Vol: 29 Pages: 1674-1678
JOURNAL ARTICLE

Salient Object Detection based on Panoramic Images

Jian LinXinyi Ni

Journal:   Scientific journal of intelligent systems research. Year: 2025 Vol: 7 (11)Pages: 35-45
© 2026 ScienceGate Book Chapters — All rights reserved.