Zefeng Chen, Zhijiang Li, Y. Y. Xue, L. Zhang
ABSTRACT
Camouflaged object detection (COD) aims to identify and segment objects that closely resemble and are seamlessly integrated into their surrounding environments, making it a challenging task in computer vision. COD is constrained by the limited availability of training data and annotated samples, and most carefully designed COD models exhibit diminished performance under low-data conditions. In recent years, there has been increasing interest in leveraging foundation models, which have demonstrated robust general capabilities and superior generalisation performance, to address COD challenges. This work proposes a knowledge-guided domain adaptation (KGDA) approach to tackle the data scarcity problem in COD. The method utilises knowledge descriptions generated by multimodal large language models (MLLMs) for camouflaged images, aiming to enhance the model's comprehension of semantic objects and camouflaged scenes through highly abstract and generalised knowledge representations. To resolve ambiguities and errors in the generated text descriptions, a multi-level knowledge aggregation (MLKG) module is devised, which consolidates consistent semantic knowledge and forms multi-level semantic knowledge features. To incorporate this semantic knowledge into the visual foundation model, we introduce a knowledge-guided semantic enhancement adaptor (KSEA) that integrates the semantic knowledge of camouflaged objects while preserving the original knowledge of the foundation model. Extensive experiments demonstrate that our method surpasses 19 state-of-the-art approaches and exhibits strong generalisation capabilities even with limited annotated data.
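The abstract gives no implementation details for the KSEA adaptor. The following is a minimal sketch, under the assumption of a PyTorch-style frozen vision backbone, of one common way such an adaptor could fuse text-derived knowledge tokens into visual features while a residual path preserves the backbone's original representation. All names here (KnowledgeAdaptor, d_bottleneck, the token shapes) are hypothetical illustrations, not the authors' implementation.

```python
# Hypothetical sketch of a knowledge-guided adaptor; not the paper's code.
import torch
import torch.nn as nn

class KnowledgeAdaptor(nn.Module):
    """Bottleneck adapter that injects text-derived knowledge features
    into a frozen vision backbone block via cross-attention."""
    def __init__(self, d_model: int = 768, d_bottleneck: int = 64, n_heads: int = 8):
        super().__init__()
        self.down = nn.Linear(d_model, d_bottleneck)       # compress visual tokens
        self.know_proj = nn.Linear(d_model, d_bottleneck)  # project knowledge tokens
        self.cross_attn = nn.MultiheadAttention(
            d_bottleneck, n_heads, batch_first=True)       # visual queries attend to knowledge
        self.up = nn.Linear(d_bottleneck, d_model)         # restore model dimension
        self.act = nn.GELU()

    def forward(self, vis_tokens: torch.Tensor, know_tokens: torch.Tensor) -> torch.Tensor:
        # vis_tokens:  (B, N, d_model) features from a frozen backbone block
        # know_tokens: (B, K, d_model) aggregated semantic knowledge features
        q = self.act(self.down(vis_tokens))
        kv = self.know_proj(know_tokens)
        fused, _ = self.cross_attn(q, kv, kv)
        # Residual connection keeps the foundation model's original knowledge intact
        return vis_tokens + self.up(fused)

# Example: inject 4 knowledge tokens into 196 visual tokens
adaptor = KnowledgeAdaptor()
vis = torch.randn(2, 196, 768)
know = torch.randn(2, 4, 768)
out = adaptor(vis, know)  # shape: (2, 196, 768)
```

The bottleneck-plus-residual design mirrors standard parameter-efficient adapters: only the small adaptor is trained, which fits the paper's low-data motivation.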