JOURNAL ARTICLE

Dual-Stream Intermediate Fusion Network for Image Forgery Localization

Caiping YanRenhai LiuHong LiJinghui WuHaojie Pan

Year: 2024 Journal:   IEEE Access Vol: 12 Pages: 90511-90524   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Nowadays, powerful image editing applications not only simplify image processing significantly but also enhance the realism of processed digital images. However, this convenience has presented unprecedented challenges in verifying the authenticity of images. Although existing methods have achieved significant results in image forgery localization, most of them struggle to obtain satisfactory performance when dealing with tampered areas of various sizes, especially for large-scale tampered regions. To enhance the localization performance for various types and sizes of tampered regions, we propose a novel dual-stream intermediate fusion network for image forgery localization, named DIF-Net. This network adopts an encoder-decoder architecture composed of an adaptive convolutional pyramid and dual-stream intermediate fusion modules. Specifically, the former extracts multi-scale information from different depths by utilizing two depth-wise strip convolutions instead of standard large-kernel convolutions. Moreover, during feature fusion, learnable parameters are employed to dynamically allocate weights to each feature scale, so that the network can adaptively select the most relevant features at the target scale. The latter effectively reduces category information differences between the two feature streams by utilizing two learnable intermediate representations to model channel and spatial consistency in the dual-stream features. Compared to traditional and previous deep learning methods, the DIF-Net can generate high-quality prediction masks with fewer parameters. Through extensive experimental validation, our DIF-Net demonstrates outstanding performance on various datasets, surpassing the state-of-the-art forgery localization methods currently available. On the commonly used CASIA2 dataset, our DIF-Net achieves an improvement of 3.3% in F1 and 2.4% in AUC compared to previous methods.

Keywords:
Computer science Dual (grammatical number) Computer vision Image fusion Artificial intelligence Image (mathematics) Pattern recognition (psychology)

Metrics

3
Cited By
1.59
FWCI (Field Weighted Citation Impact)
48
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Digital Media Forensic Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Dual-stream and dual-branch generative adversarial network for image forgery localization

Ruyi Bai

Journal:   Knowledge-Based Systems Year: 2025 Vol: 325 Pages: 113970-113970
JOURNAL ARTICLE

DMFF-Net: Double-stream multilevel feature fusion network for image forgery localization

Xiang XiaLi Chao SuShi Ping WangXiaoyan Li

Journal:   Engineering Applications of Artificial Intelligence Year: 2023 Vol: 127 Pages: 107200-107200
JOURNAL ARTICLE

DSSE-net: dual stream skip edge-enhanced network with forgery loss for image forgery localization

Aokun ZhengTianqiang HuangWei HuangLiqing HuangFeng YeHaifeng Luo

Journal:   International Journal of Machine Learning and Cybernetics Year: 2023 Vol: 15 (6)Pages: 2323-2335
© 2026 ScienceGate Book Chapters — All rights reserved.