JOURNAL ARTICLE

TFNet: Transformer-Based Multi-Scale Feature Fusion Forest Fire Image Detection Network

Hongying LiuFuquan ZhangYiqing XuJunling WangHong LuWei WeiJun Zhu

Year: 2025 Journal:   Fire Vol: 8 (2)Pages: 59-59   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Forest fires pose a severe threat to ecological environments and the safety of human lives and property, making real-time forest fire monitoring crucial. This study addresses challenges in forest fire image object detection, including small fire targets, sparse smoke, and difficulties in feature extraction, by proposing TFNet, a Transformer-based multi-scale feature fusion detection network. TFNet integrates several components: SRModule, CG-MSFF Encoder, Decoder and Head, and WIOU Loss. The SRModule employs a multi-branch structure to learn diverse feature representations of forest fire images, utilizing 1 × 1 convolutions to generate redundant feature maps and enhance feature diversity. The CG-MSFF Encoder introduces a context-guided attention mechanism combined with adaptive feature fusion (AFF), enabling effective multi-scale feature fusion by reweighting features across layers and extracting both local and global representations. The Decoder and Head refine the output by iteratively optimizing target queries using self- and cross-attention, improving detection accuracy. Additionally, the WIOU Loss assigns varying weights to the IoU metric for predicted versus ground truth boxes, thereby balancing positive and negative samples and improving localization accuracy. Experimental results on two publicly available datasets, D-Fire and M4SFWD, demonstrate that TFNet outperforms comparative models in terms of precision, recall, F1-Score, mAP50, and mAP50–95. Specifically, on the D-Fire dataset, TFNet achieved metrics of 81.6% precision, 74.8% recall, an F1-Score of 78.1%, mAP50 of 81.2%, and mAP50–95 of 46.8%. On the M4SFWD dataset, these metrics improved to 86.6% precision, 83.3% recall, an F1-Score of 84.9%, mAP50 of 89.2%, and mAP50–95 of 52.2%. The proposed TFNet offers technical support for developing efficient and practical forest fire monitoring systems.

Keywords:
Computer science Artificial intelligence Transformer Fire detection Feature (linguistics) Scale (ratio) Pattern recognition (psychology) Environmental science Remote sensing Computer vision Geology Engineering Cartography Geography Voltage Electrical engineering Architectural engineering

Metrics

13
Cited By
46.20
FWCI (Field Weighted Citation Impact)
46
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Fire Detection and Safety Systems
Physical Sciences →  Engineering →  Safety, Risk, Reliability and Quality
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Fire effects on ecosystems
Physical Sciences →  Environmental Science →  Global and Planetary Change

Related Documents

BOOK-CHAPTER

TFNet: Transformer Fusion Network for Ultrasound Image Segmentation

Tao WangZhihui LaiHeng Kong

Lecture notes in computer science Year: 2022 Pages: 314-325
JOURNAL ARTICLE

Image Recognition based on Multi-scale Feature Fusion Transformer

Zhefeng ZhuKe QiWenbin ChenYicong ZhouPeiyue LiZhenxian Liu

Journal:   2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA) Year: 2022 Pages: 7-13
JOURNAL ARTICLE

Multi-scale enhanced contextual transformer network for forest fire detection

Changhui DingHaiyan LiYajie LiuBingbing HeXun LangGuanbo Wang

Journal:   Digital Signal Processing Year: 2025 Vol: 172 Pages: 105850-105850
JOURNAL ARTICLE

Transformer-based multi-scale feature fusion network for remote sensing change detection

Shike LiangZhen HuaJinjiang Li

Journal:   Journal of Applied Remote Sensing Year: 2022 Vol: 16 (04)
© 2026 ScienceGate Book Chapters — All rights reserved.