DuaDiff: Dual-Conditional Diffusion Model for Guided Thermal Image Super-Resolution

Linrui Shi; Gaochang Wu; Yingqian Wang; Yebin Liu; Tianyou Chai

doi:10.1109/tnnls.2025.3640168

ScienceGate Book Chapters

JOURNAL ARTICLE

DuaDiff: Dual-Conditional Diffusion Model for Guided Thermal Image Super-Resolution

Linrui Shi Gaochang Wu Yingqian Wang Yebin Liu Tianyou Chai

Year: 2025 Journal: IEEE Transactions on Neural Networks and Learning Systems Vol: PP Pages: 1-15 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tnnls.2025.3640168

Get Full-Text PDF Get Analytical Report

Abstract

Thermal imaging offers valuable properties, but suffers from inherently low spatial resolution, which can be enhanced using a high-resolution (HR) visible image as guidance. However, the substantial modality differences between thermal and visible images, coupled with significant resolution gaps, pose challenges to existing guided super-resolution (SR) approaches. In this article, we present dual-conditional diffusion (DuaDiff), an innovative diffusion model featuring a dual-conditioning mechanism to enhance guided thermal image SR. Unlike typical conditional diffusion models, DuaDiff integrates a learnable Laplacian pyramid to extract high-frequency details from the visible image, serving as one of the conditioning inputs. By capturing multiscale high-frequency components, DuaDiff effectively focuses on intricate textures and edges in the HR visible images, significantly enhancing thermal image fidelity. Furthermore, we project both thermal and visible images into a semantic latent space, constructing another conditioning input. Leveraging these complementary conditions, DuaDiff employs a multimodal latent feature cross-attention module to facilitate effective interaction between noise, thermal, and visible latent representations. Extensive experiments on the FLIR-ADAS and CATS datasets for $4\times $ and $8\times $ guided SR demonstrate that combining learnable Laplacian conditioning with semantic latent conditioning enables DuaDiff to surpass state-of-the-art methods in both visual quality and metric evaluation, particularly in scenarios with a large resolution gap. Besides, the applications to downstream tasks further confirm the capability of DuaDiff to recover high-fidelity semantic information. The code will be released.

Keywords:

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

Citation Normalized Percentile

Is in top 1%

Is in top 10%

DuaDiff: Dual-Conditional Diffusion Model for Guided Thermal Image Super-Resolution

Abstract

Metrics

Topics

Related Documents

Improved conditional diffusion model for image super‐resolution

Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model

Edge-guided conditional diffusion model for multi-contrast MRI super-resolution

ICDSR: Integrated Conditional Diffusion Model for Single Image Super-Resolution

Remote Sensing Image Blind Super-Resolution via a Convolutional Neural Network-Guided Conditional Diffusion Model