JOURNAL ARTICLE

HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation

Qingxin Jiang, Ying Fan, Menghan Li, Sheng Fang, Weifang Zhu, Dehui Xiang, Tao Peng, Xinjian Chen, Xun Xu, Fei Shi

Year: 2024
Journal: Biomedical Optics Express
Vol: 15 (11)
Pages: 6156-6156
Publisher: Optica Publishing Group

Abstract

Optical coherence tomography (OCT) has become the leading imaging technique for diagnosis and treatment planning in retinal diseases. Retinal OCT image segmentation involves extracting lesions and/or tissue structures to support ophthalmologists' decisions, and multi-class segmentation is commonly needed. Because the target regions often spread widely inside the retina, and the intensities and locations of different categories can be close, a good segmentation network must combine global modeling capability with the ability to capture fine details. To address the challenge of capturing global and local features simultaneously, we propose HyFormer, an efficient, lightweight, and robust hybrid network architecture. The proposed architecture features parallel Transformer and convolutional encoders for independent feature capture. A multi-scale gated attention block and a group positional embedding block are introduced within the Transformer encoder to enhance feature extraction. Feature integration is achieved in the decoder, which is composed of the proposed three-path fusion modules. A class activation map-based cross-entropy loss function is also proposed to improve segmentation results. Evaluations are performed on a private dataset with myopic traction maculopathy lesions and on the public AROI dataset for retinal layer and lesion segmentation with age-related degeneration. The results demonstrate HyFormer's superior segmentation performance and robustness compared to existing methods, showing promise for accurate and efficient OCT image segmentation.
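The abstract does not say how the class activation map enters the loss, so the following is only a generic illustration of a pixel-wise, multi-class cross-entropy with a per-pixel weight map, not the authors' formulation. The function names and the `weights` argument are hypothetical; the weight map stands in for whatever a CAM-derived term might supply, and uniform weights recover ordinary cross-entropy.

```python
import math


def log_softmax(scores):
    """Numerically stable log-softmax over one pixel's class scores."""
    m = max(scores)
    log_sum = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_sum for s in scores]


def weighted_pixel_cross_entropy(logits, labels, weights=None):
    """Weighted mean cross-entropy over pixels.

    logits:  per-pixel class scores, shape [num_pixels][num_classes]
    labels:  integer class index for each pixel
    weights: optional non-negative per-pixel weights. This is an ASSUMED
             stand-in for a CAM-derived map; the source does not define it.
    """
    if weights is None:
        weights = [1.0] * len(labels)
    if not (len(logits) == len(labels) == len(weights)):
        raise ValueError("logits, labels and weights must have equal length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")

    loss = 0.0
    for scores, label, w in zip(logits, labels, weights):
        # Negative log-likelihood of the true class, scaled by the pixel weight.
        loss -= w * log_softmax(scores)[label]
    return loss / total_weight
```

With such a weighting, pixels that receive larger weights contribute more to the average, so a map that emphasizes poorly classified regions would raise the loss there. Whether HyFormer's loss behaves this way is not stated in the abstract.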

Keywords:
Computer science, Optical coherence tomography, Segmentation, Artificial intelligence, Image processing, Image segmentation, Computer vision, Retinal, Architecture, Transformer, Image (mathematics), Medicine, Ophthalmology, Engineering, Electrical engineering

Metrics

Cited By: 4
FWCI (Field Weighted Citation Impact): 3.28
Refs: 29
Citation Normalized Percentile: 0.86

Topics

Retinal Imaging and Analysis
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging
Brain Tumor Detection and Classification
Life Sciences →  Neuroscience →  Neurology
Ocular and Laser Science Research
Health Sciences →  Medicine →  Ophthalmology