Performance Comparison of Vision Transformer-Based Models in Medical Image Classification

Elif Kanca; Selen Ayas; Elif Baykal Kablan; Murat Ekinci

doi:10.1109/siu59756.2023.10223892

JOURNAL ARTICLE

Year: 2023 Pages: 1-4

Abstract

In recent years, convolutional neural networks (CNNs) have achieved significant success and are frequently used in medical image analysis. However, because convolution operates on a local receptive field, CNNs are limited in learning long-range pixel dependencies. Transformer architectures have been successful in natural language processing at encoding long-range dependencies and learning more efficient feature representations. Inspired by this success, this study classifies publicly available color fundus retina, skin lesion, chest X-ray, and breast histology images using the Vision Transformer (ViT), Data-efficient Image Transformer (DeiT), Swin Transformer, and Pyramid Vision Transformer v2 (PVTv2) models and compares their classification performance. The highest accuracy on each dataset is 96.5% with DeiT on chest X-ray, 91.6% with PVTv2 on breast histology, 91.3% with PVTv2 on retina fundus, and 91.0% with Swin on skin lesion.
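The contrast between a local receptive field and long-range dependencies can be illustrated with a toy sketch. It is not the paper's method or any of the four models: the patch values, the kernel, and the helper names `conv1d_same` and `self_attention` are hypothetical, and the attention is a single head over scalar tokens with identity projections (so the usual 1/sqrt(d) scaling equals 1), whereas real ViT-style models use learned multi-head projections. The sketch changes only the last patch and checks whether the output at the first patch moves.

```python
import math

def conv1d_same(tokens, kernel):
    """Local mixing: each output sees only len(kernel) neighbouring tokens (zero-padded)."""
    half = len(kernel) // 2
    out = []
    for i in range(len(tokens)):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(tokens):
                acc += w * tokens[j]
        out.append(acc)
    return out

def self_attention(tokens):
    """Global mixing: single-head dot-product attention on scalar tokens.

    Assumption: queries, keys and values are the tokens themselves
    (identity projections), which is a simplification for illustration.
    """
    out = []
    for q in tokens:
        scores = [q * k for k in tokens]
        m = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append(sum(w / total * v for w, v in zip(weights, tokens)))
    return out

# Toy "image": 8 scalar patch tokens (hypothetical values, not data from the study).
patches = [0.5, -0.2, 0.1, 0.3, -0.4, 0.2, 0.0, 0.6]
perturbed = patches[:-1] + [patches[-1] + 1.0]  # change only the last patch

kernel = [0.25, 0.5, 0.25]  # hypothetical 3-tap smoothing kernel
conv_changed = conv1d_same(patches, kernel)[0] != conv1d_same(perturbed, kernel)[0]
attn_changed = self_attention(patches)[0] != self_attention(perturbed)[0]
print(conv_changed, attn_changed)
```

In this sketch a single convolution layer leaves the first output untouched because the last patch lies outside its three-token window, while one attention layer lets the first patch respond to it directly. Stacked convolutions do widen the receptive field, so the example shows a per-layer difference rather than a hard limit.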

Keywords:

Artificial intelligence; Computer science; Convolutional neural network; Pattern recognition (psychology); Transformer; Computer vision; Deep learning; Pixel; Engineering

Metrics

FWCI (Field Weighted Citation Impact): 1.53

Citation Normalized Percentile: 0.82

Topics

AI in cancer detection

Physical Sciences → Computer Science → Artificial Intelligence

Brain Tumor Detection and Classification

Life Sciences → Neuroscience → Neurology

Retinal Imaging and Analysis

Health Sciences → Medicine → Radiology, Nuclear Medicine and Imaging
