JOURNAL ARTICLE

GenConViT: Deepfake Video Detection Using Generative Convolutional Vision Transformer

Deressa Wodajo DeressaHannes MareenPeter LambertSolomon AtnafuZahid AkhtarGlenn Van Wallendael

Year: 2025 Journal:   Applied Sciences Vol: 15 (12)Pages: 6622-6622   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Deepfakes have raised significant concerns due to their potential to spread false information and compromise the integrity of digital media. Current deepfake detection models often struggle to generalize across a diverse range of deepfake generation techniques and video content. In this work, we propose a Generative Convolutional Vision Transformer (GenConViT) for deepfake video detection. Our model combines ConvNeXt and Swin Transformer models for feature extraction, and it utilizes an Autoencoder and Variational Autoencoder to learn from latent data distributions. By learning from the visual artifacts and latent data distribution, GenConViT achieves an improved performance in detecting a wide range of deepfake videos. The model is trained and evaluated on DFDC, FF++, TM, DeepfakeTIMIT, and Celeb-DF (v2) datasets. The proposed GenConViT model demonstrates strong performance in deepfake video detection, achieving high accuracy across the tested datasets. While our model shows promising results in deepfake video detection by leveraging visual and latent features, we demonstrate that further work is needed to improve its generalizability when encountering out-of-distribution data. Our model provides an effective solution for identifying a wide range of fake videos while preserving the integrity of media.

Keywords:
Computer science Transformer Artificial intelligence Computer vision Engineering Electrical engineering

Metrics

7
Cited By
33.41
FWCI (Field Weighted Citation Impact)
68
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Digital Media Forensic Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

DeepFake Video Detection using Vision Transformer

Shereen HussienSeif Mohamed

Journal:   International journal of intelligent computing and information sciences/International Journal of Intelligent Computing and Information Sciences Year: 2024 Vol: 0 (0)Pages: 0-0
JOURNAL ARTICLE

Realtime Deepfake Detection Using Video Vision Transformer

Abhijith Rajeev, P Sreejindeth, Shamnad CP, Rini T Paul, Anu Eldho

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2024
JOURNAL ARTICLE

Realtime Deepfake Detection Using Video Vision Transformer

Abhijith Rajeev, P Sreejindeth, Shamnad CP, Rini T Paul, Anu Eldho

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2024
© 2026 ScienceGate Book Chapters — All rights reserved.