Improved Deepfake Video Detection Using Convolutional Vision Transformer

Deressa Wodajo Deressa; Peter Lambert; Glenn Van Wallendael; Solomon Atnafu; Hannes Mareen

doi:10.1109/gem61861.2024.10585593

ScienceGate Book Chapters

JOURNAL ARTICLE

Improved Deepfake Video Detection Using Convolutional Vision Transformer

Deressa Wodajo Deressa Peter Lambert Glenn Van Wallendael Solomon Atnafu Hannes Mareen

Year: 2024 Pages: 1-6

DOI: 10.1109/gem61861.2024.10585593

Get Full-Text PDF Get Analytical Report

Abstract

Deepfakes are hyper-realistic videos in which the faces are replaced, swapped, or forged using deep-learning models. This potent media manipulation techniques hold promise for applications across various domains. Yet, they also present a significant risk when employed for malicious intents like iden-tity fraud, phishing, spreading false information, and executing scams. In this work, we propose a novel and improved Deepfake video detector that uses a Convolutional Vision Transformer (CViT2), which builds on the concepts of our previous work (CViT). The CViT architecture consists of two components: a Convolutional Neural Network that extracts learnable features, and a Vision Transformer that categorizes these learned features using an attention mechanism. We trained and evaluted our model on 5 datasets, namely Deepfake Detection Challenge Dataset (DFDC), FaceForensics++ (FF++) I, Celeb-DF v2, Deep-fakeTIMIT, and TrustedMedia. On the test sets unseen during training, we achieved an accuracy of 95 %, 94.8 %, 98.3 % and 76.7% on the DFDC, FF++ , Celeb-DF v2, and TIMIT datasets, respectively. In conclusion, our proposed Deepfake detector can be used in the battle against misinformation and other forensic use cases.

Keywords:

Computer science Transformer Artificial intelligence Computer vision Pattern recognition (psychology) Engineering Electrical engineering Voltage

Metrics

Cited By

5.83

FWCI (Field Weighted Citation Impact)

Refs

0.93

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Digital Media Forensic Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Enhancement Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Generative Adversarial Networks and Image Synthesis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Improved Deepfake Video Detection Using Convolutional Vision Transformer

Abstract

Metrics

Citation History

Topics

Related Documents

GenConViT: Deepfake Video Detection Using Generative Convolutional Vision Transformer

DeepFake Video Detection using Vision Transformer

Realtime Deepfake Detection Using Video Vision Transformer

Realtime Deepfake Detection using Video Vision Transformer

Realtime Deepfake Detection Using Video Vision Transformer