Van-Nhan Tran, Hoanh-Su Le, Piljoo Choi, Suk-Hwan Lee, Ki-Ryong Kwon
Deepfakes are digitally manipulated videos that appear realistic but are actually fake. With the rapid advances in deep generative models, the accessibility and sophistication of such manipulation technologies are increasing, making it more challenging to detect fake content. Different facial forgery techniques result in complex data distributions, and most existing deepfake detection approaches rely on convolutional neural networks (CNNs) that treat the task as a binary classification problem. While these methods achieve high accuracy on specific datasets, their generalization performance across datasets is often poor due to overfitting to manipulation techniques seen during training. In this study, we propose a model called MEViT, which integrates the EfficientNet Vision Transformer with a meta-learning framework to enhance generalization in deepfake detection. Furthermore, we introduce a pair-discrimination loss to push the feature representations of fake samples away from those of real samples, and a domain adjustment loss to reduce domain shifts across different manipulation methods. The MEViT model is trained on a specific manipulation method in the FaceForensics++ dataset and evaluated on other unseen methods from the same dataset. Additionally, we conduct extensive experiments on multiple deepfake benchmarks, including FaceForensics++ and CelebDF-v2, and compare our method with various state-of-the-art approaches to demonstrate its effectiveness.
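The abstract does not give the exact formulation of the pair-discrimination loss; as a hedged illustration only, one plausible reading (pushing fake feature vectors at least a margin away from real ones via a hinge over pairwise distances) could be sketched as follows. The function name, margin value, and hinge form are assumptions, not the paper's definition:

```python
import numpy as np

def pair_discrimination_loss(real_feats, fake_feats, margin=1.0):
    """Hypothetical margin-based pair-discrimination loss: penalize
    real/fake feature pairs whose Euclidean distance falls below
    `margin`, which pushes fake representations away from real ones."""
    # Pairwise differences between every real and every fake vector: (R, F, D)
    diffs = real_feats[:, None, :] - fake_feats[None, :, :]
    # Euclidean distance for each real/fake pair: (R, F)
    dists = np.linalg.norm(diffs, axis=-1)
    # Hinge: zero loss once a pair is separated by at least `margin`
    return np.maximum(0.0, margin - dists).mean()

# Well-separated pairs incur no loss; overlapping pairs do.
real = np.array([[0.0, 0.0], [0.1, 0.0]])
far_fake = np.array([[5.0, 5.0]])       # already beyond the margin
near_fake = np.array([[0.05, 0.0]])     # inside the margin
assert pair_discrimination_loss(real, far_fake) == 0.0
assert pair_discrimination_loss(real, near_fake) > 0.0
```

In a full training loop this term would be combined with the binary cross-entropy objective and the domain adjustment loss the abstract mentions; the exact weighting is specified in the paper, not here.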