JOURNAL ARTICLE

FMFN: Fine-Grained Multimodal Fusion Networks for Fake News Detection

Jingzi WangHongyan MaoHongwei Li

Year: 2022 Journal:   Applied Sciences Vol: 12 (3)Pages: 1093-1093   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

As one of the most popular social media platforms, microblogs are ideal places for news propagation. In microblogs, tweets with both text and images are more likely to attract attention than text-only tweets. This advantage is exploited by fake news producers to publish fake news, which has a devasting impact on individuals and society. Thus, multimodal fake news detection has attracted the attention of many researchers. For news with text and image, multimodal fake news detection utilizes both text and image information to determine the authenticity of news. Most of the existing methods for multimodal fake news detection obtain a joint representation by simply concatenating a vector representation of the text and a visual representation of the image, which ignores the dependencies between them. Although there are a small number of approaches that use the attention mechanism to fuse them, they are not fine-grained enough in feature fusion. The reason is that, for a given image, there are multiple visual features and certain correlations between these features. They do not use multiple feature vectors representing different visual features to fuse with textual features, and ignore the correlations, resulting in inadequate fusion of textual features and visual features. In this paper, we propose a novel fine-grained multimodal fusion network (FMFN) to fully fuse textual features and visual features for fake news detection. Scaled dot-product attention is utilized to fuse word embeddings of words in the text and multiple feature vectors representing different features of the image, which not only considers the correlations between different visual features but also better captures the dependencies between textual features and visual features. We conduct extensive experiments on a public Weibo dataset. Our approach achieves competitive results compared with other methods for fusing visual representation and text representation, which demonstrates that the joint representation learned by the FMFN (which fuses multiple visual features and multiple textual features) is better than the joint representation obtained by fusing a visual representation and a text representation in determining fake news.

Keywords:
Computer science Fuse (electrical) Feature (linguistics) Representation (politics) Social media Microblogging Artificial intelligence Image (mathematics) Information retrieval Pattern recognition (psychology) World Wide Web Linguistics

Metrics

61
Cited By
29.47
FWCI (Field Weighted Citation Impact)
30
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Misinformation and Its Impacts
Social Sciences →  Social Sciences →  Sociology and Political Science
COVID-19 diagnosis using AI
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multimodal fake news detection via progressive fusion networks

Jing JingHongchen WuJie SunXiaochang FangHuaxiang Zhang

Journal:   Information Processing & Management Year: 2022 Vol: 60 (1)Pages: 103120-103120
JOURNAL ARTICLE

Cross-Modal Fine-Grained Interaction Fusion in Fake News Detection

Zhanbin CheGuangBo Cui

Journal:   International Journal of Advanced Computer Science and Applications Year: 2024 Vol: 15 (5)
JOURNAL ARTICLE

Fine-Grained Differences-Similarities Enhancement Network for Multimodal Fake News Detection

Xiaoyu WuShi LiZhongyuan LaiHaifeng SongChunfang Hu

Journal:   International Journal of Advanced Computer Science and Applications Year: 2023 Vol: 14 (10)
© 2026 ScienceGate Book Chapters — All rights reserved.