Cross Modal-FT Net: A Multimodal Fake News Detection Framework using Text, Images, and User Behavior
Abstract
An unprecedented proliferation of fake news across digital platforms is a major hurdle for reliable information, people trust, and social stability. Current fake news detection techniques, primarily based on text analysis, frequently overlook the multimodal and behavioral indicators associated with contemporary misinformation. Multimodal approaches are rarer and typically classify news as either genuine or fraudulent. To address this problem, this paper proposes a CrossModal-FTNet (Fake News Transformer Network), a transformer-centric multimodal system that identifies fake news by analyzing text, associated images, and user actions such as likes, shares, and the reliability of sources. The suggested model includes three dedicated encoders: a BERT-inspired text encoder for contextual interpretation, a ResNet-50-inspired image encoder for visual cues, and a lightweight behavioral feature encoder for examining user interaction information. These varied representations are subsequently merged through a cross-modal fusion transformer, which synchronizes and enhances data from various sources into a single united feature space. Experiments on benchmark datasets such as Fakeddit, Weibo, MM-COVID, and Twitter15 indicate that the suggested model excels, attaining 94.3% accuracy and a 92.8% F1-score, outpacing multiple unimodal and early fusion baselines. The findings confirm that using cross-modal data greatly boosts the ability to detect fake news. Thus, CrossModal-FTNet offers a scalable, real-time, and precise solution for combating misinformation in the ever-changing online environment.
Keywords
References
K. Shu, S. Wang, D. Lee, and H. Liu, “Disinformation, misinformation, and fake news in social media. Cham: Springer International Publishing, 2020.
X.Zhou and R.Zafarani, “A survey of fake news: Fundamental theories, detection methods, and opportunities,” ACM Computing Surveys (CSUR), vol. 53, no. 5, pp. 1-40, 2020.
F.Alam, F.Dalvi, S.Shaar, N.Durrani, H.Mubarak, A.Nikolov, and P.Nakov, “Fighting the COVID-19 infodemic in social media: A holistic perspective and a call to arms,” In Proceedings of the International AAAI Conference on Web and Social Media, vol. 15, pp. 913-922, May. 2021.
Y.Li, B.Jiang, K.Shu, and H.Liu, “Mm-covid: A multilingual and multimodal data repository for combating covid-19 disinformation,” arXiv preprint arXiv:2011.04088, 2020.
R. K.Kaliyar, A.Goswami, and P.Narang, “FakeBERT: Fake news detection in social media with a BERT-based deep learning approach,” Multimedia tools and applications, vol. 80, no. 8, pp. 11765-11788, 2021.
M. van der Meer, P. Korshunov, S. Marcel, and L. van der Plas, “HintsOfTruth: A multimodal checkworthiness detection dataset with real and synthetic claims,” arXiv preprint arXiv:2502.11753, 2025.
W. Chen, F. Cai, Y. Guo, Z. Pan, W. Chen, and Y. Zhang, “Contrastive learning of cross-modal information enhancement for multimodal fake news detection,” Complex & Intelligent Systems, vol. 11, no. 7, pp. 303, 2025.
Y. Liu, Y. Ren, and J. Sui, “PMMC: Prompt-based multi-modal rumor detection model with modality conversion,” in Proc. 2024 Int. Joint Conf. Neural Netw. (IJCNN), pp. 1–6, Jun. 2024.
X. Fu, Z. Zhang, Y. Sun, T. Wu, H. Zhang, Y. Cao, and N. Zhang, “Dual-branch hybrid visual networks and hierarchical adaptive fusion strategy: An effective multimodal fake news detection model,” Eur. J. Artif. Intell., 2024, Art. no. 30504554251351227.
H.Xia, Y.Wang, J.Z.Zhang, L.J.Zheng, M.M.Kamal, and V.Arya, “COVID-19 fake news detection: A hybrid CNN-BiLSTM-AM model,” Technological Forecasting and Social Change, vol. 195, p.122746, 2023.
H.Chen, H.Guo, B.Hu, S.Hu, J.Hu, S.Lyu, and X.Wang, “A Self-Learning Multimodal Approach for Fake News Detection,” arXiv preprint arXiv:2412.05843, 2024.
P.Zhu, J.Hua, K.Tang, J.Tian, J.Xu, and X.Cui, “Multimodal fake news detection through intra-modality feature aggregation and inter-modality semantic fusion,” Complex & Intelligent Systems, vol. 10, no. 4, pp. 5851-5863, 2024.
J.Zhao, S.Zhang, B.Wang, T.Zhong, F.Yang, and B.Li, “Fake news detection by incorporating multi-modal information,” In International Conference on Internet of Things, Communication and Intelligent Technology (pp. 513-521). Singapore: Springer Nature Singapore, September. 2023.
R.Cantini, C.Cosentino, I.Kilanioti, F.Marozzo, and D.Talia, “Unmasking deception: a topic-oriented multimodal approach to uncover false information on social media,” Machine Learning, vol. 114, no. 1, pp.13, 2025.
J.Jing, H.Wu, J.Sun, X.Fang, H.Zhang, “Multimodal fake news detection via progressive fusion networks,” Information processing & management, vol. 60, no. 1, pp. 103120, 2023.
DOI: https://doi.org/10.52088/ijesty.v5i4.1245
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 K. Karnan, L.R. Aravind Babu



























