Multimodal and Multilingual Fact-Checked Article Retrieval
Papadopoulos, S.1, Beňová, I., Kula, S., Gregor, M., Karantaidis, G.1, Javůrek, T., Šimko, M., Papadopoulos, S.1
1 Centre for Research and Technology, Hellas, Information Technologies Institute, Thessaloniki, Greece
Fact-Check Retrieval (FCR) plays a crucial role in automated fact-checking by retrieving relevant fact-checked articles for disputed claims. While recent work has explored text-based, multilingual, and multimodal FCR, most efforts remain unimodal or limited to English. To bridge this gap, we introduce M3-Check, the first FCR dataset combining multilingual texts and images from social media posts with fact-check articles from diverse, credible sources. Furthermore, we introduce FACTOR, a two-tower Transformer-based architecture that employs cross-tower parameter sharing and modality-wise aligned weight initialization. FACTOR outperforms zero-shot baselines, two-tower linear models, and vanilla Transformers, achieving a 17% improvement over the latter. Moreover, we conduct modality ablations and compare state-of-the-art encoders, showing that multilingual encoders like multi-E5 can provide an additional 13% in performance without requiring English translations.
Cite: Papadopoulos, S., Benova, I., Kula, S., Gregor, M., Karantaidis, G., Javurek, T., Simko, M., Papadopoulos, S. Multimodal and Multilingual Fact-Checked Article Retrieval. In Proceedings of the 2025 International Conference on Multimedia Retrieval (ICMR ’25). Association for Computing Machinery, New York, NY, USA. (2025). DOI: https://doi.org/10.1145/3731715.3733402