Authors: J Geng, J Tonglet, I Gurevych
Year: 2026
Published in: arXiv preprint arXiv:2510.23508, 2025•arxiv.org
Institution: KU Leuven, TU Darmstadt, Ubiquitous Knowledge Processing Lab, MBZUAI, ATHENE
Research Area: Human-Computer Interaction
Discipline: Machine Learning, Artificial Intelligence
M4FC is a new dataset that addresses limitations in existing multimodal fact-checking datasets by providing multilingual and multicultural claims verified by professional fact-checkers across six fact-checking tasks.
Methods: The dataset was created by pairing 4,982 images with 6,980 claims, which were verified by professional fact-checkers from 22 organizations covering diverse cultural and geographic contexts. The claims are available in up to ten languages and span six different multimodal fact-checking tasks.
Key Findings: The study measured the efficacy of the M4FC dataset across six multimodal fact-checking tasks, with a focus on how combining intermediate tasks affects the performance of verdict prediction.
Citations: 3
Sample Size: 6980