The Open University
Browse

Explainable Deepfake Detection: A Multi-Model Framework with Human-Interpretable Rationales for legal investigation purposes

Posted on 2025-11-13 - 13:59 authored by Patrick Wong
<p dir="ltr">This project regards a new framework for deepfake detection, which pursues accuracy and explainability of detection, which is a critical need in legal investigations contexts such as policing and digital forensics. The framework is composed of advanced machine learning models, an explainable AI (XAI) component and three commonly used image processing methods for detecting manipulations, to detect and explain anomalies in deepfake images of human faces. Four independently trained CNN models were developed for the original and processed images, and through decision fusion achieved an overall detection accuracy of 97%. Moreover, the framework achieved an F1 score of 92% from a hidden test dataset used in the UK Home Office’s Deepfake Detection Challenge 2024, placing it third out of the competing teams in the image deepfake category. Shapley values were also used to identify the facial features that influenced the models’ detection decisions. This information enabled us to home in on various areas on the face to find features more likely to occur in deepfake images.</p><p dir="ltr">Through Bayes’ theorem, we presented a human-understandable detection method, achieving 85% detection accuracy on the test images while maintaining explainability of the detection rationales. Our work demonstrates that combining machine learning, image processing, XAI with human understandable rationales results in a demonstrably effective and practical deepfake detection system that could significantly streamline criminal investigations such as policing and digital forensics. Future research will explore the interplay between psychological factors and the acceptance and trust, and incorporate additional image processing techniques to enhance detection accuracy.</p><p dir="ltr">For reproducibility purposes, we have made our trained models and the test dataset available. </p><p dir="ltr">The findings of this project has been submitted to the following journal and is being reviewed: <a href="https://www.sciencedirect.com/journal/machine-learning-with-applications" rel="noreferrer" target="_blank">Machine Learning with Applications</a></p>

CITE THIS COLLECTION

DataCite
No result found
or
Select your citation style and then place your mouse over the citation text to select it.

Research Group

  • Centre for Research in Computing (CRC)

SHARE

email
need help?