An adversarial attack approach for eXplainable AI evaluation on deepfake detection models
Balachandar Gowrisankar, Vrizlynn L.L. Thing

TL;DR
This paper evaluates the effectiveness of XAI tools on deepfake detection models, demonstrating that generic evaluation methods are inadequate and proposing a specialized approach for better assessment.
Contribution
It introduces a novel XAI evaluation method tailored specifically for deepfake detection models, addressing limitations of generic evaluation techniques.
Findings
Generic XAI evaluation methods are less effective for deepfake models
Proposed approach provides more meaningful evaluation results
Experimental results validate the new evaluation method
Abstract
With the rising concern on model interpretability, the application of eXplainable AI (XAI) tools on deepfake detection models has been a topic of interest recently. In image classification tasks, XAI tools highlight pixels influencing the decision given by a model. This helps in troubleshooting the model and determining areas that may require further tuning of parameters. With a wide range of tools available in the market, choosing the right tool for a model becomes necessary as each one may highlight different sets of pixels for a given image. There is a need to evaluate different tools and decide the best performing ones among them. Generic XAI evaluation methods like insertion or removal of salient pixels/segments are applicable for general image classification tasks but may produce less meaningful results when applied on deepfake detection models due to their functionality. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
