Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks
Dorothy Torres, Wei Cheng, Ke Hu

TL;DR
This paper introduces an explainable AML triage framework using LLMs that incorporates evidence retrieval, explicit citations, and counterfactual validation to improve decision transparency and robustness.
Contribution
It presents a novel evidence-constrained decision process with structured LLM outputs and counterfactual checks, enhancing explainability and compliance in AML workflows.
Findings
Evidence grounding improves auditability and reduces hallucination errors.
Counterfactual validation increases decision explainability and robustness.
Achieved PR-AUC 0.75 and Escalate F1 0.62 on synthetic AML benchmarks.
Abstract
Anti-money laundering (AML) transaction monitoring generates large volumes of alerts that must be rapidly triaged by investigators under strict audit and governance constraints. While large language models (LLMs) can summarize heterogeneous evidence and draft rationales, unconstrained generation is risky in regulated workflows due to hallucinations, weak provenance, and explanations that are not faithful to the underlying decision. We propose an explainable AML triage framework that treats triage as an evidence-constrained decision process. Our method combines (i) retrieval-augmented evidence bundling from policy/typology guidance, customer context, alert triggers, and transaction subgraphs, (ii) a structured LLM output contract that requires explicit citations and separates supporting from contradicting or missing evidence, and (iii) counterfactual checks that validate whether minimal,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
