Code Word Detection in Fraud Investigations using a Deep-Learning Approach
Youri van der Zee, Jan C. Scholtes, Marcel Westerhoud, Julien Rossi

TL;DR
This paper presents a deep learning approach using BERT to detect code words in email communications, aiding fraud investigations by automating the identification of deceptive language.
Contribution
It introduces a novel synthetic dataset and demonstrates that BERT significantly outperforms other methods in detecting code words in fraud-related texts.
Findings
BERT achieves an F1 score of 0.9 in code word detection.
Deep neural language models are effective for fraud investigation tasks.
The framework helps organize data for easier investigative analysis.
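For readers unfamiliar with the metric in the first finding, F1 is the harmonic mean of precision and recall. The sketch below assumes a token-level setup in which each token is labelled 1 (code word) or 0 (ordinary word). That setup, the function name, and the toy labels are illustrative assumptions and are not taken from the paper's evaluation.

```python
def f1_score(gold, pred, positive=1):
    """Return F1 for the positive class, given two equal-length label lists."""
    pairs = list(zip(gold, pred))
    # True positives: code words that were correctly flagged.
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    # False positives: ordinary words wrongly flagged as code words.
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    # False negatives: code words that were missed.
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    if tp == 0:
        return 0.0  # avoids division by zero when nothing is correctly flagged
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels for eight tokens (made up for illustration only).
gold = [0, 1, 0, 0, 1, 1, 0, 1]
pred = [0, 1, 0, 1, 1, 0, 0, 1]
score = f1_score(gold, pred)
print(f"F1 = {score:.2f}")
```

Because F1 penalises both missed code words and false alarms, a high score requires the detector to do well on both counts.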
Abstract
In modern litigation, fraud investigators often face an overwhelming number of documents that must be reviewed throughout a matter. In the majority of legal cases, fraud investigators do not know beforehand exactly what they are looking for, nor where to find it. In addition, fraudsters may use code words to hide their behaviour and intentions. Effectively, this means fraud investigators are looking for a needle in a haystack without knowing what the needle looks like. As part of a larger research program, we use a framework to expedite the investigation process by applying text-mining and machine learning techniques. We structure this framework using three well-known methods in fraud investigations: (i) the fraud triangle, (ii) the golden ("W") investigation questions, and (iii) the analysis of competing hypotheses. With this framework, it is possible to…
Taxonomy
Topics: Topic Modeling · Natural Language Processing Techniques · Authorship Attribution and Profiling
Methods: Linear Layer · Adam · Attention Is All You Need · Attention Dropout · WordPiece · Residual Connection · Linear Warmup With Linear Decay · Weight Decay · Dropout
