NLP-based detection of systematic anomalies among the narratives of   consumer complaints

Peiheng Gao; Ning Sun; Xuefeng Wang; Chen Yang; Ri\v{c}ardas Zitikis

arXiv:2308.11138·stat.ME·March 28, 2024

NLP-based detection of systematic anomalies among the narratives of consumer complaints

Peiheng Gao, Ning Sun, Xuefeng Wang, Chen Yang, Ri\v{c}ardas Zitikis

PDF

Open Access

TL;DR

This paper presents an NLP-based method to identify systematic nonmeritorious consumer complaints by converting complaint narratives into quantitative data and analyzing them with specialized algorithms.

Contribution

It introduces a novel two-step approach combining NLP and anomaly detection algorithms to identify systematic anomalies in consumer complaint narratives.

Findings

01

Effective detection of systematic anomalies demonstrated on CFPB data

02

Improved identification of frequent nonmeritorious complaints

03

Combines classification and quantitative analysis for better accuracy

Abstract

We develop an NLP-based procedure for detecting systematic nonmeritorious consumer complaints, simply called systematic anomalies, among complaint narratives. While classification algorithms are used to detect pronounced anomalies, in the case of smaller and frequent systematic anomalies, the algorithms may falter due to a variety of reasons, including technical ones as well as natural limitations of human analysts. Therefore, as the next step after classification, we convert the complaint narratives into quantitative data, which are then analyzed using an algorithm for detecting systematic anomalies. We illustrate the entire procedure using complaint narratives from the Consumer Complaint Database of the Consumer Financial Protection Bureau.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Cybercrime and Law Enforcement Studies