A Self-Attention Network for Hierarchical Data Structures with an Application to Claims Management

Leander Löw; Martin Spindler; Eike Brechmann

arXiv:1808.10543·cs.LG·September 3, 2018

TL;DR

This paper proposes a self-attention neural network for hierarchical, variable-length health care claim data. On a data set of two million claims, it outperforms bag-of-words models, hand-designed features, and convolutional neural networks at claim management, where the goal is to single out fraudulent claims.

Contribution

The paper proposes two architectures for hierarchical, variable-length claim data: one based on piecewise feed-forward neural networks and one based on self-attention. Both outperform the baseline models, and the self-attention model performs best.

Findings

01

The proposed models outperform bag-of-words models, hand-designed features, and CNN-based models.

02

The comparison is carried out on a data set of two million health care claims.

03

Self-attention model performs best among tested approaches.

Abstract

Insurance companies must manage millions of claims per year. While most of these claims are non-fraudulent, fraud detection is a core task for insurance companies. The ultimate goal is a predictive model that singles out the fraudulent claims so that the non-fraudulent ones can be paid out immediately. Modern machine learning methods are well suited to this kind of problem. Health care claims often have a data structure that is hierarchical and of variable length. We propose one model based on piecewise feed-forward neural networks (deep learning) and another model based on self-attention neural networks for the task of claim management. We show that the proposed methods outperform bag-of-words-based models, hand-designed features, and models based on convolutional neural networks on a data set of two million health care claims. The proposed self-attention method performs best.
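
To make the idea of attending over a variable-length claim concrete, the following is a minimal sketch of generic scaled dot-product self-attention over the items of a single claim, followed by mean pooling into a fixed-size vector. It is not the authors' architecture, which the abstract does not specify. The function names (`pad_items`, `self_attention_pool`), the weight matrices, the dimensions, and the choice of mean pooling are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def pad_items(items, n_max):
    """Pad a (n, d) array of claim items to (n_max, d) and return a validity mask.

    Hypothetical helper: claims have different numbers of items, so they are
    padded to a common length and the padding is masked out later.
    """
    n, d = items.shape
    padded = np.zeros((n_max, d))
    padded[:n] = items
    mask = np.zeros(n_max, dtype=bool)
    mask[:n] = True
    return padded, mask


def self_attention_pool(items, mask, w_q, w_k, w_v):
    """Scaled dot-product self-attention over claim items, then masked mean pooling.

    items: (n_max, d_in) padded item features
    mask:  (n_max,) True for real items, False for padding
    Returns a (d_v,) vector representing the whole claim.
    """
    q, k, v = items @ w_q, items @ w_k, items @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    # Padded items must not receive attention weight.
    scores = np.where(mask[None, :], scores, -1e9)
    weights = softmax(scores, axis=-1)
    attended = weights @ v
    # Average only over the real items (assumed pooling choice).
    return (attended * mask[:, None]).sum(axis=0) / mask.sum()


# Illustrative usage with random weights and a claim of three items.
rng = np.random.default_rng(0)
d_in, d_k, d_v = 4, 8, 6
w_q = rng.normal(size=(d_in, d_k))
w_k = rng.normal(size=(d_in, d_k))
w_v = rng.normal(size=(d_in, d_v))

claim = rng.normal(size=(3, d_in))
claim_vector = self_attention_pool(*pad_items(claim, 5), w_q, w_k, w_v)
print(claim_vector.shape)
```

Because padded positions are masked and no positional encoding is used, the pooled vector in this sketch depends neither on how much padding is added nor on the order of the items. That is one plausible way to handle sets of claim items of varying size; a trained model would learn the weight matrices and feed the pooled vector to a classifier.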

Taxonomy

Topics: Imbalanced Data Classification Techniques · Machine Learning in Healthcare · Topic Modeling