Embeddings and Attention in Predictive Modeling

Kevin Kuo; Ronald Richman

arXiv:2104.03545·stat.AP·April 9, 2021·5 cites

Embeddings and Attention in Predictive Modeling

Kevin Kuo, Ronald Richman

PDF

Open Access

TL;DR

This paper investigates how embeddings and attention mechanisms can improve predictive modeling of claim severity, demonstrating their utility in feature representation, interpretability, and performance enhancement.

Contribution

It introduces methods for integrating embeddings and attention in predictive models, and shows how they can be used for feature extraction and improved accuracy.

Findings

01

Embeddings serve as effective pretrained features for linear models.

02

Attention mechanisms enhance the contextual relevance of embeddings.

03

Models with attention outperform simpler neural networks in claim severity prediction.

Abstract

We explore in depth how categorical data can be processed with embeddings in the context of claim severity modeling. We develop several models that range in complexity from simple neural networks to state-of-the-art attention based architectures that utilize embeddings. We illustrate the utility of learned embeddings from neural networks as pretrained features in generalized linear models, and discuss methods for visualizing and interpreting embeddings. Finally, we explore how attention based models can contextually augment embeddings, leading to enhanced predictive performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Bayesian Modeling and Causal Inference