Embeddings and Attention in Predictive Modeling
Kevin Kuo, Ronald Richman

TL;DR
This paper investigates how embeddings and attention mechanisms can improve predictive modeling of claim severity, demonstrating their utility in feature representation, interpretability, and performance enhancement.
Contribution
It introduces methods for integrating embeddings and attention in predictive models, and shows how they can be used for feature extraction and improved accuracy.
Findings
Embeddings serve as effective pretrained features for linear models.
Attention mechanisms enhance the contextual relevance of embeddings.
Models with attention outperform simpler neural networks in claim severity prediction.
Abstract
We explore in depth how categorical data can be processed with embeddings in the context of claim severity modeling. We develop several models that range in complexity from simple neural networks to state-of-the-art attention based architectures that utilize embeddings. We illustrate the utility of learned embeddings from neural networks as pretrained features in generalized linear models, and discuss methods for visualizing and interpreting embeddings. Finally, we explore how attention based models can contextually augment embeddings, leading to enhanced predictive performance.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Bayesian Modeling and Causal Inference
