Latent Feature Mining for Predictive Model Enhancement with Large   Language Models

Bingxuan Li; Pengyi Shi; Amy Ward

arXiv:2410.04347·cs.LG·October 8, 2024

Latent Feature Mining for Predictive Model Enhancement with Large Language Models

Bingxuan Li, Pengyi Shi, Amy Ward

PDF

Open Access

TL;DR

This paper introduces FLAME, a framework using large language models to infer latent features from limited data, improving predictive models in sensitive domains like criminal justice and healthcare.

Contribution

The paper presents a novel text-to-text logical reasoning approach with LLMs to extract latent features, enhancing predictive models in data-constrained, sensitive domains.

Findings

01

Latent features inferred by FLAME align with ground truth labels.

02

Enhanced predictive accuracy in criminal justice and healthcare tasks.

03

Framework demonstrates effective transferability across domains.

Abstract

Predictive modeling often faces challenges due to limited data availability and quality, especially in domains where collected features are weakly correlated with outcomes and where additional feature collection is constrained by ethical or practical difficulties. Traditional machine learning (ML) models struggle to incorporate unobserved yet critical factors. In this work, we introduce an effective approach to formulate latent feature mining as text-to-text propositional logical reasoning. We propose FLAME (Faithful Latent Feature Mining for Predictive Model Enhancement), a framework that leverages large language models (LLMs) to augment observed features with latent features and enhance the predictive power of ML models in downstream tasks. Our framework is generalizable across various domains with necessary domain-specific adaptation, as it is designed to incorporate contextual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Web Data Mining and Analysis

MethodsALIGN