LML-DAP: Language Model Learning a Dataset for Data-Augmented Prediction

Praneeth Vadlapati

arXiv:2409.18957·cs.CL·January 3, 2025

LML-DAP: Language Model Learning a Dataset for Data-Augmented Prediction

Praneeth Vadlapati

PDF

Open Access 1 Repo

TL;DR

This paper presents LML-DAP, a novel approach using Large Language Models for explainable, data-augmented classification that achieves high accuracy and reduces reliance on traditional feature engineering.

Contribution

It introduces the LML-DAP framework, combining dataset summarization and context-aware querying to enhance explainability and performance in classification tasks.

Findings

01

Achieved over 90% accuracy in test cases

02

Provides explainable and context-aware classification

03

Reduces need for data cleaning and feature engineering

Abstract

Classification tasks are typically handled using Machine Learning (ML) models, which lack a balance between accuracy and interpretability. This paper introduces a new approach for classification tasks using Large Language Models (LLMs) in an explainable method. Unlike ML models, which rely heavily on data cleaning and feature engineering, this method streamlines the process using LLMs. This paper proposes a method called "Language Model Learning (LML)" powered by a new method called "Data-Augmented Prediction (DAP)." The classification is performed by LLMs using a method similar to that used by humans who manually explore and understand the data to decide classifications. In the process of LML, a dataset is summarized and evaluated to determine the features leading to each label the most. In the DAP process, the system uses the data summary and a row of the testing dataset to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pro-genai/lml-dap
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dropout · Dense Connections · Multi-Head Attention · Layer Normalization · Position-Wise Feed-Forward Layer