LML-DAP: Language Model Learning a Dataset for Data-Augmented Prediction
Praneeth Vadlapati

TL;DR
This paper presents LML-DAP, a novel approach using Large Language Models for explainable, data-augmented classification that achieves high accuracy and reduces reliance on traditional feature engineering.
Contribution
It introduces the LML-DAP framework, combining dataset summarization and context-aware querying to enhance explainability and performance in classification tasks.
Findings
Achieved over 90% accuracy in test cases
Provides explainable and context-aware classification
Reduces need for data cleaning and feature engineering
Abstract
Classification tasks are typically handled using Machine Learning (ML) models, which lack a balance between accuracy and interpretability. This paper introduces a new approach for classification tasks using Large Language Models (LLMs) in an explainable method. Unlike ML models, which rely heavily on data cleaning and feature engineering, this method streamlines the process using LLMs. This paper proposes a method called "Language Model Learning (LML)" powered by a new method called "Data-Augmented Prediction (DAP)." The classification is performed by LLMs using a method similar to that used by humans who manually explore and understand the data to decide classifications. In the process of LML, a dataset is summarized and evaluated to determine the features leading to each label the most. In the DAP process, the system uses the data summary and a row of the testing dataset to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dropout · Dense Connections · Multi-Head Attention · Layer Normalization · Position-Wise Feed-Forward Layer
