Instruction-Based Fine-tuning of Open-Source LLMs for Predicting Customer Purchase Behaviors
Halil Ibrahim Ergul, Selim Balcisoy, and Burcin Bozkaya

TL;DR
This paper demonstrates that instruction fine-tuning large language models like Mistral with LoRA significantly improves their ability to predict customer purchase behaviors from financial transaction data, outperforming traditional models.
Contribution
It introduces a novel application of instruction fine-tuning with LoRA to adapt open-source LLMs for financial transaction prediction tasks, showing superior performance.
Findings
Fine-tuned Mistral outperforms traditional models in F1 scores.
Model better predicts minority classes in transaction data.
Enhanced semantic understanding improves prediction accuracy.
Abstract
In this study, the performance of various predictive models, including probabilistic baseline, CNN, LSTM, and finetuned LLMs, in forecasting merchant categories from financial transaction data have been evaluated. Utilizing datasets from Bank A for training and Bank B for testing, the superior predictive capabilities of the fine-tuned Mistral Instruct model, which was trained using customer data converted into natural language format have been demonstrated. The methodology of this study involves instruction fine-tuning Mistral via LoRA (LowRank Adaptation of Large Language Models) to adapt its vast pre-trained knowledge to the specific domain of financial transactions. The Mistral model significantly outperforms traditional sequential models, achieving higher F1 scores in the three key merchant categories of bank transaction data (grocery, clothing, and gas stations) that is crucial for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCustomer churn and segmentation · Data Mining Algorithms and Applications · Data Stream Mining Techniques
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
