Can AI Help with Your Personal Finances?
Oudom Hean, Utsha Saha, Binita Saha

TL;DR
This paper evaluates the effectiveness of large language models in providing financial advice, highlighting their current capabilities, limitations, and potential for future personal finance applications.
Contribution
It offers a comparative analysis of leading LLMs' performance in personal finance topics, revealing their accuracy levels and areas needing improvement.
Findings
Average accuracy of ~70% in financial advice
Performance varies across different financial topics
Notable improvements in newer LLM versions
Abstract
In recent years, Large Language Models (LLMs) have emerged as a transformative development in artificial intelligence (AI), drawing significant attention from industry and academia. Trained on vast datasets, these sophisticated AI systems exhibit impressive natural language processing and content generation capabilities. This paper explores the potential of LLMs to address key challenges in personal finance, focusing on the United States. We evaluate several leading LLMs, including OpenAI's ChatGPT, Google's Gemini, Anthropic's Claude, and Meta's Llama, to assess their effectiveness in providing accurate financial advice on topics such as mortgages, taxes, loans, and investments. Our findings show that while these models achieve an average accuracy rate of approximately 70%, they also display notable limitations in certain areas. Specifically, LLMs struggle to provide accurate responses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Literacy, Pension, Retirement Analysis · Insurance, Mortality, Demography, Risk Management
MethodsSoftmax · Attention Is All You Need
