InvestLM: A Large Language Model for Investment using Financial Domain Instruction Tuning
Yi Yang, Yixuan Tang, Kar Yan Tam

TL;DR
InvestLM is a specialized large language model for finance, trained on a curated dataset, demonstrating strong understanding and helpful responses in financial investment tasks, comparable to leading commercial models.
Contribution
This work introduces InvestLM, a financial domain LLM fine-tuned on a small, carefully curated instruction dataset, showing high performance with limited data and supporting the Superficial Alignment Hypothesis.
Findings
InvestLM performs comparably to GPT-3.5, GPT-4, and Claude-2 in financial tasks.
Zero-shot evaluation shows strong generalizability across financial NLP benchmarks.
Financial experts rate InvestLM's responses as highly helpful and accurate.
Abstract
We present a new financial domain large language model, InvestLM, tuned on LLaMA-65B (Touvron et al., 2023), using a carefully curated instruction dataset related to financial investment. Inspired by less-is-more-for-alignment (Zhou et al., 2023), we manually curate a small yet diverse instruction dataset, covering a wide range of financial related topics, from Chartered Financial Analyst (CFA) exam questions to SEC filings to Stackexchange quantitative finance discussions. InvestLM shows strong capabilities in understanding financial text and provides helpful responses to investment related questions. Financial experts, including hedge fund managers and research analysts, rate InvestLM's response as comparable to those of state-of-the-art commercial models (GPT-3.5, GPT-4 and Claude-2). Zero-shot evaluation on a set of financial NLP benchmarks demonstrates strong generalizability. From…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Adam · Layer Normalization · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Dense Connections
