Evaluating Large Language Models for Stance Detection on Financial Targets from SEC Filing Reports and Earnings Call Transcripts
Nikesh Gyawali, Doina Caragea, Alex Vasenkov, and Cornelia Caragea

TL;DR
This paper introduces a new dataset and evaluates large language models for sentence-level stance detection on financial targets in SEC filings and earnings transcripts, demonstrating the effectiveness of few-shot CoT prompting.
Contribution
The work provides a novel stance detection corpus for financial texts and systematically evaluates LLMs with various prompting strategies, highlighting their practical utility in finance.
Findings
Few-shot CoT prompting outperforms other methods
LLMs show variable performance across datasets
Leverages LLMs without large labeled datasets
Abstract
Financial narratives from U.S. Securities and Exchange Commission (SEC) filing reports and quarterly earnings call transcripts (ECTs) are very important for investors, auditors, and regulators. However, their length, financial jargon, and nuanced language make fine-grained analysis difficult. Prior sentiment analysis in the financial domain required a large, expensive labeled dataset, making the sentence-level stance towards specific financial targets challenging. In this work, we introduce a sentence-level corpus for stance detection focused on three core financial metrics: debt, earnings per share (EPS), and sales. The sentences were extracted from Form 10-K annual reports and ECTs, and labeled for stance (positive, negative, neutral) using the advanced ChatGPT-o3-pro model under rigorous human validation. Using this corpus, we conduct a systematic evaluation of modern large language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
