VideoConviction: A Multimodal Benchmark for Human Conviction and Stock Market Recommendations
Michael Galarnyk, Veer Kejriwal, Agam Shah, Yash Bhardwaj, Nicholas Meyer, Anand Krishnan, Sudheer Chava

TL;DR
VideoConviction introduces a comprehensive multimodal dataset and benchmark to analyze the influence of finfluencers on stock recommendations, highlighting challenges in modeling conviction and investment actions from multimodal signals.
Contribution
The paper presents VideoConviction, a new multimodal dataset with expert annotations, and benchmarks multimodal models on financial discourse analysis, addressing the gap in understanding finfluencer influence.
Findings
Multimodal inputs improve stock ticker extraction accuracy.
Models struggle to accurately classify investment conviction and actions.
Inverse investment strategies based on finfluencer recommendations outperform the S&P 500.
Abstract
Social media has amplified the reach of financial influencers known as "finfluencers," who share stock recommendations on platforms like YouTube. Understanding their influence requires analyzing multimodal signals like tone, delivery style, and facial expressions, which extend beyond text-based financial analysis. We introduce VideoConviction, a multimodal dataset with 6,000+ expert annotations, produced through 457 hours of human effort, to benchmark multimodal large language models (MLLMs) and text-based large language models (LLMs) in financial discourse. Our results show that while multimodal inputs improve stock ticker extraction (e.g., extracting Apple's ticker AAPL), both MLLMs and LLMs struggle to distinguish investment actions and conviction--the strength of belief conveyed through confident delivery and detailed reasoning--often misclassifying general commentary as definitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
