# Interpretable multimodal reasoning for robo-advisory: the FinErva framework

**Authors:** Jiarui Chi

PMC · DOI: 10.3389/frai.2025.1752580 · Frontiers in Artificial Intelligence · 2026-01-21

## TL;DR

This paper introduces FinErva, a dataset and framework for interpretable AI in financial advisory tasks, enabling lightweight models to perform like finance professionals.

## Contribution

The novel FinErva dataset and two-stage training framework enable interpretable multimodal reasoning in robo-advisory systems.

## Key findings

- Lightweight models trained with FinErva approach the performance of finance professionals.
- The framework outperforms non-expert investors in financial tasks like contract review and chart analysis.
- Multimodal chain-of-thought supervision improves interpretability under computational constraints.

## Abstract

The rapid development of robo-advisory and quantitative investment has been accompanied by persistent concerns about limited personalization and the opacity of black-box models operating on multimodal financial information. This paper addresses these issues from a decision-support perspective by constructing FinErva, a multimodal chain-of-thought dataset tailored to financial applications. FinErva comprises 7,544 manually verified question–answer pairs, divided into two economically relevant tasks: contract and disclosure understanding (FinErva-Pact) and candlestick-chart-based technical analysis (FinErva-Price). Building on this dataset, the paper propose a two-stage training framework: Supervised-CoT Learning followed by Self-CoT Refinement, and apply it to eight vision–language models, each with fewer than 0.8 billion parameters. Empirical results show that those lightweight models approach the performance of finance professionals and clearly outperform non-expert investors. Overall, the findings indicate that appropriately designed multimodal chain of thought supervision enables interpretable modeling of key research tasks such as contract review and chart interpretation under realistic computational and deployment constraints, providing new data and methodology for the development of personalized, explainable, and operationally feasible AI systems in investment advisory and risk management.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12868138/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12868138/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12868138/full.md

---
Source: https://tomesphere.com/paper/PMC12868138