FETILDA: An Effective Framework For Fin-tuned Embeddings For Long Financial Text Documents
Bolun "Namir" Xia, Vipula D. Rawte, Mohammed J. Zaki, Aparna Gupta

TL;DR
This paper introduces FETILDA, a deep learning framework that enhances the representation of long financial documents using fine-tuned, long-input language models with chunking and self-attention, improving predictive accuracy for financial KPIs.
Contribution
The work presents a novel framework combining chunking, domain-specific fine-tuning, and self-attention to better extract features from long financial texts for regression tasks.
Findings
Outperforms baseline models in financial text regression
Effective in capturing soft financial information from long documents
Improves predictive accuracy over standard pre-trained models
Abstract
Unstructured data, especially text, continues to grow rapidly in various domains. In particular, in the financial sphere, there is a wealth of accumulated unstructured financial data, such as the textual disclosure documents that companies submit on a regular basis to regulatory agencies, such as the Securities and Exchange Commission (SEC). These documents are typically very long and tend to contain valuable soft information about a company's performance. It is therefore of great interest to learn predictive models from these long textual documents, especially for forecasting numerical key performance indicators (KPIs). Whereas there has been a great progress in pre-trained language models (LMs) that learn from tremendously large corpora of textual data, they still struggle in terms of effective representations for long documents. Our work fills this critical need, namely how to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
