FinTexTS: Financial Text-Paired Time-Series Dataset via Semantic-Based and Multi-Level Pairing
Jaehoon Lee, Suhwan Park, Tae Yoon Lim, Seunghan Lee, Jun Seo, Dongwan Kang, Hwanil Choi, Minjae Kim, Sungdong Yoo, SoonYoung Lee, Yongjae Lee, Wonbin Ahn

TL;DR
This paper introduces FinTexTS, a large-scale financial text-paired time-series dataset created using a semantic-based, multi-level pairing framework that improves stock price forecasting by capturing complex interdependencies in financial data.
Contribution
The paper presents a novel semantic-based, multi-level pairing framework for constructing financial text and time-series datasets, enhancing the quality of data for stock prediction tasks.
Findings
Semantic-based pairing improves relevance of news to stock data.
Multi-level classification captures diverse contextual influences.
Enhanced datasets lead to better stock price forecasting accuracy.
Abstract
The financial domain involves a variety of important time-series problems. Recently, time-series analysis methods that jointly leverage textual and numerical information have gained increasing attention. Accordingly, numerous efforts have been made to construct text-paired time-series datasets in the financial domain. However, financial markets are characterized by complex interdependencies, in which a company's stock price is influenced not only by company-specific events but also by events in other companies and broader macroeconomic factors. Existing approaches that pair text with financial time-series data based on simple keyword matching often fail to capture such complex relationships. To address this limitation, we propose a semantic-based and multi-level pairing framework. Specifically, we extract company-specific context for the target company from SEC filings and apply an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Time Series Analysis and Forecasting · Complex Systems and Time Series Analysis
