Chinese Stock Prediction Based on a Multi-Modal Transformer Framework: Macro-Micro Information Fusion
Lumen AI, Tengzhou No. 1 Middle School, Shihao Ji, Zihui Song, Fucheng, Zhong, Jisen Jia, Zhaobo Wu, Zheyi Cao, Xu Tianhao

TL;DR
This paper introduces a Multi-Modal Transformer framework that integrates macroeconomic, micro-market, textual, and event data to enhance Chinese stock market prediction accuracy, effectively addressing data heterogeneity and time alignment challenges.
Contribution
The paper presents a novel multi-modal Transformer architecture with dynamic cross-modal fusion and mixed-frequency processing for stock prediction.
Findings
RMSE reduced by 23.7% compared to baseline
Event response prediction accuracy improved by 41.2%
Sharpe ratio increased by 32.6%
Abstract
This paper proposes an innovative Multi-Modal Transformer framework (MMF-Trans) designed to significantly improve the prediction accuracy of the Chinese stock market by integrating multi-source heterogeneous information including macroeconomy, micro-market, financial text, and event knowledge. The framework consists of four core modules: (1) A four-channel parallel encoder that processes technical indicators, financial text, macro data, and event knowledge graph respectively for independent feature extraction of multi-modal data; (2) A dynamic gated cross-modal fusion mechanism that adaptively learns the importance of different modalities through differentiable weight allocation for effective information integration; (3) A time-aligned mixed-frequency processing layer that uses an innovative position encoding method to effectively fuse data of different time frequencies and solves the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsAttention Is All You Need · Softmax · Adam · Residual Connection · Dropout · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer
