Chinese Stock Prediction Based on a Multi-Modal Transformer Framework:   Macro-Micro Information Fusion

Lumen AI; Tengzhou No. 1 Middle School; Shihao Ji; Zihui Song; Fucheng; Zhong; Jisen Jia; Zhaobo Wu; Zheyi Cao; Xu Tianhao

arXiv:2501.16621·cs.LG·January 29, 2025

Chinese Stock Prediction Based on a Multi-Modal Transformer Framework: Macro-Micro Information Fusion

Lumen AI, Tengzhou No. 1 Middle School, Shihao Ji, Zihui Song, Fucheng, Zhong, Jisen Jia, Zhaobo Wu, Zheyi Cao, Xu Tianhao

PDF

Open Access

TL;DR

This paper introduces a Multi-Modal Transformer framework that integrates macroeconomic, micro-market, textual, and event data to enhance Chinese stock market prediction accuracy, effectively addressing data heterogeneity and time alignment challenges.

Contribution

The paper presents a novel multi-modal Transformer architecture with dynamic cross-modal fusion and mixed-frequency processing for stock prediction.

Findings

01

RMSE reduced by 23.7% compared to baseline

02

Event response prediction accuracy improved by 41.2%

03

Sharpe ratio increased by 32.6%

Abstract

This paper proposes an innovative Multi-Modal Transformer framework (MMF-Trans) designed to significantly improve the prediction accuracy of the Chinese stock market by integrating multi-source heterogeneous information including macroeconomy, micro-market, financial text, and event knowledge. The framework consists of four core modules: (1) A four-channel parallel encoder that processes technical indicators, financial text, macro data, and event knowledge graph respectively for independent feature extraction of multi-modal data; (2) A dynamic gated cross-modal fusion mechanism that adaptively learns the importance of different modalities through differentiable weight allocation for effective information integration; (3) A time-aligned mixed-frequency processing layer that uses an innovative position encoding method to effectively fuse data of different time frequencies and solves the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStock Market Forecasting Methods

MethodsAttention Is All You Need · Softmax · Adam · Residual Connection · Dropout · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer