Stock Market Sentiment Classification and Backtesting via Fine-tuned BERT
Jiashu Lou

TL;DR
This paper develops a fine-tuned BERT model to classify stock market sentiment from user comments, integrating it into a regression-based trading strategy that significantly improves returns.
Contribution
It introduces a sentiment classification approach using fine-tuned BERT for stock trading and demonstrates its effectiveness in enhancing trading performance.
Findings
Sentiment classification accuracy improved with fine-tuning.
Incorporating emotional factors increased return rate by 73.8%.
The model outperformed baseline and original Alpha191 models.
Abstract
With the rapid development of big data and computing devices, low-latency automatic trading platforms based on real-time information acquisition have become the main components of the stock trading market, so the topic of quantitative trading has received widespread attention. And for non-strongly efficient trading markets, human emotions and expectations always dominate market trends and trading decisions. Therefore, this paper starts from the theory of emotion, taking East Money as an example, crawling user comment titles data from its corresponding stock bar and performing data cleaning. Subsequently, a natural language processing model BERT was constructed, and the BERT model was fine-tuned using existing annotated data sets. The experimental results show that the fine-tuned model has different degrees of performance improvement compared to the original model and the baseline model.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Dropout · Linear Layer · Layer Normalization · WordPiece · Multi-Head Attention · Linear Warmup With Linear Decay · Softmax
