Text Mining of Stocktwits Data for Predicting Stock Prices
Mukul Jaggi, Priyanka Mandal, Shreya Narang, Usman Naseem, Matloob, Khushi

TL;DR
This paper introduces FinALBERT, a specialized financial text classification model trained on Stocktwits data to improve stock price prediction by understanding market sentiment.
Contribution
The paper presents FinALBERT, a novel ALBERT-based model fine-tuned on labeled Stocktwits data for financial sentiment analysis and stock movement prediction.
Findings
FinALBERT outperforms traditional models in sentiment classification.
Labeling techniques effectively capture stock price change signals.
Model demonstrates adaptability across different architectures.
Abstract
Stock price prediction can be made more efficient by considering the price fluctuations and understanding the sentiments of people. A limited number of models understand financial jargon or have labelled datasets concerning stock price change. To overcome this challenge, we introduced FinALBERT, an ALBERT based model trained to handle financial domain text classification tasks by labelling Stocktwits text data based on stock price change. We collected Stocktwits data for over ten years for 25 different companies, including the major five FAANG (Facebook, Amazon, Apple, Netflix, Google). These datasets were labelled with three labelling techniques based on stock price changes. Our proposed model FinALBERT is fine-tuned with these labels to achieve optimal results. We experimented with the labelled dataset by training it on traditional machine learning, BERT, and FinBERT models, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Weight Decay · WordPiece · Softmax · Dense Connections · LAMB · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Residual Connection
