Predicting Financial Markets: Comparing Survey, News, Twitter and Search Engine Data
Huina Mao, Scott Counts, Johan Bollen

TL;DR
This study compares various online sentiment data sources and traditional surveys to evaluate their effectiveness in predicting financial market movements, finding Google search data and Twitter sentiment to be particularly predictive.
Contribution
It provides a comprehensive comparison of online sentiment indicators and traditional surveys, highlighting the predictive power of Google search volumes and Twitter sentiment over conventional methods.
Findings
Google search volumes predict market movements.
Twitter sentiment is a significant predictor.
Traditional surveys lag behind market changes.
Abstract
Financial market prediction on the basis of online sentiment tracking has drawn a lot of attention recently. However, most results in this emerging domain rely on a unique, particular combination of data sets and sentiment tracking tools. This makes it difficult to disambiguate measurement and instrument effects from factors that are actually involved in the apparent relation between online sentiment and market values. In this paper, we survey a range of online data sets (Twitter feeds, news headlines, and volumes of Google search queries) and sentiment tracking methods (Twitter Investor Sentiment, Negative News Sentiment and Tweet & Google Search volumes of financial terms), and compare their value for financial prediction of market indices such as the Dow Jones Industrial Average, trading volumes, and market volatility (VIX), as well as gold prices. We also compare the predictive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Complex Systems and Time Series Analysis
