Interpretable Machine Learning for Macro Alpha: A News Sentiment Case Study
Yuke Zhang

TL;DR
This paper presents an interpretable machine learning framework that leverages global news sentiment processed through domain-specific NLP to predict macroeconomic asset returns with high accuracy and explainability.
Contribution
It introduces a novel combination of finance-specific NLP and interpretable ML models for macro alpha extraction from news sentiment data.
Findings
XGBoost outperforms logistic regression in return prediction.
Sentiment dispersion and article impact are key predictive features.
High risk-adjusted returns demonstrated over extensive out-of-sample testing.
Abstract
This study introduces an interpretable machine learning (ML) framework to extract macroeconomic alpha from global news sentiment. We process the Global Database of Events, Language, and Tone (GDELT) Project's worldwide news feed using FinBERT -- a Bidirectional Encoder Representations from Transformers (BERT) based model pretrained on finance-specific language -- to construct daily sentiment indices incorporating mean tone, dispersion, and event impact. These indices drive an XGBoost classifier, benchmarked against logistic regression, to predict next-day returns for EUR/USD, USD/JPY, and 10-year U.S. Treasury futures (ZN). Rigorous out-of-sample (OOS) backtesting (5-fold expanding-window cross-validation, OOS period: c. 2017-April 2025) demonstrates exceptional, cost-adjusted performance for the XGBoost strategy: Sharpe ratios achieve 5.87 (EUR/USD), 4.65 (USD/JPY), and 4.65…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Sentiment Analysis and Opinion Mining · Market Dynamics and Volatility
