GroupSHAP-Guided Integration of Financial News Keywords and Technical Indicators for Stock Price Prediction
Minjoo Kim, Jinwoong Kim, Sangjin Park

TL;DR
This paper introduces a GroupSHAP-guided framework that leverages semantic keyword groups from financial news to improve stock price prediction accuracy and interpretability, reducing computational costs compared to traditional SHAP methods.
Contribution
It presents the first application of GroupSHAP in financial forecasting, combining semantic grouping with SHAP to enhance interpretability and predictive performance.
Findings
Achieved 32.2% reduction in MAE for S&P 500 forecasting
Achieved 40.5% reduction in RMSE compared to benchmarks
Demonstrated effective use of semantic keyword groups for stock prediction
Abstract
Recent advances in finance-specific language models such as FinBERT have enabled the quantification of public sentiment into index-based measures, yet compressing diverse linguistic signals into single metrics overlooks contextual nuances and limits interpretability. To address this limitation, explainable AI techniques, particularly SHAP (SHapley Additive Explanations), have been employed to identify influential features. However, SHAP's computational cost grows exponentially with input features, making it impractical for large-scale text-based financial data. This study introduces a GRU-based forecasting framework enhanced with GroupSHAP, which quantifies contributions of semantically related keyword groups rather than individual tokens, substantially reducing computational burden while preserving interpretability. We employed FinBERT to embed news articles from 2015 to 2024,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
