Quantitative Financial Modeling for Sri Lankan Markets: Approach Combining NLP, Clustering and Time-Series Forecasting
Linuk Perera

TL;DR
This paper presents a comprehensive quantitative framework combining NLP, clustering, and time-series forecasting to predict market regimes and signals in Sri Lankan markets, integrating ESG sentiment analysis with macroeconomic data.
Contribution
It introduces a novel integrated methodology specifically designed for emerging markets, combining sentiment analysis, clustering, and advanced forecasting models for improved market prediction.
Findings
Achieved 84.04% accuracy in regime classification
GRU model attained 0.801 R-squared in price forecasting
Fusion logic improved market signal accuracy
Abstract
This research introduces a novel quantitative methodology tailored for quantitative finance applications, enabling banks, stockbrokers, and investors to predict economic regimes and market signals in emerging markets, specifically Sri Lankan stock indices (S&P SL20 and ASPI) by integrating Environmental, Social, and Governance (ESG) sentiment analysis with macroeconomic indicators and advanced time-series forecasting. Designed to leverage quantitative techniques for enhanced risk assessment, portfolio optimization, and trading strategies in volatile environments, the architecture employs FinBERT, a transformer-based NLP model, to extract sentiment from ESG texts, followed by unsupervised clustering (UMAP/HDBSCAN) to identify 5 latent ESG regimes, validated via PCA. These regimes are mapped to economic conditions using a dense neural network and gradient boosting classifier, achieving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Forecasting Techniques and Applications · Sentiment Analysis and Opinion Mining
