ESG-FTSE: A corpus of news articles with ESG relevance labels and use cases
Mariya Pavlova, Bernard Casey, Miaosen Wang

TL;DR
ESG-FTSE is a novel corpus of news articles with detailed ESG relevance labels, enabling improved ESG scoring and responsible investing through NLP techniques.
Contribution
It introduces the first ESG news article corpus with a new annotation schema and demonstrates its utility for ESG relevance detection using supervised and unsupervised methods.
Findings
Supervised models achieve high accuracy in ESG relevance detection.
Unsupervised methods also show promising results.
The corpus facilitates research in ESG-related NLP applications.
Abstract
We present ESG-FTSE, the first corpus comprised of news articles with Environmental, Social and Governance (ESG) relevance annotations. In recent years, investors and regulators have pushed ESG investing to the mainstream due to the urgency of climate change. This has led to the rise of ESG scores to evaluate an investment's credentials as socially responsible. While demand for ESG scores is high, their quality varies wildly. Quantitative techniques can be applied to improve ESG scores, thus, responsible investing. To contribute to resource building for ESG and financial text mining, we pioneer the ESG-FTSE corpus. We further present the first of its kind ESG annotation schema. It has three levels: a binary classification (relevant versus irrelevant news articles), ESG classification (ESG-related news articles), and target company. Both supervised and unsupervised learning experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnvironmental and Social Impact Assessments
