TL;DR
This paper explores how to improve large language models for detecting ESG-related activities in financial texts by fine-tuning them on a new benchmark dataset, leading to better accuracy and domain-specific performance.
Contribution
It introduces ESG-Activities, a labeled dataset for ESG activity detection, and demonstrates that fine-tuning LLMs on this data significantly improves their classification accuracy.
Findings
Fine-tuning enhances LLM performance on ESG detection
Open models outperform proprietary solutions in specific setups
ESG-Activities dataset enables domain-specific model training
Abstract
The integration of Environmental, Social, and Governance (ESG) factors into corporate decision-making is a fundamental aspect of sustainable finance. However, ensuring that business practices align with evolving regulatory frameworks remains a persistent challenge. AI-driven solutions for automatically assessing the alignment of sustainability reports and non-financial disclosures with specific ESG activities could greatly support this process. Yet, this task remains complex due to the limitations of general-purpose Large Language Models (LLMs) in domain-specific contexts and the scarcity of structured, high-quality datasets. In this paper, we investigate the ability of current-generation LLMs to identify text related to environmental activities. Furthermore, we demonstrate that their performance can be significantly enhanced through fine-tuning on a combination of original and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLLaMA · ALIGN
