Optimizing Large Language Models for ESG Activity Detection in Financial Texts

Mattia Birti; Andrea Maurino; Francesco Osborne

arXiv:2502.21112·cs.AI·December 17, 2025

Optimizing Large Language Models for ESG Activity Detection in Financial Texts

Mattia Birti, Andrea Maurino, Francesco Osborne

PDF

1 Repo

TL;DR

This paper explores how to improve large language models for detecting ESG-related activities in financial texts by fine-tuning them on a new benchmark dataset, leading to better accuracy and domain-specific performance.

Contribution

It introduces ESG-Activities, a labeled dataset for ESG activity detection, and demonstrates that fine-tuning LLMs on this data significantly improves their classification accuracy.

Findings

01

Fine-tuning enhances LLM performance on ESG detection

02

Open models outperform proprietary solutions in specific setups

03

ESG-Activities dataset enables domain-specific model training

Abstract

The integration of Environmental, Social, and Governance (ESG) factors into corporate decision-making is a fundamental aspect of sustainable finance. However, ensuring that business practices align with evolving regulatory frameworks remains a persistent challenge. AI-driven solutions for automatically assessing the alignment of sustainability reports and non-financial disclosures with specific ESG activities could greatly support this process. Yet, this task remains complex due to the limitations of general-purpose Large Language Models (LLMs) in domain-specific contexts and the scarcity of structured, high-quality datasets. In this paper, we investigate the ability of current-generation LLMs to identify text related to environmental activities. Furthermore, we demonstrate that their performance can be significantly enhanced through fine-tuning on a combination of original and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mattia-brt/fine_tuning_llm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLLaMA · ALIGN