Advancing ESG Intelligence: An Expert-level Agent and Comprehensive Benchmark for Sustainable Finance
By: Yilei Zhao, Wentao Zhang, Xiao Lei and more
Environmental, social, and governance (ESG) criteria are essential for evaluating corporate sustainability and ethical performance. However, professional ESG analysis is hindered by data fragmentation across unstructured sources, and existing large language models (LLMs) often struggle with the complex, multi-step workflows required for rigorous auditing. To address these limitations, we introduce ESGAgent, a hierarchical multi-agent system equipped with a specialized toolset, including retrieval augmentation, web search, and domain-specific functions, to generate in-depth ESG analysis. Complementing this agentic system, we present a comprehensive three-level benchmark derived from 310 corporate sustainability reports, designed to evaluate capabilities ranging from atomic common-sense questions to the generation of integrated, in-depth analysis. Empirical evaluations demonstrate that ESGAgent achieves an average accuracy of 84.15% on atomic question-answering tasks, outperforming state-of-the-art closed-source LLMs, and excels in professional report generation by integrating rich charts and verifiable references. These findings confirm the diagnostic value of our benchmark, establishing it as a vital testbed for assessing general and advanced agentic capabilities in high-stakes vertical domains.
Similar Papers
Optimizing Large Language Models for ESG Activity Detection in Financial Texts
Artificial Intelligence
Helps companies check if their green claims are true.
ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge
Computation and Language
Helps computers understand green business rules.
ESGBench: A Benchmark for Explainable ESG Question Answering in Corporate Sustainability Reports
Computation and Language
Helps computers understand company green reports better.