Third place AIM Hackathon: Dissecting ESG reports by relevance with LLMs

Authors: Andrei I. Cursaru and Orion Forowycz

The Venturers team created an architecture to identify the relevance of each section of an ESG report. Their prototype classifies sentences based on their content, making it easier to tell concrete facts from vague, promotional statements. The resulting relevance map helps users navigate reports quickly, enables standardized comparisons between companies, and creates an incentive for clarity in ESG reporting.

Dissecting ESG reports by relevance with LLMs

ESG (Environmental, Social, and Governance) reports, published yearly by companies, were originally intended as a way for investors to easily assess whether investing in a given company may have indirect negative environmental or social impacts.

However, the interests of the people writing these reports are not aligned with the interests of the people reading them. For companies, the goal is to make themselves look good and to make it as tedious as possible for readers to find the hard facts they are looking for.

As a result, ESG reports drown the facts in an ocean of vague and often meaningless motivational statements about the company’s beliefs, values, long-term goals, and so on. For investors, these digressions carry little to no information about the potential negative impacts of a company, since no company will ever claim that they strive to destroy the planet and mistreat all their workers.

How could we use AI to help us to get more use out of these reports?

Large language models (LLMs) like GPT-4o can do many things, but they can’t directly tell us whether the information that a company reports is true, partial, biased or irrelevant. They can’t simply tell us whether, and how much, greenwashing fills the pages of an ESG report, because the information needed to judge this precisely is not available.

During the hackathon "Sustainability meets LLMs", organised by AIM – AI Impact Mission and supported by TIMETOACT Group Österreich, the team The Venturers came up with a simple idea to make ESG reports more useful: process each sentence of a report with an LLM to evaluate its relevance and usefulness to investors. The result is a "relevance map" of the report together with summary statistics of relevance, which also makes it possible to compare the reports of different companies.

The prototype parses a PDF document page by page and sentence by sentence (including the figures provided) and feeds each statement to a model (in our tests, GPT-4o) via the respective API. The model is prompted to answer a set of pre-defined questions with yes or no, such as:

"Does this statement or figure say something about the values or beliefs of the company?"
"Does this statement or figure state a quantified, concrete fact about the company?"

This output is then used to classify the statement into one or more categories: beliefs & values, goals and missions, quantified hard facts about the company, qualitative facts about the company, facts unrelated to the company.

By weighting these outputs by the length of each statement, the team could infer the percentage of facts versus abstract statements for each page, report, or company. This can then be used to:

Create a "relevance map" of the PDF, which shows, for example, that facts about the company's environmental progress appear only on pages 35-39 of a 100+ page PDF. This can guide investors to read only the relevant parts of the report, without relying on the report's often uninformative table of contents.
Extract only statements of a desired category into a new document, for example quantified hard facts about the company.
Create conciseness scores for reports based on these proportions.
Compare different companies to each other in a standardised way in terms of how much they try to drown the facts in their reports with decorative text. In the long term this could help to hold companies accountable for clear and readable reports, by giving them an incentive to increase their conciseness score and focus their reports on facts.
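As a rough sketch of how such length-weighted statistics could be computed, the Python example below works on statements that have already been classified. Several things here are assumptions rather than details from the prototype: the `Statement` record, the use of character counts as the length measure, the definition of the conciseness score as the share of characters that fall into the two factual categories, and the 0.5 threshold for calling a page relevant.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Set

# Assumption: these two categories count as "facts" for scoring purposes.
FACT_CATEGORIES = {
    "quantified hard facts about the company",
    "qualitative facts about the company",
}


@dataclass
class Statement:
    page: int             # page number in the PDF
    text: str             # the sentence or figure caption
    categories: Set[str]  # categories assigned by the model


def fact_share_by_page(statements: List[Statement]) -> Dict[int, float]:
    """Share of characters on each page that belong to factual statements."""
    total: Dict[int, int] = defaultdict(int)
    facts: Dict[int, int] = defaultdict(int)
    for s in statements:
        total[s.page] += len(s.text)
        if s.categories & FACT_CATEGORIES:
            facts[s.page] += len(s.text)
    return {p: facts[p] / total[p] for p in sorted(total) if total[p] > 0}


def conciseness_score(statements: List[Statement]) -> float:
    """Share of all characters in the report that belong to factual statements."""
    total = sum(len(s.text) for s in statements)
    facts = sum(len(s.text) for s in statements if s.categories & FACT_CATEGORIES)
    return facts / total if total else 0.0


def relevant_pages(statements: List[Statement], threshold: float = 0.5) -> List[int]:
    """Pages whose fact share reaches the (assumed) threshold."""
    return [p for p, share in fact_share_by_page(statements).items() if share >= threshold]


def extract(statements: List[Statement], category: str) -> List[str]:
    """Texts of all statements tagged with the given category."""
    return [s.text for s in statements if category in s.categories]
```

Applied to two reports, `conciseness_score` gives one number per report that can be compared directly, while `relevant_pages` gives a simple form of the relevance map.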

Outlook:

The prototype could be improved in various ways, for example by fine-tuning the categories and adding more of them, depending on the section of the report or the type of facts stated. Using GPT-4o is convenient for prototyping and for a good understanding of the context, but it is likely overkill for this task in the longer term. Switching to a smaller but still capable model would improve the solution by reducing costs, energy consumption and the overall carbon footprint.

