An electronic product carbon footprint dataset for question answering
Kaiwen Zhao, Ajesh Koyatan Chathoth, Bharathan Balaji, Stephen Lee

TL;DR
This paper introduces a dataset to extract and analyze carbon footprint data from computing product reports, aiming to standardize emissions information.
Contribution
The novel contribution is a carbon QA dataset for structured extraction of emissions data from unstructured sustainability reports.
Findings
The dataset includes annotated metadata and numerical reasoning tasks for accurate data processing.
About 75% of the dataset reflects PAIA-style carbon reporting practices.
The dataset supports training language models to automate emissions data aggregation.
Abstract
The embodied carbon of computing systems constitutes a significant portion of their greenhouse gas (GHG) emissions. To support environmental initiatives and meet evolving standards, many companies now disclose product carbon footprints in sustainability reports, often with detailed breakdowns. Yet these reports appear in diverse and unstructured formats—text, tables, and graphs embedded in PDFs—creating major challenges for extracting and analyzing component-specific emissions data. This lack of standardization limits comparative assessments and opportunities for targeted reductions. To address this, we introduce a carbon question-answering (QA) dataset designed to enable the extraction and analysis of data from carbon reports of computing products. The dataset features annotated metadata, numerical reasoning tasks, and structured derivations to ensure accurate processing of fragmented…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGreen IT and Sustainability · Environmental Impact and Sustainability · Machine Learning in Materials Science
