Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach
Tanwi Mallick, Joshua David Bergerson, Duane R. Verner, John K, Hutchison, Leslie-Anne Levy, Prasanna Balaprakash

TL;DR
This paper presents a weakly supervised NLP method that rapidly labels large climate and infrastructure literature corpora, enabling efficient topic analysis and supporting policy decisions on climate change impacts.
Contribution
The authors develop a weak supervision approach that significantly reduces labeling time, facilitating targeted literature analysis in climate change and infrastructure research.
Findings
Labels a large corpus in about 13 hours using weak supervision.
Enables efficient identification of relevant documents and topics.
Supports rapid trend discovery at the intersection of climate hazards and infrastructure.
Abstract
Natural language processing (NLP) is a promising approach for analyzing large volumes of climate-change and infrastructure-related scientific literature. However, best-in-practice NLP techniques require large collections of relevant documents (corpus). Furthermore, NLP techniques using machine learning and deep learning techniques require labels grouping the articles based on user-defined criteria for a significant subset of a corpus in order to train the supervised model. Even labeling a few hundred documents with human subject-matter experts is a time-consuming process. To expedite this process, we developed a weak supervision-based NLP approach that leverages semantic similarity between categories and documents to (i) establish a topic-specific corpus by subsetting a large-scale open-access corpus and (ii) generate category labels for the topic-specific corpus. In comparison with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPublic Relations and Crisis Communication · Infrastructure Resilience and Vulnerability Analysis · Risk Perception and Management
