Visual Text Mining with Progressive Taxonomy Construction for Environmental Studies
Sam Yu-Te Lee, Cheng-Wei Hung, Mei-Hua Yuan, Kwan-Liu Ma

TL;DR
This paper introduces GreenMine, an interactive system leveraging prompt engineering and LLMs to automate and refine the construction of DPSIR taxonomies from environmental text corpora, enhancing efficiency and flexibility.
Contribution
The paper presents GreenMine, a novel system that enables iterative taxonomy refinement and corpus annotation through natural language prompts and uncertainty visualization, addressing challenges in traditional text mining methods.
Findings
GreenMine effectively supports taxonomy construction and annotation in real-world environmental data.
The system's uncertainty score and visualization aid experts in evaluating and refining the taxonomy.
A case study demonstrates improved efficiency and insights in environmental text analysis.
Abstract
Environmental experts have developed the DPSIR (Driver, Pressure, State, Impact, Response) framework to systematically study and communicate key relationships between society and the environment. Using this framework requires experts to construct a DPSIR taxonomy from a corpus, annotate the documents, and identify DPSIR variables and relationships, which is laborious and inflexible. Automating it with conventional text mining faces technical challenges, primarily because the taxonomy often begins with abstract definitions, which experts progressively refine and contextualize as they annotate the corpus. In response, we develop GreenMine, a system that supports interactive text mining with prompt engineering. The system implements a prompting pipeline consisting of three simple and evaluable subtasks. In each subtask, the DPSIR taxonomy can be defined in natural language and iteratively…
Taxonomy
Topics: Text and Document Classification Technologies
