Large Language Models as a Tool for Mining Object Knowledge
Hannah YoungEun An, Lenhart K. Schubert

TL;DR
This paper explores the ability of large language models to explicitly extract and organize knowledge about the parts and materials of everyday objects, creating a resource for reasoning about object structure.
Contribution
It introduces a method to mine and compile detailed object knowledge from LLMs, focusing on parts and materials, with a large dataset of 2,300 objects.
Findings
LLMs demonstrate good coverage of object parts and materials.
The extracted knowledge is largely sound and reliable.
The dataset supports reasoning about object composition.
Abstract
Commonsense knowledge is essential for machines to reason about the world. Large language models (LLMs) have demonstrated their ability to perform almost human-like text generation. Despite this success, they fall short as trustworthy intelligent systems, due to the opacity of the basis for their answers and a tendency to confabulate facts when questioned about obscure entities or technical domains. We hypothesize, however, that their general knowledge about objects in the everyday world is largely sound. Based on that hypothesis, this paper investigates LLMs' ability to formulate explicit knowledge about common physical artifacts, focusing on their parts and materials. Our work distinguishes between the substances that comprise an entire object and those that constitute its partsa previously underexplored distinction in knowledge base construction. Using few-shot with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsBalanced Selection
