Supervised Fine Tuning of Large Language Models for Domain Specific Knowledge Graph Construction:A Case Study on Hunan's Historical Celebrities
Junjie Hao, Chun Wang, Ying Qiao, Qiuyue Zuo, Qiya Song, Hua Ma, Xieping Gao

TL;DR
This paper demonstrates how supervised fine-tuning of large language models can significantly improve the extraction of structured knowledge about Hunan's historical celebrities, aiding cultural heritage research.
Contribution
It introduces a schema-guided instruction fine-tuning method and applies it to enhance domain-specific knowledge extraction in low-resource cultural heritage contexts.
Findings
All models showed performance improvements after fine-tuning.
Qwen3-8B achieved the highest score of 89.39 with minimal data.
Fine-tuning large models is effective for regional cultural knowledge extraction.
Abstract
Large language models and knowledge graphs offer strong potential for advancing research on historical culture by supporting the extraction, analysis, and interpretation of cultural heritage. Using Hunan's modern historical celebrities shaped by Huxiang culture as a case study, pre-trained large models can help researchers efficiently extract key information, including biographical attributes, life events, and social relationships, from textual sources and construct structured knowledge graphs. However, systematic data resources for Hunan's historical celebrities remain limited, and general-purpose models often underperform in domain knowledge extraction and structured output generation in such low-resource settings. To address these issues, this study proposes a supervised fine-tuning approach for enhancing domain-specific information extraction. First, we design a fine-grained,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Big Data and Digital Economy
