A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction
Bogdan Bogachov, Yaoyao Fiona Zhao

TL;DR
This paper presents a lightweight, graph-based multi-expert language model system called SLG that improves engineering information extraction by reducing computational costs and hallucination issues, outperforming traditional fine-tuning methods.
Contribution
The introduction of the Small Language Graph (SLG), a novel lightweight, graph-structured adaptation method with small expert models for efficient and accurate engineering text generation.
Findings
SLG surpasses conventional fine-tuning on Exact Match by 3x.
Fine-tuning with SLG is 1.7x faster than larger models.
SLG enables cost-effective deployment for small to medium engineering firms.
Abstract
Despite recent advancements in domain adaptation techniques for large language models, these methods remain computationally intensive, and the resulting models can still exhibit hallucination issues. Most existing adaptation methods do not prioritize reducing the computational resources required for fine-tuning and inference of language models. Hallucination issues have gradually decreased with each new model release. However, they remain prevalent in engineering contexts, where generating well-structured text with minimal errors and inconsistencies is critical. This work introduces a novel approach called the Small Language Graph (SLG), which is a lightweight adaptation solution designed to address the two key challenges outlined above. The system is structured in the form of a graph, where each node represents a lightweight expert - a small language model fine-tuned on specific and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBIM and Construction Integration · AI-based Problem Solving and Planning · Semantic Web and Ontologies
