Table Comprehension in Building Codes using Vision Language Models and Domain-Specific Fine-Tuning
Mohammad Aqib, Mohd Hamza, Ying Hei Chui, Qipei Mei

TL;DR
This paper investigates methods for extracting information from building code tables using Vision Language Models, comparing direct and indirect input approaches, and demonstrates significant accuracy improvements through domain-specific fine-tuning.
Contribution
It introduces a novel domain-specific fine-tuning approach for VLMs using LoRA, enhancing their ability to interpret complex tabular data in building codes.
Findings
Direct input method outperforms indirect input in accuracy
Fine-tuning with LoRA significantly improves model performance
Qwen2.5-VL-3B-Instruct achieves over 100% accuracy gain
Abstract
Building codes contain critical information for ensuring safety, regulatory compliance, and informed decision-making in construction and engineering. Automated question answering systems over such codes enable quick and accurate access to specific regulatory clauses, improving efficiency and reducing errors. Retrieval-Augmented Generation (RAG) systems are essential for this task as they combine the precision of information retrieval with the generative capabilities of language models. However, tabular data are challenging to extract as they often involve complex layouts, merged cells, multi-row headers, and embedded semantic relationships that are not easily captured by traditional natural language processing techniques and Vision Language Models (VLMs). This paper explores and compares two methods for extracting information from tabular data in building codes using several pre-trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Handwritten Text Recognition Techniques · BIM and Construction Integration
