Effective and Efficient Schema-aware Information Extraction Using On-Device Large Language Models
Zhihao Wen, Sheng Liang, Yaxiong Wu, Yongyue Zhang, Yong Liu

TL;DR
This paper introduces DLISC, a two-stage on-device LLM approach for schema-aware information extraction that improves accuracy and speed by schema identification and incremental caching.
Contribution
The paper proposes DLISC, a novel two-stage method with schema caching for effective and efficient on-device information extraction using large language models.
Findings
Significant accuracy improvements over baseline methods.
Substantial reduction in inference latency.
Enhanced schema relevance retrieval performance.
Abstract
Information extraction (IE) plays a crucial role in natural language processing (NLP) by converting unstructured text into structured knowledge. Deploying computationally intensive large language models (LLMs) on resource-constrained devices for information extraction is challenging, particularly due to issues like hallucinations, limited context length, and high latency-especially when handling diverse extraction schemas. To address these challenges, we propose a two-stage information extraction approach adapted for on-device LLMs, called Dual-LoRA with Incremental Schema Caching (DLISC), which enhances both schema identification and schema-aware extraction in terms of effectiveness and efficiency. In particular, DLISC adopts an Identification LoRA module for retrieving the most relevant schemas to a given query, and an Extraction LoRA module for performing information extraction based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Web Data Mining and Analysis
