Effective and Efficient Schema-aware Information Extraction Using On-Device Large Language Models

Zhihao Wen; Sheng Liang; Yaxiong Wu; Yongyue Zhang; Yong Liu

arXiv:2505.14992·cs.CL·May 22, 2025

Effective and Efficient Schema-aware Information Extraction Using On-Device Large Language Models

Zhihao Wen, Sheng Liang, Yaxiong Wu, Yongyue Zhang, Yong Liu

PDF

Open Access

TL;DR

This paper introduces DLISC, a two-stage on-device LLM approach for schema-aware information extraction that improves accuracy and speed by schema identification and incremental caching.

Contribution

The paper proposes DLISC, a novel two-stage method with schema caching for effective and efficient on-device information extraction using large language models.

Findings

01

Significant accuracy improvements over baseline methods.

02

Substantial reduction in inference latency.

03

Enhanced schema relevance retrieval performance.

Abstract

Information extraction (IE) plays a crucial role in natural language processing (NLP) by converting unstructured text into structured knowledge. Deploying computationally intensive large language models (LLMs) on resource-constrained devices for information extraction is challenging, particularly due to issues like hallucinations, limited context length, and high latency-especially when handling diverse extraction schemas. To address these challenges, we propose a two-stage information extraction approach adapted for on-device LLMs, called Dual-LoRA with Incremental Schema Caching (DLISC), which enhances both schema identification and schema-aware extraction in terms of effectiveness and efficiency. In particular, DLISC adopts an Identification LoRA module for retrieving the most relevant schemas to a given query, and an Extraction LoRA module for performing information extraction based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Web Data Mining and Analysis