TransformLLM: Adapting Large Language Models via LLM-Transformed Reading   Comprehension Text

Iftach Arbel; Yehonathan Refael; Ofir Lindenbaum

arXiv:2410.21479·cs.CL·October 30, 2024

TransformLLM: Adapting Large Language Models via LLM-Transformed Reading Comprehension Text

Iftach Arbel, Yehonathan Refael, Ofir Lindenbaum

PDF

Open Access

TL;DR

This paper introduces a novel method for adapting large language models to specific domains by converting raw data into reading comprehension texts, demonstrated effectively in legal applications with improved performance and efficiency.

Contribution

The authors propose a domain-adaptive pre-training approach using LLMs to transform data into reading comprehension format, enhancing domain-specific model performance without extensive fine-tuning.

Findings

01

Legal LLMs outperform larger models on benchmarks

02

Continued pre-training on domain texts improves accuracy

03

Method is scalable to other domains

Abstract

Large Language Models (LLMs) have shown promise in highly-specialized domains, however challenges are still present in aspects of accuracy and costs. These limitations restrict the usage of existing models in domain-specific tasks. While fine-tuning pre-trained models have shown promising results, this process can be computationally expensive and require massive datasets of the specialized application in hand. In this work, we bridge that gap. We have developed Phi-2-Legal and Mistral-Legal-7B, which are language models specifically designed for legal applications. These models are based on Phi-2 and Mistral-7B-v0.1, and have gone through continued pre-training with over 500 million tokens of legal texts. Our innovative approach significantly improves capabilities in legal tasks by using Large Language Models (LLMs) to convert raw training data into reading comprehension text. Our legal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling