Online Adaptation of Language Models with a Memory of Amortized Contexts

Jihoon Tack; Jaehyung Kim; Eric Mitchell; Jinwoo Shin; Yee Whye Teh; Jonathan Richard Schwarz

arXiv:2403.04317 · cs.LG · November 5, 2024 · 1 citation

TL;DR

This paper introduces Memory of Amortized Contexts (MAC), a novel online adaptation framework for large language models that efficiently incorporates new information through memory augmentation and meta-learning, enhancing knowledge retention and adaptability.

Contribution

The paper presents MAC, a memory-augmented, amortization-based meta-learning method enabling efficient online adaptation of LLMs without gradient updates, improving performance and memory use.

Findings

01. MAC outperforms existing methods in online adaptation tasks.

02. MAC is more time and memory efficient than traditional approaches.

03. MAC enhances retrieval-augmented generation performance.

Abstract

Due to the rapid generation and dissemination of information, large language models (LLMs) quickly run out of date despite enormous development costs. To address the crucial need to keep models updated, online learning has emerged as a critical tool when utilizing LLMs for real-world applications. However, given the ever-expanding corpus of unseen documents and the large parameter space of modern LLMs, efficient adaptation is essential. To address these challenges, we propose Memory of Amortized Contexts (MAC), an efficient and effective online adaptation framework for LLMs with strong knowledge retention. We propose a feature extraction and memory-augmentation approach to compress and extract information from new documents into compact modulations stored in a memory bank. When answering questions, our model attends to and extracts relevant knowledge from this memory bank. To learn…
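The memory-bank idea in the abstract can be pictured with a toy sketch. This is not the authors' implementation: `toy_encode` is a hypothetical stand-in for the learned network that compresses a document into a modulation, and `MemoryBank.aggregate` assumes plain scaled dot-product attention as the way a question "attends to" stored modulations. It only illustrates the store-then-attend pattern, with no gradient updates at adaptation time.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def toy_encode(token_ids: List[int], dim: int = 4) -> Vector:
    """Hypothetical encoder: bucket token ids into `dim` bins, then L2-normalize.

    Stands in for a learned amortization network; not the paper's encoder.
    """
    vec = [0.0] * dim
    for token in token_ids:
        vec[token % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


class MemoryBank:
    """Stores one compact modulation vector per document."""

    def __init__(self) -> None:
        self.modulations: List[Vector] = []

    def add(self, modulation: Vector) -> None:
        # Adapting to a new document is a single append; no weights change.
        self.modulations.append(list(modulation))

    def aggregate(self, query: Vector) -> Vector:
        """Attention-weighted average of stored modulations (assumed form)."""
        if not self.modulations:
            raise ValueError("memory bank is empty")
        scale = math.sqrt(len(query))
        scores = [
            sum(q * m for q, m in zip(query, mod)) / scale
            for mod in self.modulations
        ]
        weights = softmax(scores)
        dim = len(self.modulations[0])
        return [
            sum(w * mod[i] for w, mod in zip(weights, self.modulations))
            for i in range(dim)
        ]


bank = MemoryBank()
bank.add(toy_encode([0, 4, 8]))   # document A
bank.add(toy_encode([1, 5, 9]))   # document B
question = toy_encode([0, 4])     # question resembling document A
result = bank.aggregate(question)
```

In this toy setup the aggregated vector leans toward document A, because the question's encoding overlaps with it. How the aggregated modulation is then applied to the language model is not covered by this sketch.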

Code & Models

Repositories

jihoontack/mac (PyTorch, official)

Videos

Online Adaptation of Language Models with a Memory of Amortized Contexts · SlidesLive

Taxonomy

Topics: Topic Modeling