Online Training of Large Language Models: Learn while chatting

Juhao Liang; Ziwei Wang; Zhuoheng Ma; Jianquan Li; Zhiyi Zhang,; Xiangbo Wu; Benyou Wang

arXiv:2403.04790·cs.CL·March 11, 2024·2 cites

Online Training of Large Language Models: Learn while chatting

Juhao Liang, Ziwei Wang, Zhuoheng Ma, Jianquan Li, Zhiyi Zhang,, Xiangbo Wu, Benyou Wang

PDF

Open Access

TL;DR

This paper proposes a new online training paradigm for Large Language Models that enables real-time, persistent updates and customization through external interactions, addressing current limitations in flexibility and user accessibility.

Contribution

It introduces a novel interaction paradigm that combines persistent online training with external knowledge sources for improved LLM customization.

Findings

01

Enables real-time model updates during user interactions

02

Allows personalized model customization via external knowledge bases

03

Improves flexibility and user accessibility in LLM training

Abstract

Large Language Models(LLMs) have dramatically revolutionized the field of Natural Language Processing(NLP), offering remarkable capabilities that have garnered widespread usage. However, existing interaction paradigms between LLMs and users are constrained by either inflexibility, limitations in customization, or a lack of persistent learning. This inflexibility is particularly evident as users, especially those without programming skills, have restricted avenues to enhance or personalize the model. Existing frameworks further complicate the model training and deployment process due to their computational inefficiencies and lack of user-friendly interfaces. To overcome these challenges, this paper introduces a novel interaction paradigm-'Online Training using External Interactions'-that merges the benefits of persistent, real-time model updates with the flexibility for individual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Intelligent Tutoring Systems and Adaptive Learning