AI capabilities can be significantly improved without expensive   retraining

Tom Davidson; Jean-Stanislas Denain; Pablo Villalobos; Guillem Bas

arXiv:2312.07413·cs.AI·December 13, 2023·6 cites

AI capabilities can be significantly improved without expensive retraining

Tom Davidson, Jean-Stanislas Denain, Pablo Villalobos, Guillem Bas

PDF

Open Access

TL;DR

Post-training enhancements significantly boost AI performance without costly retraining, offering a cost-effective way to improve capabilities across various tasks.

Contribution

The paper categorizes recent post-training enhancement techniques and quantifies their benefits in terms of compute-equivalent gains.

Findings

01

Most enhancements improve performance by over 5x in compute terms

02

Some enhancements achieve more than 20x improvements

03

Fine-tuning costs are typically less than 1% of original training costs

Abstract

State-of-the-art AI systems can be significantly improved without expensive retraining via "post-training enhancements"-techniques applied after initial training like fine-tuning the system to use a web browser. We review recent post-training enhancements, categorizing them into five types: tool-use, prompting methods, scaffolding, solution selection, and data generation. Different enhancements improve performance on different tasks, making it hard to compare their significance. So we translate improvements from different enhancements into a common currency, the compute-equivalent gain: how much additional training compute would be needed to improve performance by the same amount as the enhancement. Our non-experimental work shows that post-training enhancements have significant benefits: most surveyed enhancements improve benchmark performance by more than a 5x increase in training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Software Engineering Research