Training Language Models on the Knowledge Graph: Insights on   Hallucinations and Their Detectability

Jiri Hron; Laura Culp; Gamaleldin Elsayed; Rosanne Liu; Ben Adlam,; Maxwell Bileschi; Bernd Bohnet; JD Co-Reyes; Noah Fiedel; C. Daniel Freeman,; Izzeddin Gur; Kathleen Kenealy; Jaehoon Lee; Peter J. Liu; Gaurav Mishra,; Igor Mordatch; Azade Nova; Roman Novak; Aaron Parisi; Jeffrey Pennington,; Alex Rizkowsky; Isabelle Simpson; Hanie Sedghi; Jascha Sohl-dickstein; Kevin; Swersky; Sharad Vikram; Tris Warkentin; Lechao Xiao; Kelvin Xu; Jasper Snoek,; Simon Kornblith

arXiv:2408.07852·cs.CL·August 16, 2024·2 cites

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Jiri Hron, Laura Culp, Gamaleldin Elsayed, Rosanne Liu, Ben Adlam,, Maxwell Bileschi, Bernd Bohnet, JD Co-Reyes, Noah Fiedel, C. Daniel Freeman,, Izzeddin Gur, Kathleen Kenealy, Jaehoon Lee, Peter J. Liu, Gaurav Mishra,, Igor Mordatch, Azade Nova, Roman Novak, Aaron Parisi

PDF

Open Access

TL;DR

This study investigates how the size and training duration of language models affect hallucinations, especially those verifiable from training data, revealing that larger models hallucinate less but are harder to detect.

Contribution

It introduces a knowledge graph-based dataset to control training data and systematically analyzes hallucination frequency and detectability across different model scales.

Findings

01

Larger models hallucinate less on fixed datasets.

02

Hallucinating on ≤5% of data requires significantly larger models.

03

Detectability of hallucinations decreases as model size increases.

Abstract

While many capabilities of language models (LMs) improve with increased training budget, the influence of scale on hallucinations is not yet fully understood. Hallucinations come in many forms, and there is no universally accepted definition. We thus focus on studying only those hallucinations where a correct answer appears verbatim in the training set. To fully control the training data content, we construct a knowledge graph (KG)-based dataset, and use it to train a set of increasingly large LMs. We find that for a fixed dataset, larger and longer-trained LMs hallucinate less. However, hallucinating on $\leq 5$ % of the training data requires an order of magnitude larger model, and thus an order of magnitude more compute, than Hoffmann et al. (2022) reported was optimal. Given this costliness, we study how hallucination detectors depend on scale. While we see detector size improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing

MethodsSparse Evolutionary Training · Focus