Mining the Explainability and Generalization: Fact Verification Based on   Self-Instruction

Guangyao Lu; Yulin Liu

arXiv:2405.12579·cs.CL·May 24, 2024

Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction

Guangyao Lu, Yulin Liu

PDF

Open Access

TL;DR

This paper introduces a self-instruction based fine-tuning method for fact verification using LLMs, balancing accuracy, explainability, and data security, and demonstrating strong results on challenging datasets.

Contribution

It pioneers the use of self-supervised learning with contrastive learning and improved DPO for fact-checking, enhancing accuracy and generalization of LLMs.

Findings

01

Achieves comparable or better accuracy than traditional fine-tuning methods.

02

Generates fluent and high-quality explanations.

03

Shows high generalization performance on fact-checking datasets.

Abstract

Fact-checking based on commercial LLMs has become mainstream. Although these methods offer high explainability, it falls short in accuracy compared to traditional fine-tuning approaches, and data security is also a significant concern. In this paper, we propose a self-instruction based fine-tuning approach for fact-checking that balances accuracy and explainability. Our method consists of Data Augmentation and Improved DPO fine-tuning. The former starts by instructing the model to generate both positive and negative explanations based on claim-evidence pairs and labels, then sampling the dataset according to our customized difficulty standards. The latter employs our proposed improved DPO to fine-tune the model using the generated samples. We fine-tune the smallest-scale LLaMA-7B model and evaluate it on the challenging fact-checking datasets FEVEROUS and HOVER, utilizing four…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsDirect Preference Optimization · Contrastive Learning