Strengthened Symbol Binding Makes Large Language Models Reliable   Multiple-Choice Selectors

Mengge Xue; Zhenyu Hu; Liqun Liu; Kuo Liao; Shuang Li; Honglin Han,; Meng Zhao; Chengguo Yin

arXiv:2406.01026·cs.CL·June 7, 2024

Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors

Mengge Xue, Zhenyu Hu, Liqun Liu, Kuo Liao, Shuang Li, Honglin Han,, Meng Zhao, Chengguo Yin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper addresses the persistent selection bias in Large Language Models during supervised fine-tuning for multiple-choice questions by enhancing their symbol binding ability through a novel training algorithm, leading to improved accuracy.

Contribution

It introduces a new SFT algorithm called Point-wise Intelligent Feedback (PIF) that improves symbol-content binding and reduces selection bias in LLMs for MCQs.

Findings

01

PIF significantly reduces selection bias in LLMs.

02

Model accuracy on MCQs improves markedly with PIF.

03

Enhanced symbol binding correlates with better model performance.

Abstract

Multiple-Choice Questions (MCQs) constitute a critical area of research in the study of Large Language Models (LLMs). Previous works have investigated the selection bias problem in MCQs within few-shot scenarios, in which the LLM's performance may be influenced by the presentation of answer choices, leaving the selection bias during Supervised Fine-Tuning (SFT) unexplored. In this paper, we reveal that selection bias persists in the SFT phase , primarily due to the LLM's inadequate Multiple Choice Symbol Binding (MCSB) ability. This limitation implies that the model struggles to associate the answer options with their corresponding symbols (e.g., A/B/C/D) effectively. To enhance the model's MCSB capability, we first incorporate option contents into the loss function and subsequently adjust the weights of the option symbols and contents, guiding the model to understand the option content…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

berryxue/PIF
noneOfficial

Videos

Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms

MethodsShrink and Fine-Tune