Exploring the Effectiveness of GPT Models in Test-Taking: A Case Study   of the Driver's License Knowledge Test

Saba Rahimi; Tucker Balch; Manuela Veloso

arXiv:2308.11827·cs.CL·August 24, 2023

Exploring the Effectiveness of GPT Models in Test-Taking: A Case Study of the Driver's License Knowledge Test

Saba Rahimi, Tucker Balch, Manuela Veloso

PDF

Open Access

TL;DR

This study demonstrates how GPT models can be enhanced with external context to improve test-taking accuracy, achieving a 96% score on a driver's license exam, but still facing limitations in certain questions.

Contribution

The paper introduces a method for integrating external contextual information into GPT models to improve their question-answering performance on specific knowledge tests.

Findings

01

GPT-3 achieved 96% accuracy with context, 82% without.

02

Providing context improves answer accuracy but does not eliminate errors.

03

Prompt length and format influence model performance.

Abstract

Large language models such as Open AI's Generative Pre-trained Transformer (GPT) models are proficient at answering questions, but their knowledge is confined to the information present in their training data. This limitation renders them ineffective when confronted with questions about recent developments or non-public documents. Our research proposes a method that enables GPT models to answer questions by employing context from an information source not previously included in their training data. The methodology includes preprocessing of contextual information, the embedding of contexts and queries, constructing prompt through the integration of context embeddings, and generating answers using GPT models. We applied this method in a controlled test scenario using the California Driver's Handbook as the information source. The GPT-3 model achieved a 96% passing score on a set of 50…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)

MethodsLib · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Cosine Annealing · Position-Wise Feed-Forward Layer · Linear Layer · Dense Connections · Weight Decay · Absolute Position Encodings