Assessing GPT Model Uncertainty in Mathematical OCR Tasks via Entropy   Analysis

Alexei Kaltchenko

arXiv:2412.01221·cs.IT·December 10, 2024

Assessing GPT Model Uncertainty in Mathematical OCR Tasks via Entropy Analysis

Alexei Kaltchenko

PDF

Open Access 1 Repo

TL;DR

This paper explores how GPT models' uncertainty in recognizing mathematical equations from images varies with image quality, using entropy analysis to quantify confidence and improve OCR accuracy.

Contribution

It introduces an entropy-based framework to assess GPT model uncertainty in mathematical OCR, linking image resolution to recognition confidence.

Findings

01

Higher image resolution reduces GPT uncertainty and improves accuracy.

02

Entropy correlates with recognition errors, serving as a confidence measure.

03

Practical insights for optimizing image quality in GPT-based OCR applications.

Abstract

This paper investigates the uncertainty of Generative Pre-trained Transformer (GPT) models in extracting mathematical equations from images of varying resolutions and converting them into LaTeX code. We employ concepts of entropy and mutual information to examine the recognition process and assess the model's uncertainty in this Optical Character Recognition (OCR) task. By analyzing the conditional entropy of the output token sequences, we provide both theoretical insights and practical measurements of the GPT model's performance given different image qualities. Our experimental results, obtained using a Python implementation available on GitHub, demonstrate a clear relationship between image resolution and GPT model uncertainty. Higher-resolution images lead to lower entropy values, indicating reduced uncertainty and improved accuracy in the recognized LaTeX code. Conversely,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Alexei-WLU/gpt-ocr-entropy-analysis
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning