Improving Self Consistency in LLMs through Probabilistic Tokenization

Ashutosh Sathe; Divyanshu Aggarwal; Sunayana Sitaram

arXiv:2407.03678·cs.CL·July 8, 2024

TL;DR

This paper improves the self-consistency of large language models on reasoning tasks by using multiple tokenizations of the same input, a capability that modern LLM tokenizers already have but that has been underutilized.

Contribution

The paper proposes a new method to utilize probabilistic tokenizations in LLMs, enhancing their reasoning consistency and generating more logically diverse reasoning paths.

Findings

1. Probabilistic tokenization improves LLM self-consistency.

2. Enhanced reasoning diversity beyond surface-level linguistic variation.

3. Consistent improvements across 5 LLMs and 4 benchmarks.

Abstract

Prior research has demonstrated noticeable performance gains through the use of probabilistic tokenizations, an approach that involves employing multiple tokenizations of the same input string during the training phase of a language model. Despite these promising findings, modern large language models (LLMs) have yet to be trained using probabilistic tokenizations. Interestingly, while the tokenizers of these contemporary LLMs have the capability to generate multiple tokenizations, this property remains underutilized. In this work, we propose a novel method to leverage the multiple tokenization capabilities of modern LLM tokenizers, aiming to enhance the self-consistency of LLMs in reasoning tasks. Our experiments indicate that when utilizing probabilistic tokenizations, LLMs generate logically diverse reasoning paths, moving beyond mere surface-level linguistic diversity. We carefully…
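
The idea of "multiple tokenizations of the same input string" can be illustrated with a minimal sketch. This is not the authors' implementation: the toy vocabulary `VOCAB`, the uniform sampling over segmentations, and the helper names `all_tokenizations`, `sample_tokenization`, and `majority_vote` are all assumptions made for illustration. The abstract does not say how tokenizations are sampled or how answers are combined, so majority voting over final answers is used here as the common self-consistency aggregation.

```python
import random
from collections import Counter

# Hypothetical toy vocabulary (an assumption for illustration only).
# Real LLM tokenizers have vocabularies with tens of thousands of entries.
VOCAB = {"un", "able", "u", "n", "a", "b", "l", "e"}


def all_tokenizations(text, vocab):
    """Return every way to split `text` into pieces that are all in `vocab`."""
    if not text:
        return [[]]
    results = []
    for end in range(1, len(text) + 1):
        piece = text[:end]
        if piece in vocab:
            for rest in all_tokenizations(text[end:], vocab):
                results.append([piece] + rest)
    return results


def sample_tokenization(text, vocab, rng):
    """Sample one segmentation uniformly at random.

    Uniform sampling is an assumption; a real tokenizer would typically
    weight segmentations by its own scores.
    """
    return rng.choice(all_tokenizations(text, vocab))


def majority_vote(answers):
    """Aggregate final answers by picking the most frequent one."""
    return Counter(answers).most_common(1)[0][0]


if __name__ == "__main__":
    rng = random.Random(0)
    for _ in range(3):
        # Each sample is a different token sequence for the same string.
        print(sample_tokenization("unable", VOCAB, rng))
```

In a full pipeline, each sampled token sequence would be passed to the model to produce a reasoning path, and `majority_vote` would then be applied to the final answers extracted from those paths. The model call is left out here because the source does not describe it.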

Taxonomy

Topics: Business Process Modeling and Analysis · Semantic Web and Ontologies