Optimizing Token Usage on Large Language Model Conversations Using the   Design Structure Matrix

Ramon Maria Garcia Alarcia; Alessandro Golkar

arXiv:2410.00749·cs.CL·October 2, 2024

Optimizing Token Usage on Large Language Model Conversations Using the Design Structure Matrix

Ramon Maria Garcia Alarcia, Alessandro Golkar

PDF

TL;DR

This paper introduces a novel approach using the Design Structure Matrix to optimize token usage in large language model conversations, reducing costs and improving efficiency by organizing dialogue and managing context windows effectively.

Contribution

It applies the Design Structure Matrix from engineering to LLM conversation management, offering a new methodology for token optimization and context organization.

Findings

01

DSM effectively organizes conversations to minimize token usage

02

Clustering and sequencing improve context window management

03

Method reduces token costs in LLM interactions

Abstract

As Large Language Models become ubiquitous in many sectors and tasks, there is a need to reduce token usage, overcoming challenges such as short context windows, limited output sizes, and costs associated with token intake and generation, especially in API-served LLMs. This work brings the Design Structure Matrix from the engineering design discipline into LLM conversation optimization. Applied to a use case in which the LLM conversation is about the design of a spacecraft and its subsystems, the DSM, with its analysis tools such as clustering and sequencing, demonstrates being an effective tool to organize the conversation, minimizing the number of tokens sent to or retrieved from the LLM at once, as well as grouping chunks that can be allocated to different context windows. Hence, this work broadens the current set of methodologies for token usage optimization and opens new avenues…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training