ADO: Automatic Data Optimization for Inputs in LLM Prompts

Sam Lin; Wenyue Hua; Lingyao Li; Zhenting Wang; Yongfeng Zhang

arXiv:2502.11436·cs.LG·February 18, 2025

ADO: Automatic Data Optimization for Inputs in LLM Prompts

Sam Lin, Wenyue Hua, Lingyao Li, Zhenting Wang, Yongfeng Zhang

PDF

Open Access 1 Video

TL;DR

This paper presents ADO, a method for optimizing input data in prompts to improve LLM performance through content engineering and structural reformulation.

Contribution

It introduces a novel two-step approach for input data optimization, focusing on content and structure, to enhance LLM prompt effectiveness.

Findings

01

Optimized input data improves LLM task performance

02

Content engineering and reformulation significantly boost results

03

Open-source code available for replication

Abstract

This study explores a novel approach to enhance the performance of Large Language Models (LLMs) through the optimization of input data within prompts. While previous research has primarily focused on refining instruction components and augmenting input data with in-context examples, our work investigates the potential benefits of optimizing the input data itself. We introduce a two-pronged strategy for input data optimization: content engineering and structural reformulation. Content engineering involves imputing missing values, removing irrelevant attributes, and enriching profiles by generating additional information inferred from existing attributes. Subsequent to content engineering, structural reformulation is applied to optimize the presentation of the modified content to LLMs, given their sensitivity to input format. Our findings suggest that these optimizations can significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

ADO: Automatic Data Optimization for Inputs in LLM Prompts· underline

Taxonomy

TopicsMagnetic confinement fusion research · Nuclear reactor physics and engineering · Advanced Radiotherapy Techniques