Improving LLM Performance Through Black-Box Online Tuning: A Case for Adding System Specs to Factsheets for Trusted AI

Yonas Atinafu; Henry Lin; Robin Cohen

arXiv:2603.11340·cs.AI·March 13, 2026

Improving LLM Performance Through Black-Box Online Tuning: A Case for Adding System Specs to Factsheets for Trusted AI

Yonas Atinafu, Henry Lin, Robin Cohen

PDF

Open Access

TL;DR

This paper introduces a black-box online tuning method for LLMs that optimizes performance using only end-to-end measurements, emphasizing the importance of system specs in AI Factsheets for trustworthiness.

Contribution

It proposes a novel black-box controller for LLM performance tuning that does not require internal system access, enhancing AI system transparency and reliability.

Findings

01

Empirical evidence supports the effectiveness of the black-box controller.

02

Highlights the role of system specs in AI Factsheets for trust and sustainability.

03

Demonstrates improved LLM throughput through online tuning.

Abstract

In this paper, we present a novel black-box online controller that uses only end-to-end measurements over short segments, without internal instrumentation, and hill climbing to maximize goodput, defined as the throughput of requests that satisfy the service-level objective. We provide empirical evidence that this design is well-founded. Using this advance in LLM serving as a concrete example, we then discuss the importance of integrating system performance and sustainability metrics into Factsheets for organizations adopting AI systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Software Engineering Research · Security and Verification in Computing