Online Prompt Pricing based on Combinatorial Multi-Armed Bandit and Hierarchical Stackelberg Game

Meiling Li; Hongrun Ren; Haixu Xiong; Zhenxing Qian; Xinpeng Zhang

arXiv:2405.15154·cs.AI·September 9, 2025

Online Prompt Pricing based on Combinatorial Multi-Armed Bandit and Hierarchical Stackelberg Game

Meiling Li, Hongrun Ren, Haixu Xiong, Zhenxing Qian, Xinpeng Zhang

PDF

Open Access

TL;DR

This paper introduces an innovative online prompt pricing system for prompt bundle trading that leverages combinatorial multi-armed bandit and hierarchical Stackelberg game models to optimize profits among consumers, platforms, and sellers.

Contribution

It proposes a novel, flexible pricing mechanism for prompt bundle trading based on advanced game theory and bandit algorithms, addressing real-world transaction needs.

Findings

01

Effective price-setting standard demonstrated on simulated dataset

02

Achieves profit satisfaction for consumers, platforms, and sellers

03

More adaptable than fixed pricing models

Abstract

Generation models have shown promising performance in various tasks, making trading around machine learning models possible. In this paper, we aim at a novel prompt trading scenario, prompt bundle trading (PBT) system, and propose an online pricing mechanism. Based on the combinatorial multi-armed bandit (CMAB) and three-stage hierarchical Stackelburg (HS) game, our pricing mechanism considers the profits of the consumer, platform, and seller, simultaneously achieving the profit satisfaction of these three participants. We break down the pricing issue into two steps, namely unknown category selection and incentive strategy optimization. The former step is to select a set of categories with the highest qualities, and the latter is to derive the optimal strategy for each participant based on the chosen categories. Unlike the existing fixed pricing mode, the PBT pricing mechanism we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques

MethodsSparse Evolutionary Training