Online Prompt Pricing based on Combinatorial Multi-Armed Bandit and Hierarchical Stackelberg Game
Meiling Li, Hongrun Ren, Haixu Xiong, Zhenxing Qian, Xinpeng Zhang

TL;DR
This paper introduces an innovative online prompt pricing system for prompt bundle trading that leverages combinatorial multi-armed bandit and hierarchical Stackelberg game models to optimize profits among consumers, platforms, and sellers.
Contribution
It proposes a novel, flexible pricing mechanism for prompt bundle trading based on advanced game theory and bandit algorithms, addressing real-world transaction needs.
Findings
Effective price-setting standard demonstrated on simulated dataset
Achieves profit satisfaction for consumers, platforms, and sellers
More adaptable than fixed pricing models
Abstract
Generation models have shown promising performance in various tasks, making trading around machine learning models possible. In this paper, we aim at a novel prompt trading scenario, prompt bundle trading (PBT) system, and propose an online pricing mechanism. Based on the combinatorial multi-armed bandit (CMAB) and three-stage hierarchical Stackelburg (HS) game, our pricing mechanism considers the profits of the consumer, platform, and seller, simultaneously achieving the profit satisfaction of these three participants. We break down the pricing issue into two steps, namely unknown category selection and incentive strategy optimization. The former step is to select a set of categories with the highest qualities, and the latter is to derive the optimal strategy for each participant based on the chosen categories. Unlike the existing fixed pricing mode, the PBT pricing mechanism we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques
MethodsSparse Evolutionary Training
