Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand

Kshitija Taywade; Brent Harrison; Judy Goldsmith

arXiv:2201.00486·cs.LG·January 4, 2022

Open Access

TL;DR

This paper introduces a novel adaptive epsilon-greedy algorithm for non-stationary multi-armed bandit problems in repeated Cournot games, enabling agents to adapt to changing market demands and identify new optimal actions effectively.

Contribution

The paper proposes the Adaptive with Weighted Exploration (AWE) ϵ-greedy algorithm, which detects demand changes and adjusts its exploration and learning rates in response, improving decision-making in non-stationary Cournot game environments.

Findings

1. Agents swiftly adapt to demand changes.

2. The approach facilitates emergence of collusive behavior.

3. Scalability is demonstrated with multiple agents and large action spaces.

Abstract

Many past attempts at modeling repeated Cournot games assume that demand is stationary. This does not align with real-world scenarios in which market demands can evolve over a product's lifetime for a myriad of reasons. In this paper, we model repeated Cournot games with non-stationary demand such that firms/agents face separate instances of a non-stationary multi-armed bandit problem. The set of arms/actions that an agent can choose from represents discrete production quantities; here, the action space is ordered. Agents are independent and autonomous, and cannot observe anything from the environment; they can only see their own rewards after taking an action, and only work towards maximizing these rewards. We propose a novel algorithm, 'Adaptive with Weighted Exploration (AWE) $\epsilon$-greedy', which is remotely based on the well-known $\epsilon$-greedy approach. This algorithm detects…
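
The setting described in the abstract can be illustrated with a minimal sketch. This is not the AWE algorithm: it uses plain ϵ-greedy with a constant step size, a standard way to track drifting rewards, swapped in only to show the bandit framing. The linear inverse demand, the unit cost, the demand intercepts (10 and 16), the shift round, and all function and class names are assumptions made for illustration and do not come from the paper.

```python
import random


def cournot_profit(q_own, q_total, intercept, slope=1.0, unit_cost=1.0):
    """Profit under an assumed linear inverse demand P = max(0, intercept - slope * Q)."""
    price = max(0.0, intercept - slope * q_total)
    return (price - unit_cost) * q_own


class EpsilonGreedyAgent:
    """Plain epsilon-greedy learner with a constant step size (not AWE)."""

    def __init__(self, n_actions, epsilon=0.1, step_size=0.1, seed=None):
        self.values = [0.0] * n_actions  # one estimate per production quantity
        self.epsilon = epsilon
        self.step_size = step_size
        self.rng = random.Random(seed)

    def act(self):
        # Explore uniformly with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.values))
        return self.values.index(max(self.values))

    def update(self, action, reward):
        # A constant step size weights recent rewards more heavily,
        # so estimates can follow a change in demand.
        self.values[action] += self.step_size * (reward - self.values[action])


def simulate(n_rounds=4000, n_agents=2, n_actions=11, shift_at=2000, seed=0):
    """Repeated game in which each agent observes only its own profit."""
    agents = [EpsilonGreedyAgent(n_actions, seed=seed + i) for i in range(n_agents)]
    history = []
    for t in range(n_rounds):
        # Hypothetical demand shift: the intercept jumps once at round shift_at.
        intercept = 10.0 if t < shift_at else 16.0
        quantities = [agent.act() for agent in agents]  # action index = quantity
        total = sum(quantities)
        for agent, q in zip(agents, quantities):
            agent.update(q, cournot_profit(q, total, intercept))
        history.append(quantities)
    return history


if __name__ == "__main__":
    history = simulate()
    print("last joint action before the shift:", history[1999])
    print("last joint action of the run:", history[-1])
```

Each agent here treats every discrete quantity as an arm and never sees the other firm's choice or the demand curve, matching the information structure in the abstract. How AWE ϵ-greedy detects a demand change and reweights exploration is not covered by this sketch.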

Taxonomy

Topics: Advanced Bandit Algorithms Research · Auction Theory and Applications · Game Theory and Applications