Inferring Strategies from Observations in Long Iterated Prisoner's Dilemma Experiments
Eladio Montero-Porras, Jelena Grujic, Elias Fernandez-Domingos, Tom, Lenaerts

TL;DR
This study investigates human strategies in long Iterated Prisoner's Dilemma experiments with fixed and shuffled partners, revealing how interaction patterns influence strategy development and cooperation.
Contribution
It introduces long-term experiments with different partner matching strategies and applies unsupervised clustering and Hidden Markov Models to infer strategies from human behavior.
Findings
Fixed partners promote behavioral self-organization and cooperation.
Shuffled partners entangle strategy subgroups, hindering cooperation.
Long experiments (over 25 rounds) are necessary to observe stable strategies.
Abstract
While many theoretical studies have revealed the strategies that could lead to and maintain cooperation in the Iterated Prisoner's Dilemma, less is known about what human participants actually do in this game and how strategies change when being confronted with anonymous partners in each round. Previous attempts used short experiments, made different assumptions of possible strategies, and led to very different conclusions. We present here two long treatments that differ in the partner matching strategy used, i.e. fixed or shuffled partners. Here we use unsupervised methods to cluster the players based on their actions and then Hidden Markov Model to infer what are those strategies in each cluster. Analysis of the inferred strategies reveals that fixed partner interaction leads to a behavioral self-organization. Shuffled partners generate subgroups of strategies that remain entangled,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
