A rational measure of irrationality

Davide Carpentiere; Alfio Giarlotta; Stephen Watson

arXiv:2302.13656·econ.TH·March 2, 2023

A rational measure of irrationality

Davide Carpentiere, Alfio Giarlotta, Stephen Watson

PDF

Open Access

TL;DR

This paper introduces a systematic way to classify and measure the degree of irrationality in deterministic choice behaviors by defining a metric-based distance from rational benchmarks, including stochastic models.

Contribution

It proposes a novel metric for quantifying deviations from rational choice and extends this to stochastic irrationality using the random utility model as a benchmark.

Findings

01

Classifies all deterministic choice behaviors by their irrationality degree.

02

Introduces a metric to measure deviations from rationality.

03

Defines a stochastic irrationality measure using Block-Marschak polynomials.

Abstract

All possible types of deterministic choice behavior are classified by their degree of irrationality. This classification is performed in three steps: (1) select a benchmark of rationality, for which this degree is zero; (2) endow the set of choices with a metric to measure deviations from rationality; and (3) compute the distance of any choice behavior from the selected benchmark. The natural candidate for step 1 is the family of all rationalizable behaviors. A possible candidate for step 2 is a suitable variation of the metric described by Klamler (2008), which displays a sharp discerning power among different types of choice behaviors. In step 3 we use this new metric to establish the minimum distance of any choice behavior from the benchmark of rationality. Finally we describe a measure of stochastic irrationality, which employs the random utility model as a benchmark of rationality,…

Tables3

	${(c_{2})}_{A} \in choice (A)$	${(c_{2}^{'})}_{A} \in choice (A)$
$A = {x, y}$	$\underline{x}, \underline{y}, \underline{x} y$	$\underline{x}, \underline{y}, x \underline{y}$
$A = {x, z}$	$\underline{x}, \underline{z}, \underline{x} z$	$\underline{x}, \underline{z}, \underline{x} z$
$A = {y, z}$	$\underline{y}, \underline{z}, \underline{y} z$	$\underline{y}, \underline{z}, \underline{y} z$
$A = {x, y, z}$	$\underline{x}, \underline{y}, \underline{z}, x \underline{y}, \underline{x} z, \underline{y} z, x \underline{y} z$	$\underline{x}, \underline{y}, \underline{z}, x \underline{y}, \underline{x} z, \underline{y} z, x \underline{y} z$

Table 2. Table 1: The stochastic choice function p 1 subscript 𝑝 1 p_{1} and its BM polynomials: the entries in columns 1–4 give the probability that an item is chosen in a menu containing it, whereas the entries in columns 5–8 are the respective BM polynomials. (All empty entries stand for 0 0 .)

	$x$	$y$	$z$	$w$	$q_{x, .}$	$q_{y, .}$	$q_{z, .}$	$q_{w, .}$
${x}$	$1$				$0.5$
${y}$		$1$				$- 0.1$
${z}$			$1$				$0.1$
${w}$				$1$				$0.5$
${x, y}$	$0.5$	$0.5$			$- 0.4$	$0.3$
${x, z}$	$0.4$		$0.6$		$- 0.2$		$0.2$
${x, w}$	$0.9$			$0.1$	$0.2$			$0$
${y, z}$		$0.5$	$0.5$			$0$	$0.2$
${y, w}$		$0.7$		$0.3$		$0.4$		$0.1$
${z, w}$			$0.6$	$0.4$			$- 0.1$	$0.3$
${x, y, z}$	$0.6$	$0.3$	$0.1$		$0.2$	$0.1$	$- 0.1$
${x, y, w}$	$0.7$	$0.1$		$0.2$	$0.3$	$- 0.1$		$0$
${x, z, w}$	$0.4$		$0.5$	$0.1$	$0$		$0.3$	$- 0.1$
${y, z, w}$		$0.4$	$0.4$	$0.2$		$0.2$	$0.2$	$0$
${x, y, z, w}$	$0.4$	$0.2$	$0.2$	$0.2$	$0.4$	$0.2$	$0.2$	$0.2$

Table 3. Table 2: The stochastic choice function p 2 subscript 𝑝 2 p_{2} and its BM polynomials: all entries have the same meaning as in Table 1 .

	$x$	$y$	$z$	$w$	$q_{x, .}$	$q_{y, .}$	$q_{z, .}$	$q_{w, .}$
${x}$	$1$				$0.2$
${y}$		$1$				$0.2$
${z}$			$1$				$0.1$
${w}$				$1$				$0.5$
${x, y}$	$0.6$	$0.4$			$0$	$0.1$
${x, z}$	$0.5$		$0.5$		$0$		$0.1$
${x, w}$	$0.8$			$0.2$	$0.2$			$0$
${y, z}$		$0.4$	$0.6$			$- 0.1$	$0.2$
${y, w}$		$0.7$		$0.3$		$0.3$		$0$
${z, w}$			$0.6$	$0.4$			$0$	$0.2$
${x, y, z}$	$0.5$	$0.3$	$0.2$		$0$	$0.1$	$0$
${x, y, w}$	$0.6$	$0.2$		$0.2$	$0.1$	$0$		$0.1$
${x, z, w}$	$0.5$		$0.4$	$0.1$	$0$		$0.2$	$0$
${y, z, w}$		$0.4$	$0.4$	$0.2$		$0.2$	$0.2$	$0.1$
${x, y, z, w}$	$0.5$	$0.2$	$0.2$	$0.1$	$0.5$	$0.2$	$0.2$	$0.1$

Equations77

c (A) = max (A, ≻) = {x \in A : a ≻ x for no a \in A} \vspace - 0, 1 c m

c (A) = max (A, ≻) = {x \in A : a ≻ x for no a \in A} \vspace - 0, 1 c m

C (A) = max (A, \to) = {x \in A : a \to x for no a \in A} \vspace - 0, 1 c m

C (A) = max (A, \to) = {x \in A : a \to x for no a \in A} \vspace - 0, 1 c m

d_{\Delta}(C,C^{\prime})\coloneqq\sum_{S\in\mathscr{X}}\big{|}C(S)\,\Delta\,C^{\prime}(S)\big{|}\,.\vspace{-0,1cm}

d_{\Delta}(C,C^{\prime})\coloneqq\sum_{S\in\mathscr{X}}\big{|}C(S)\,\Delta\,C^{\prime}(S)\big{|}\,.\vspace{-0,1cm}

(c_{1})

(c_{1})

(c_{2})

(c_{3})

d_{Δ} (c_{1}, c_{2}) = ∣ c_{1} (X) Δ c_{2} (X) ∣ = 2 and d_{Δ} (c_{1}, c_{3}) = ∣ c_{1} (X) Δ c_{3} (X) ∣ = 2.

d_{Δ} (c_{1}, c_{2}) = ∣ c_{1} (X) Δ c_{2} (X) ∣ = 2 and d_{Δ} (c_{1}, c_{3}) = ∣ c_{1} (X) Δ c_{3} (X) ∣ = 2.

\underline{x} \underline{y}, \underline{x} z, \underline{x} w, \underline{y} \underline{z}, \underline{y} w, \underline{z} w, \underline{x} \underline{y} w, \underline{x} z w, \underline{y} \underline{z} w . \vspace - 0, 1 c m

\underline{x} \underline{y}, \underline{x} z, \underline{x} w, \underline{y} \underline{z}, \underline{y} w, \underline{z} w, \underline{x} \underline{y} w, \underline{x} z w, \underline{y} \underline{z} w . \vspace - 0, 1 c m

d_{Δ} (c_{1}, c_{2})

d_{Δ} (c_{1}, c_{2})

d_{Δ} (c_{1}, c_{3})

d_{Δ} (c_{1}, c_{4})

C_{S\mapsto T}(A)\coloneqq\left\{\begin{array}[]{lll}T&\text{if }A=S\\ \varnothing&\text{otherwise.}\end{array}\right.

C_{S\mapsto T}(A)\coloneqq\left\{\begin{array}[]{lll}T&\text{if }A=S\\ \varnothing&\text{otherwise.}\end{array}\right.

d_{S} (A, B) : = d (C_{S \mapsto A}, C_{S \mapsto B}) \vspace - 0, 1 c m

d_{S} (A, B) : = d (C_{S \mapsto A}, C_{S \mapsto B}) \vspace - 0, 1 c m

d (C, C^{'}) = S \in X \sum d_{S} (C (S), C^{'} (S)) .

d (C, C^{'}) = S \in X \sum d_{S} (C (S), C^{'} (S)) .

C_{A}(B)\coloneqq\left\{\begin{array}[]{ll}C(A)\cap B&\text{ if }C(A)\cap B\neq\varnothing,\\ C(B)&\text{ otherwise.}\end{array}\vspace{-0,1cm}\right.

C_{A}(B)\coloneqq\left\{\begin{array}[]{ll}C(A)\cap B&\text{ if }C(A)\cap B\neq\varnothing,\\ C(B)&\text{ otherwise.}\end{array}\vspace{-0,1cm}\right.

d_{rat} (C, C^{'}) : = A \in X \sum d_{Δ}^{A} (C_{A}, C_{A}^{'}), \vspace - 0, 1 c m \vspace - 0, 1 c m

d_{rat} (C, C^{'}) : = A \in X \sum d_{Δ}^{A} (C_{A}, C_{A}^{'}), \vspace - 0, 1 c m \vspace - 0, 1 c m

(c_{2})

(c_{2})

(c_{2}^{'})

d_{\operatorname{rat}}(c_{2},c_{2}^{\prime})=d^{\{x,y\}}_{\Delta}\big{(}(c_{2})_{\{x,y\}},(c_{2}^{\prime})_{\{x,y\}}\big{)}=2\,.

d_{\operatorname{rat}}(c_{2},c_{2}^{\prime})=d^{\{x,y\}}_{\Delta}\big{(}(c_{2})_{\{x,y\}},(c_{2}^{\prime})_{\{x,y\}}\big{)}=2\,.

irr_{ρ} (C) : = min {ρ (C, D) : D \in Choice_{rat} (X)} . \vspace - 0, 1 c m

irr_{ρ} (C) : = min {ρ (C, D) : D \in Choice_{rat} (X)} . \vspace - 0, 1 c m

irr_{d_{Δ}} (c_{1}) = 0, irr_{d_{Δ}} (c_{2}) = 2, irr_{d_{Δ}} (c_{3}) = 2 . \vspace - 0, 1 c m

irr_{d_{Δ}} (c_{1}) = 0, irr_{d_{Δ}} (c_{2}) = 2, irr_{d_{Δ}} (c_{3}) = 2 . \vspace - 0, 1 c m

irr_{d_{rat}} (c_{1}) = 0, irr_{d_{rat}} (c_{2}) = 2, irr_{d_{rat}} (c_{3}) = 3 . \vspace - 0, 1 c m

irr_{d_{rat}} (c_{1}) = 0, irr_{d_{rat}} (c_{2}) = 2, irr_{d_{rat}} (c_{3}) = 3 . \vspace - 0, 1 c m

x \underline{y}, \underline{x} z, \underline{y} z, x \underline{y} z \vspace - 0, 1 c m

x \underline{y}, \underline{x} z, \underline{y} z, x \underline{y} z \vspace - 0, 1 c m

\underline{x} y, x \underline{z}, \underline{y} \underline{z}, x y \underline{z} .

\underline{x} y, x \underline{z}, \underline{y} \underline{z}, x y \underline{z} .

irr_{d_{Δ}} (c_{2}) = irr_{d_{Δ}} (c_{3}) = irr_{d_{Δ}} (c_{4}) = 4 . \vspace - 0, 1 c m

irr_{d_{Δ}} (c_{2}) = irr_{d_{Δ}} (c_{3}) = irr_{d_{Δ}} (c_{4}) = 4 . \vspace - 0, 1 c m

irr_{d_{rat}} (c_{1}) = 0, irr_{d_{rat}} (c_{2}) = 6, irr_{d_{rat}} (c_{3}) = 16, irr_{d_{rat}} (c_{4}) = 19 . \vspace - 0, 1 c m

irr_{d_{rat}} (c_{1}) = 0, irr_{d_{rat}} (c_{2}) = 6, irr_{d_{rat}} (c_{3}) = 16, irr_{d_{rat}} (c_{4}) = 19 . \vspace - 0, 1 c m

{\operatorname{irr}}_{d_{{\operatorname{rat}}}}^{w}(c):=\min\big{\{}w(i_{r})\!\cdot\!d_{{\operatorname{rat}}}(c,r):r\in\textsf{choice}_{\mathrm{rat}}(X)\big{\}}.

{\operatorname{irr}}_{d_{{\operatorname{rat}}}}^{w}(c):=\min\big{\{}w(i_{r})\!\cdot\!d_{{\operatorname{rat}}}(c,r):r\in\textsf{choice}_{\mathrm{rat}}(X)\big{\}}.

irr_{d_{rat}}^{w} (c_{1}) = 0, irr_{d_{rat}}^{w} (c_{2}) = 5.4, irr_{d_{rat}}^{w} (c_{3}) = 12, irr_{d_{rat}}^{w} (c_{4}) = 13.2 .

irr_{d_{rat}}^{w} (c_{1}) = 0, irr_{d_{rat}}^{w} (c_{2}) = 5.4, irr_{d_{rat}}^{w} (c_{3}) = 12, irr_{d_{rat}}^{w} (c_{4}) = 13.2 .

irr_{d_{rat}}^{w^{'}} (c_{1}) = 0, irr_{d_{rat}}^{w^{'}} (c_{2}) = 6.6, irr_{d_{rat}}^{w^{'}} (c_{3}) = 16, irr_{d_{rat}}^{w^{'}} (c_{4}) = 17.6 .

irr_{d_{rat}}^{w^{'}} (c_{1}) = 0, irr_{d_{rat}}^{w^{'}} (c_{2}) = 6.6, irr_{d_{rat}}^{w^{'}} (c_{3}) = 16, irr_{d_{rat}}^{w^{'}} (c_{4}) = 17.6 .

p(a,A)\;=\;Pr\big{(}\{\rhd\in\textsf{LO}(X):(\forall\,x\in A\setminus\{a\})\;a\rhd x\}\big{)}.\vspace{-0,1cm}

p(a,A)\;=\;Pr\big{(}\{\rhd\in\textsf{LO}(X):(\forall\,x\in A\setminus\{a\})\;a\rhd x\}\big{)}.\vspace{-0,1cm}

q_{a, T} := T \subseteq U \subseteq X \sum (- 1)^{∣ U ∖ T ∣} p (a, U) .

q_{a, T} := T \subseteq U \subseteq X \sum (- 1)^{∣ U ∖ T ∣} p (a, U) .

v_{p}(x_{i}):=\left\{\begin{array}[]{ll}\left|\sum_{q_{x_{i},T}<0}q_{x_{i},T}\right|&\text{ if }q_{x_{i},T}<0\text{ for some $T\in 2^{X}$,}\\ 0&\text{ otherwise.}\end{array}\vspace{-0,1cm}\right.\vspace{-0,1cm}

v_{p}(x_{i}):=\left\{\begin{array}[]{ll}\left|\sum_{q_{x_{i},T}<0}q_{x_{i},T}\right|&\text{ if }q_{x_{i},T}<0\text{ for some $T\in 2^{X}$,}\\ 0&\text{ otherwise.}\end{array}\vspace{-0,1cm}\right.\vspace{-0,1cm}

p ≾^{*} p^{'} ⟺ (\exists σ \in S (X)) (\forall x \in X) v_{p} (x) ⩽ v_{p^{'}} (σ (x)) \vspace - 0, 1 c m

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEconomic theories and models · Decision-Making and Behavioral Economics

Full text

Rational measures of irrationality

Davide Carpentiere, Alfio Giarlotta , Stephen Watson Department of Mathematics and Computer Science, University of Catania, Italy.Department of Economics and Business, University of Catania, Italy. [email protected] *(corresponding author)*Department of Mathematics and Statistics, York University, Toronto, Canada.

Abstract

All possible types of deterministic choice behavior are classified by their degree of irrationality. This classification is performed in three steps: (1) select a benchmark of rationality, for which this degree is zero; (2) endow the set of choices with a metric to measure deviations from rationality; and (3) compute the distance of any choice behavior from the selected benchmark. The natural candidate for step 1 is the family of all rationalizable behaviors. A possible candidate for step 2 is a suitable variation of the metric described by Klamler (2008), which displays a sharp discerning power among different types of choice behaviors. In step 3 we use this new metric to establish the minimum distance of any choice behavior from the benchmark of rationality. Finally we describe a measure of stochastic irrationality, which employs the random utility model as a benchmark of rationality, and the Block-Marschak polynomials to measure deviations from it.

Keywords: Metric space; choice; rationalization; revealed preference; transitivity; $(m,n)$ -Ferrers property; choice localization; random utility model; Block-Marschak polynomials.

JEL Classification: D01, D81, C44.

1 Introduction

The goal of this paper is to evaluate the irrationality level of all possible choice behaviors on a finite set of alternatives. We perform this task in three successive steps:

(1)

establish a benchmark of rational choice behavior;

(2)

endow the set of all choice behaviors with a highly discerning metric;

(3)

compute the distance of any behavior from the benchmark of rationality.

The output of this process is a rational degree of irrationality of any deterministic choice behavior. (The use of the term ‘rational’ is motivated by the fact that we compute a distance from rationality in order to measure irrationality.) Before addressing in detail each step of this approach, let us discuss the general domain of our analysis.

Classically, the literature on choice theory is exclusively concentrated on ‘decisive’ choice behaviors, intended as situations in which the decision maker (DM) selects at least one item from any nonempty subset of the ground set: see, among a large amount of relevant contributions, the seminal papers by Samuelson (1938), Arrow (1959), and Sen (1971). In other words, the domain of analysis is classically restricted to choice correspondences, which are functions mapping nonempty sets into nonempty subsets. In addition, most of the recent models of ‘bounded rationality in choice’ typically deal with the even more restricted case of choice functions, which are single-valued choices correspondences (i.e., a unique item is selected from any nonempty menu): see, among several papers on the topic, Manzini and Mariotti (2007) and Masatlioglu et al. (2012).111See Giarlotta et al. (2022a) for a list of many models of bounded rationality in choice and a common analysis of their features by a unified approach.

Despite the great abundance of literature on choice functions and choice correspondences, it appears more realistic to consider the general case of quasi-choices, which model the behavior of possibly indecisive DMs: in this situation, the agent is allowed to select all, some, or none of the items available in any menu. To justify the potential interest in this approach, very recently Costa-Gomes et al. (2022) mention some compelling experiments, which suggest that choice models rejecting decisiveness may offer a powerful tool to study revealed preferences.222See also Chapter 1 of the advanced textbook on microeconomic theory by Kreps (2013), as well as the arguments presented in Section 1 of the recent paper by Alcantud et al. (2022). That is why in this paper we evaluate the rationality level of any type of choice behavior, may it be decisive or not.

Now we describe the three stages of our approach.

(1)

The first step consists of the selection of the benchmark of rationality —the ‘zero’— from which deviations ought to be measured. We select the most natural candidate, namely the family of all quasi-choices over the given set that are considered ‘rational’ according to revealed preference theory (Samuelson, 1938). Technically, these are the quasi-choices that can be explained by the maximization of a binary relation.333A more restrictive benchmark of rationality may be the family of quasi-choices rationalizable by binary relations satisfying some desirable properties.

(2)

The selection of a metric is the key step: this distance should accurately discerns among different types of choice behavior in an economically significant way. A possible candidate for this goal is the distance on quasi-choices proposed by Klamler (2008), which is computed by summing the cardinalities of all symmetric differences between pairs of choice sets. Due to its decomposability into trivial metrics, Klamler’s distance is however not well-suited for our goals, due to its low discernibility power. Using a notion of local rationalization, we design a refinement of this metric, which displays a sharp level of discrimination among different choices.

(3)

To finally establish the degree of irrationality for any deterministic choice behavior, we use the metric selected at step 2 to compute the minimum distance of a quasi-choice from a rationalizable one. In this way, all quasi-choices belonging to the benchmark of rationality have a degree of irrationality equal to zero, whereas all the others display a degree with a strictly positive value. Moreover, the more irrational a choice behavior is, the higher the value of the index becomes. We also describe a weighted version of this approach. Formally, since each rationalizable choice is explained by the maximization of a unique asymmetric preference —the strict revealed preference (Samuelson, 1938)—, we measure the subjective desirability of each rational behavior by the ‘level of transitivity’ of this binary relation:444Both Mas-Colell et al. (1995) and Kreps (2013) consider transitivity and completeness the basic tenets of economic rationality. the more this preference is close to being fully transitive,555By ‘fully transitive’ we mean that both strict preference and associated incomparability are transitive. the higher the desirability of the choice becomes. Once subjective desirability is encoded, we measure the degree of irrationality of any behavior by taking a weighted distance from rational behavior.

Finally, we suggest a probabilistic extension of our approach, which applies to stochastic choice functions. Recall that a stochastic choice function assigns a real number to each pair formed by a menu and an item in it, evaluating the likelihood of that item being selected from that menu. Choice functions are special stochastic choices in which this likelihood is one for exactly one item in a menu and zero for all the others.

The steps to measure the irrationality of a stochastic choice behavior are, however, different from the ones of the deterministic setting. Specifically, the first step is again the selection of a benchmark of rationality, for which we take the family of all stochastic choice functions satisfying the random utility model (RUM) (Block and Marschak, 1960). On the other hand, since the second and the third step of the deterministic approach are hardly adaptable, we employ a different procedure. In fact, we take advantage of the characterization established by Falmagne (1978), who shows that a stochastic choice function satisfies RUM if and only if all its Block-Marschak polynomials are non-negative. Therefore, any choice that fails to satisfy RUM must have at least one negative Block-Marschak polynomial. Upon summing up all these negative polynomials for each element in the ground set, we obtain a negativity vector, which provides a discerning measure of the irrationality of a stochastic choice behavior. The comparison of these vectors is then performed by a permutation-invariant Pareto ordering, which in turn yields a partial classification of all stochastic choices according to their degree of irrationality.

The paper is organized as follows. Section 2 collects preliminary notions and presents a review of the related literature on deterministic choices. In Section 3 we describe the metric introduced by Klamler, and then a highly discerning variation of it. In Section 4 we formally define two distance-based degrees of irrationality of a choice behavior, and show the soundness of the novel metric for this task. In Section 5 we suggest an extension of our approach to a stochastic environment.

2 Measures of deterministic irrationality

First we recall some preliminary notions in choice theory. Then we suggest several ways to measures the irrationality of a deterministic choice behavior, and present a quick review of recent literature on the topic.

2.1 Preliminaries

A finite set $X$ of $n\geq 2$ alternatives is fixed throughout. We use $\mathscr{X}$ to denote the family of all nonempty subsets of $X$ .

A quasi-choice correspondence over $X$ is a function $C\colon\mathscr{X}\cup\{\varnothing\}\to\mathscr{X}\cup\{\varnothing\}$ such that $C(A)\subseteq A$ for all $A\in\mathscr{X}\cup\{\varnothing\}$ . A choice correspondence over $X$ is a quasi-choice that is never empty-valued on nonempty sets, that is, a function $c\colon\mathscr{X}\cup\{\varnothing\}\to\mathscr{X}\cup\{\varnothing\}$ such that $\varnothing\neq c(A)\subseteq A$ for all $A\in\mathscr{X}$ .666To emphasize decisiveness, we shall use upper case letters ( $C$ , $C^{\prime}$ , etc.) to denote possibly indecisive choice behaviors, that is, quasi-choice correspondences. On the other hand, lower case letters ( $c$ , $c^{\prime}$ , etc.) will be employed to denote decisive choice behaviors, that is, choice correspondences. Sets in $\mathscr{X}\cup\{\varnothing\}$ are menus, elements of a nonempty menu are items, and the set $C(A)$ (or $c(A)$ ) is the choice set of the menu $A$ . Unless confusion may arise, hereafter we speak of quasi-choices and choices, respectively. Moreover, $\textsf{Choice}(X)$ (resp. $\textsf{choice}(X)$ ) denotes the family of all quasi-choices (resp. choices) over $X$ .

A binary relation $\succ$ over $X$ is a subset of $X\times X$ , which is:

asymmetric if $x\succ y$ implies $\neg(y\succ x)$ for all $x,y\in X$ ;
irreflexive if $x\succ x$ holds for no $x\in X$ ;
acyclic if $x_{1}\succ x_{2}\succ\ldots\succ x_{k}\succ x_{1}$ holds for no $x_{1},x_{2},\ldots,x_{k}\in X$ ( $k\geqslant 3$ );
transitive if $x\succ y\succ z$ implies $x\succ z$ for all $x,y,z\in X$ ;
negatively transitive if $\neg(x\succ y)\wedge\neg(y\succ z)$ implies $\neg(x\succ z)$ for all $x,y,z\in X$ .

Note that (i) asymmetry implies irreflexivity, (ii) transitivity and asymmetry implies acyclicity, and (iii) asymmetry and negative transitivity implies transitivity. We will often refer to an asymmetric binary relation as a (strict) preference.

Choices and preferences are closely related to each other. In fact, since the seminal work of Samuelson (1938), the ‘rationality’ of a decisive choice behavior is classically modeled by the notion of ‘binary rationalizability’, that is, the possibility to explain it by maximizing a suitable binary relation. Formally, a choice $c\colon\mathscr{X}\cup\{\varnothing\}\to\mathscr{X}\cup\{\varnothing\}$ is rationalizable if there is an asymmetric binary relation $\succ$ over $X$ such that for any nonempty menu $A$ , the equality

[TABLE]

holds. The binary relation $\succ$ is called the (strict) preference revealed by $c$ . Note that $\succ$ must also be acyclic in order to rationalize the choice $c$ . Moreover, the asymmetric relation of revealed preference is unique for any rationalizable choice.777Here, we purposely avoid mentioning the symmetric part of the relation of reveled preference, because it is irrelevant to detect the rationalizability of a choice.

2.2 Related literature

In view of our goal to distinguish choice behaviors by their consistency features, the notion of rationalizability is the most popular in the literature. This notion was first introduced for choice functions (that is, single-valued choice correspondences), and then extended to choice correspondences. However, rationalizability can be naturally generalized to quasi-choices, provided that the rationalizing preference is allowed not to be irreflexive, asymmetric, or acyclic. Formally, we call a quasi-choice $C\colon\mathscr{X}\cup\{\varnothing\}\to\mathscr{X}\cup\{\varnothing\}$ rationalizable if there is an arbitrary binary relation over $X$ —here denoted by ‘ $\to$ ’ to emphasize its arbitrariness— such that the equality

[TABLE]

holds for all menus $A\in\mathscr{X}$ . Here the key fact is the possible lack of properties of $\to$ , which follows from the necessity to model indecisive choice behaviors. For instance, since asymmetry is not guaranteed, we may have $x\to y\to x$ for some distinct elements $x$ and $y$ , in which case $C(\{x,y\})$ is empty.888To justify such a situation, imagine a political ballot in which the two remaining candidates are extremists, and my moderate political view suggests me to abstain from voting. Similarly, the possible lack of irreflexivity of $\to$ permits situations of the type $x\to x$ , which in turn yields $C(\{x\})=\varnothing$ .999For instance, if a restaurant only offers a chocolate cake as dessert and I am allergic to chocolate, then I shall avoid taking dessert. Note also that, contrary to the case of choices, the rationalizable preference —which is called a voter by Alcantud et al. (2022)— need not be unique for the general case of quasi-choices.101010On this point, see Section 2 in Alcantud et al. (2022). Here the authors extensively dwell on the reasons motivating the more general use of quasi-choices instead of choices, and the use of arbitrary binary relations to justify choice behavior.

All in all, according to this classical paradigm, any (decisive or indecisive) choice behavior is regarded irrational if it fails to be rationalizable. This yields a simple dichotomy rational/irrational or, equivalently, rationalizable/non-rationalizable. However, this dichotomy not very satisfactory in practice, because rationalizability fails to explain the overwhelming majority of observed choice behaviors.111111For a precise computation of the fraction of rationalizable choices over a set of fixed size, see Giarlotta et al. (2022a, Lemma 6)

Recently, following the inspiring analysis of Simon (1955), the notion of rationalizability has been amended by several forms of bounded rationality, which aim to explain a larger portion of choice behaviors by means of more flexible paradigms. To wit this trend, there are tens of models of bounded rationality in choice that have been proposed in the last twenty years: see Giarlotta et al. (2022a) for a vast account of them. The dichotomy boundedly rational/boundedly irrational is certainly more satisfactory than the rational/irrational one, allowing one to identify choice behaviors that obey some more relaxed (but still justifiable) constraints.121212The fraction of boundedly rational choice functions is definitively larger than that of rationalizable choices: compare Lemma 6 with Theorem 3 in Giarlotta et al. (2022a). However, this bounded rationality approach does not apply to most choice behaviors: in fact, it has essentially been proposed exclusively for choice functions, with very few cases of choice correspondences, moreover leaving completely out the case of quasi-choices.

A conceptually different modelization of rationality does not distinguish between (bounded) rationality and (bounded) irrationality. Rather, it creates a partition of the family of choices in several classes, each of which is assigned a degree of rationality. A seminal approach in this direction is the rationalization by multiple rationales (RMR) of Kalai et al. (2002). The RMR model yields a partition of the family of all choice functions over a set with $n$ items into $n-1$ equivalence classes of rationality, which are determined by the minimum number of linear orders that are necessary to explain decisive choice behavior: the larger this number, the less rational the behavior.131313Very recently, a structured version of the RMR model, called choice by salience, has been proposed by Giarlotta, et al. (2022b). Rationalizable choice functions obviously belong to the first class of rationality, since a unique linear order suffices. On the other side of the scale of rationality, we find those choice functions that require the maximum number of rationales (namely $n-1$ ) to be justified. Despite its conceptually appealing motivation, the RMR model displays some drawbacks: (i) the family of rationalizing linear orders only provides a ‘non-structured’ explanation of choice behavior; (ii) the class of maximally irrational choices (i.e., the ones requiring $n-1$ rationales) essentially collects all choices, even for very small sets of alternatives; and (iii) this model only applies to choice functions (but it could be naturally extended to choice correspondences).141414The choice model based on salience (Giarlotta, et al., 2022b) creates a partition into $n$ classes of rationality, and positively addresses the first two issues of the RMR approach. Specifically, concerning (1), a binary relation of salience restricts the application of rationales to those indexed by the maximally salient items of a menu. Concerning (2), the smallest choice function in the last class of rationality that the authors are able to exhibit is defined on a set of 39 elements.

Another approach devoted to identify the degree of irrationality of a deterministic choice function is due to Ambrus and Rozen (2014). As for the RMR model, also this approach is based on a counting technique. Specifically, the authors use a classical property of choice consistency —namely Independence of Irrelevant Alternatives (Arrow, 1950), which is equivalent to Axiom $\alpha$ (Chernoff, 1954) for choice functions— to establish the degree of irrationality of a choice. They count the number of violations of Axiom $\alpha$ that a choice behavior exhibits: the larger this number, the less rational the behavior. In particular, they introduce a notion of violations of Axiom $\alpha$ , and accordingly define the index of irrationality of a choice by counting all menus that violate Axiom $\alpha$ . The abstract idea of their approach is appealing: it accounts to measure irrationality by counting deviations from rationality according to an axiomatic parameter (Axiom $\alpha$ ).

As we shall see, the approach developed in this paper measures the irrationality of choice behaviors in a way inspired by Ambrus and Rozen (2014). In fact, similarly to them, we analyze deviations from rationality according to axiomatic parameters, namely Axioms $\alpha$ and $\gamma$ (Sen, 1971), which are equivalent to the rationalizability of a quasi-choice. However, contrary to Ambrus and Rozen (2014), we do not directly count violations of properties of choice consistency. Instead, we use an indirect approach: first we establish a theoretical way to measure violations, that is, a metric, and only then we count deviations from rationality using this metric. Of course, the soundness of such a procedure boils down to the selection of a metric that is both economically significant and highly discerning. The next three sections will extensively address this issue.

3 Metrics on quasi-choices

This section is devoted to present ways to endow the family of all possible choice behaviors with metrics. Specifically, first we recall a metric due to Klamler (2008), and then describe a variation of it, which showcases a rather sharp discernibility power. In Section 4 we shall employ this novel metric as the measuring stick to evaluate deterministic deviations from rational behavior. To start, we recall the notion of distance between quasi-choices.

Definition 3.1.

A metric on $\textsf{Choice}(X)$ is a map $d\colon\textsf{Choice}(X)\times\textsf{Choice}(X)\to{\mathbb{R}}$ such that for all $C,C^{\prime},C^{\prime\prime}\in\textsf{Choice}(X)$ , the following properties hold:

[A0.1]

$d(C,C^{\prime})\geq 0$ , and equality holds if and only if $C=C^{\prime}$ ;

[A0.2]

$d(C,C^{\prime})=d(C^{\prime},C)$ ;

[A0.3]

$d(C,C^{\prime})+d(C^{\prime},C^{\prime\prime})\geq d(C,C^{\prime\prime})$ .

Property A0.1 is non-negativity, property A0.2 is symmetry, and property A0.3 is the triangle inequality.

3.1 Klamler’s metric

The symmetric difference $\Delta$ of sets (Kemeny, 1959) induces a metric on quasi-choices:

Definition 3.2 (Klamler, 2008).

Let $d_{\Delta}\colon\textsf{Choice}(X)\times\textsf{Choice}(X)\to{\mathbb{R}}$ be the function defined as follows for all $C,C^{\prime}\in\textsf{Choice}(X)$ :

[TABLE]

By Definition 3.2, the distance between two quasi-choices over the same set of alternatives is obtained by a simple and intuitive procedure: first count the number of items in a menu that are in one choice set but not in the other one, and then take the sum of these numbers over all menus. Usually, being simple and intuitive is regarded as a good feature of a notion. Unfortunately, here this fact translates into an oversimplified evaluation of the distance between two behaviors, which totally neglects their structural features. Specifically, by only looking at the ‘size’ of the disagreement of two quasi-choices over menus, Definition 3.2 fails to consider the ‘semantics’ of this disagreement, which lies in the very nature of the items selected by exactly one of them. This in turn produces some important shortcomings of this metric in the process of detecting deviations from rational behavior. The next two examples provide striking instances of this kind.

Example 3.3.

Consider the following three choice functions on $X=\{x,y,z\}$ (the unique item selected from each menu is underlined):

[TABLE]

The choices $c_{1},c_{2},c_{3}$ are equal on pairs of items but differ on the full menu $X$ . On pairs, the selection process is reproduced by maximizing the linear order $x\succ y\succ z$ . However, $c_{1}$ is rationalizable by $\succ$ , whereas $c_{2}$ and $c_{3}$ are not. Intuition suggests that $c_{3}$ should be further than $c_{2}$ from the rational choice $c_{1}$ . For instance, if we use the linear order $\succ$ to rationalize pairs of items, then $c_{2}$ selects the second-best item from $X$ , whereas $c_{3}$ ends up selecting the worst item of the three.151515Of course, one may always consider different scenarios, in which $c_{3}$ is regarded more rational than $c_{2}$ . However, these scenarios appear to be less likely to happen. On the other hand, the metric $d_{\Delta}$ does regard $c_{2}$ and $c_{3}$ as equally distant from $c_{1}$ , because we have

[TABLE]

Example 3.4.

Let $c_{1},c_{2},c_{3},c_{4}$ be four choice correspondences over $X=\{x,y,z,w\}$ , which are defined exactly in the same way for all menus distinct from $\{x,y,z\}$ and $X$ , namely

[TABLE]

However, $c_{1},c_{2},c_{3},c_{4}$ select different elements from the two menus $\{x,y,z\}$ and $X$ , namely

$(c_{1})$

$\underline{x}\underline{y}z\,,\;\underline{x}\underline{y}zw\,$ ,

$(c_{2})$

$\underline{x}y\underline{z}\,,\;\underline{x}y\underline{z}w\,$ ,

$(c_{3})$

$x\underline{y}\underline{z}\,,\;x\underline{y}z\underline{w}\,$ ,

$(c_{4})$

$x\underline{y}z\,,\;xyz\underline{w}\,$ .

Note that $c_{1}$ is rationalizable by the relation $\succ$ over $X$ such that $x\succ z$ , $x\succ w$ , $y\succ w$ , and $z\succ w$ . On the other hand, the three choices $c_{2},c_{3},c_{4}$ fail to be rationalizable, but they have exactly the same distance from the rationalizable choice $c_{1}$ :

[TABLE]

However, similarly to Example 3.3, it is reasonable to assume that $c_{2}$ is ‘semantically’ closer to $c_{1}$ than $c_{3}$ is: in fact, $c_{2}$ selects from the menus $\{x,y,z\}$ and $X$ some items that are better ranked (by $\succ$ ) than those selected by $c_{3}$ . There are also solid arguments to validate the opinion that $c_{4}$ should be the farthest choice from $c_{1}$ .

The low discernibility power of $d_{\Delta}$ is due to (some of) the properties it satisfies. Carpentiere et al. (2023) —slightly correcting the findings of Klamler (2008)— prove that the following properties characterize $d_{\Delta}$ (a universal quantification is implicit):

[A1]

$d(C,C^{\prime})+d(C^{\prime},C^{\prime\prime})=d(C,C^{\prime\prime})$ if and only if $C^{\prime}$ is between $C$ and $C^{\prime\prime}$ ;161616The notion of ‘betweenness’ is due to Alabayrak and Aleskerov (2000): $C^{\prime}$ is between $C$ and $C^{\prime\prime}$ if $C(S)\cap C^{\prime\prime}(S)\subseteq C^{\prime}(S)\subseteq C(S)\cup C^{\prime\prime}(S)$ holds for any $S\in\mathscr{X}$ .

[A2]

if $\widetilde{C}$ and $\widetilde{C^{\prime}}$ result from, respectively, $C$ and $C^{\prime}$ by the same permutation of alternatives, then $d(C,C^{\prime})=d(\widetilde{C},\widetilde{C^{\prime}})$ ;

[A3]

if $C$ and $C^{\prime}$ agree on all (nonempty) menus in $\mathscr{X}$ except for a subfamily $\mathscr{X}^{\prime}\subseteq\mathscr{X}$ , then the distance $d(C,C^{\prime})$ is determined exclusively from the choice sets over $\mathscr{X}^{\prime}$ ;

[A4*′*]

if $C,C^{\prime},\widetilde{C},\widetilde{C^{\prime}}$ only disagree on a menu $T\in\mathscr{X}$ such that $C(T)=\widetilde{C}(T)\Delta S$ and $C^{\prime}(T)=\widetilde{C^{\prime}}(T)\Delta S$ for some $S\subseteq T$ , then $d(C,C^{\prime})=d(\widetilde{C},\widetilde{C^{\prime}})$ ;

[A5’]

for all $C\in\textsf{Choice}(X)$ and $A\in\mathscr{X}$ , there is $C^{\prime}\in\textsf{Choice}(X)$ with the property that $|C(A)\Delta C^{\prime}(A)|=1$ , $C(B)=C^{\prime}(B)$ for all $B\neq A$ , and $d(C,C^{\prime})=1$ .

Axioms A1, A2, A3, A4’, and A5’ are rather intuitive requirements for a metric on the family of quasi-choices. In fact, Axiom A1 strengthens the triangle inequality A0.3 by requiring that equality holds exactly for cases of betweenness. Axiom A2 states a condition of invariance under permutations. Axiom A3 is a separability property, whereas Axiom A4’ is a condition of translation-invariance. The first four properties produce a unique metric, up to some multiplicative coefficients that only depend on the size of the menu: Axiom A5’ forces these coefficients to be unique.

As announced, we have:

Theorem 3.5 (Carpentiere et al., 2023).

The unique metric on $\textsf{Choice}(X)$ that satisfies Axioms A1, A2, A3, A4’, A5’ is $d_{\Delta}$ .

Unfortunately, the discernibility power of $d_{\Delta}$ among different choice behaviors is rather low, which is essentially due to the satisfaction of Axioms A1 and A3. To illustrate this fact, below we summarize some of the findings in Carpentiere et al. (2023).

Definition 3.6.

A quasi-choice $C$ on $X$ is elementary if there is at most one menu $S\in\mathscr{X}$ such that $C(S)\neq\varnothing$ . For any $S,T\in\mathscr{X}\cup\{\varnothing\}$ such that $T\subseteq S$ , we denote by $C_{S\mapsto T}$ the elementary quasi-choice over $X$ defined as follows:

[TABLE]

Definition 3.7.

Let $d$ be a metric on $\textsf{Choice}(X)$ , and $S\in\mathscr{X}$ . Denote by $\mathscr{X}_{S}$ the family of all nonempty subsets of $S$ . Define a metric $d_{S}\colon\mathscr{X}_{S}\cup\{\varnothing\}\times\mathscr{X}_{S}\cup\{\varnothing\}\to{\mathbb{R}}$ by

[TABLE]

for all $A,B\in\mathscr{X}_{S}\cup\{\varnothing\}$ . We call $d_{S}$ the characteristic metric induced by $d$ on $\mathscr{X}_{S}\cup\{\varnothing\}$ .

Any metric on $\textsf{Choice}(X)$ satisfying A1 and A3 —hence, in particular, $d_{\Delta}$ — is a sum of characteristic metrics:

Lemma 3.8 (Elementary Decomposability).

Let $d$ be a metric on $\textsf{Choice}(X)$ satisfying Axioms A1 and A3. For all $C,C^{\prime}\in\textsf{Choice}(X)$ ,

[TABLE]

The metric defined in the next section satisfies neither A1 nor A3.

3.2 A rational variation of Klamler’s metric

We design a novel metric by suitably modifying Klamler’s distance. This variation is inspired by Ambrus and Rozen (2014), because we employ two axioms of choice consistency —in place of one— to guide its construction:

Axiom $\alpha\,$ :

for all $A,B\subseteq X$ and $x\in X$ , if $x\in A\subseteq B$ and $x\in C(B)$ , then $x\in C(A)$ ;

Axiom $\gamma\,$ :

for all $A,B\subseteq X$ and $x\in X$ , if $x\in C(A)$ and $x\in C(B)$ , then $x\in C(A\cup B)$ .

Axiom $\alpha$ is due to Chernoff (1954). In words, if an item is selected from a menu, then it is also chosen from any smaller menu containing it. This property is often referred to as Standard Contraction Consistency. Its role in abstract theories of individual and social choice is central. Nehring (1997, p. 407) even calls Axiom $\alpha$ “the mother of all choice consistency conditions’’. Axiom $\gamma$ , often referred to as Standard Expansion Consistency, is due to Sen (1971). It says that if an item is selected from two menus, then it is also chosen from the larger menu obtained as their union.

The connection between these two properties and rational behavior is well-known:

Theorem 3.9 (Sen, 1971).

A choice correspondence is rationalizable if and only if both $\alpha$ and $\gamma$ hold.171717This characterization readily extends to quasi-choices: see Aizerman and Aleskerov (1995, Theorem 2.5). For a proof of this generalization, see Aleskerov and Monjardet (2002, Theorem 2.8).

We now proceed to define a suitable refinement of $d_{\Delta}$ , which takes into account all ‘locally rational approximations’ of the original quasi-choice. Specifically, we consider all restrictions of the given correspondence to all subsets of any given menu, and modify them in order to obtain quasi-choices that locally satisfy Axioms $\alpha$ and $\gamma$ . Finally, we sum up all differences of these rational modifications.

Definition 3.10.

Let $C\colon\mathscr{X}\cup\{\varnothing\}\to\mathscr{X}\cup\{\varnothing\}$ be a quasi-choice over $X$ , and $A\in\mathscr{X}$ a nonempty menu. Define a quasi-choice $C_{A}\colon\mathscr{A}\cup\{\varnothing\}\to\mathscr{A}\cup\{\varnothing\}$ over $A$ , where $\mathscr{A}$ is the family of $\mathscr{X}$ comprising all nonempty subsets of $A$ , as follows for each $B\in\mathscr{A}\cup\{\varnothing\}$ :

[TABLE]

We call $C_{A}$ the rational localization of $C$ at $A$ . Then, for all $C,C^{\prime}\in\textsf{Choice}(X)$ , the rational distance between $C$ and $C^{\prime}$ is defined by

[TABLE]

where $d^{A}_{\Delta}$ denotes the restriction $d_{\Delta}\!\!\upharpoonright_{\textsf{Choice}(A)\times\textsf{Choice}(A)}$ .

Definition 3.10 employs $(|2^{X}|-1)$ -many restrictions of the given metric $d_{\Delta}$ to compare standard modifications of two given quasi-choices: these modifications are a sort of rational closures of a given choice on a given menu. Note that the terminology of ‘local rationalization’ used for $C_{A}$ is motivated by the fact that any element $x\in C(A)$ is never responsible for a violation of Axioms $\alpha$ or $\gamma$ by $C_{A}$ .181818More formally, considering Axiom $\alpha$ , this means that if $y\in S\subseteq T\subseteq A$ , $y\in C_{A}(T)$ , and $y\notin C_{A}(S)$ , then $y\notin C(A)$ . Similarly, for Axiom $\gamma$ , if $S,T\subseteq A$ , $y\in C_{A}(S)\cap C_{A}(T)$ , and $y\notin C_{A}(S\cup T)$ , then $y\notin C(A)$ .

Example 3.11.

We illustrate how Definition 3.10 works in a very simple case. Consider the two choice functions $c_{2}$ and $c_{2}^{\prime}$ over $X=\{x,y,z\}$ defined by191919The choice function $c_{2}$ has already been considered in Example 3.3.

[TABLE]

Note that $c_{2}$ and $c_{2}^{\prime}$ are equal except on the menu $\{x,y\}$ , and $c_{2}^{\prime}$ is rationalizable by the linear order $y\succ x\succ z$ . To determine $d_{\operatorname{rat}}(c_{2},c_{2}^{\prime})$ , we preliminary compute their rational localizations $(c_{2})_{A}$ and $(c_{2}^{\prime})_{A}$ at any nonempty subset $A$ of $X$ having size at least two:

(Note that all rational localizations at singletons are trivial choice functions in this particular case.) Since $(c_{2})_{A}=(c^{\prime}_{2})_{A}$ for each $A\neq\{x,y\}$ , whereas $(c_{2})_{\{x,y\}}(\{x,y\})=x$ and $(c_{2}^{\prime})_{\{x,y\}}(\{x,y\})=y$ , we conclude

[TABLE]

As possibly expected, Definition 3.10 is sound:

Lemma 3.12.

The function $d_{\operatorname{rat}}$ is a metric on $\textsf{Choice}(X)$ .

Proof. For A0.1, clearly $d_{\operatorname{rat}}(C,C^{\prime})$ is nonnegative. If $d(C,C^{\prime})=0$ , then $d^{A}_{\Delta}(C_{A},C^{\prime}_{A})=0$ for all $A\in\mathscr{X}$ . It follows that $C_{A}(B)=C^{\prime}_{A}(B)$ for all $B\subseteq A$ , and so $C(A)=C_{A}(A)=C^{\prime}_{A}(A)=C^{\prime}(A)$ . This proves A0.1. Axiom A0.2 is obvious. For A0.3, observe that $d^{A}_{\Delta}(C_{A},C^{\prime\prime}_{A})\leq d^{A}_{\Delta}(C_{A},C^{\prime}_{A})+d^{A}_{\Delta}(C^{\prime}_{A},C^{\prime\prime}_{A})$ , because $d^{A}_{\Delta}$ is the restriction of a metric. Thus the claim follows from summing over all $A\in\mathscr{X}$ . $\Box$

The next remark shows that, despite being derived from $d_{\Delta}$ , the rational metric $d_{\operatorname{rat}}$ does not satisfy several properties considered by Klamler; in particular, neither of the two axioms responsible for elementary decomposability —namely A1 and A3— hold for $d_{\operatorname{rat}}$ .

Remark 3.13.

We prove that $d_{\operatorname{rat}}$ satisfies neither A1 nor A3 nor A4’. All counterexamples will be quasi-choices over the set $X=\{x,y,z\}$ . Since in all cases the choice set of any singleton is nonempty, we only define them on menus having size two or three.

To prove the failure of A1, define $C,C^{\prime},C^{\prime\prime}\in\textsf{Choice}(X)$ by

$(C)$

$\underline{x}y\,,\;\underline{x}\underline{z}\,,\;yz\,,\;\underline{x}y\underline{z}\,$ ;

$(C^{\prime})$

$\underline{x}\underline{y}\,,\;\underline{x}\underline{z}\,,\;\underline{y}z\,,\;\underline{x}\underline{y}z\,$ ;

$(C^{\prime\prime})$

$\underline{x}\underline{y}\,,\;\underline{x}\underline{z}\,,\;\underline{y}\underline{z}\,,\;x\underline{y}z\,$ .

Clearly, $C^{\prime}$ is between $C$ and $C^{\prime\prime}$ . However, $d_{\operatorname{rat}}(C,C^{\prime\prime})=10$ is different from $d_{\operatorname{rat}}(C,C^{\prime})+d_{\operatorname{rat}}(C^{\prime},C^{\prime\prime})=8+4=12$ .

For the failure of A3, define $C,C^{\prime},D,D^{\prime}\in\textsf{Choice}(X)$ by

$(C)$

$\underline{x}y\,,\;xz\,,\;y\underline{z}\,,\;\underline{x}\underline{y}z\,$ ;

$(C^{\prime})$

$xy\,,\;xz\,,\;y\underline{z}\,,\;\underline{x}\underline{y}z\,$ ;

$(D)$

$\underline{x}y\,,\;xz\,,\;yz\,,\;xy\underline{z}\,$ ;

$(D^{\prime})$

$xy\,,\;xz\,,\;yz\,,\;xy\underline{z}\,$ .

Let $\mathscr{X}^{\prime}=\{\{x,y\},\{x,z\}\}$ , and observe that $C\!\!\upharpoonright_{\mathscr{X}\setminus\mathscr{X}^{\prime}}=C^{\prime}\!\!\upharpoonright_{\mathscr{X}\setminus\mathscr{X}^{\prime}}$ , $D\!\!\upharpoonright_{\mathscr{X}\setminus\mathscr{X}^{\prime}}=D^{\prime}\!\!\upharpoonright_{\mathscr{X}\setminus\mathscr{X}^{\prime}}$ , $C\!\!\upharpoonright_{\mathscr{X}^{\prime}}=D\!\!\upharpoonright_{\mathscr{X}^{\prime}}$ , and $C^{\prime}\!\!\upharpoonright_{\mathscr{X}^{\prime}}=D^{\prime}\!\!\upharpoonright_{\mathscr{X}^{\prime}}$ . However, $d_{\operatorname{rat}}(C,C^{\prime})=1$ whereas $d_{\operatorname{rat}}(D,D^{\prime})=2$ .

For the failure of A4’, define $C,\widetilde{C},C^{\prime},\widetilde{C^{\prime}}\in\textsf{Choice}(X)$ by

$(C)$

$\underline{x}y\,,\;\underline{x}z\,,\;\underline{y}z\,,\;\underline{x}yz\,$ ;

$(\widetilde{C})$

$\underline{x}y\,,\;\underline{x}z\,,\;\underline{y}z\,,\;\underline{x}\underline{y}\underline{z}\,$ ;

$(C^{\prime})$

$\underline{x}y\,,\;\underline{x}z\,,\;\underline{y}z\,,\;x\underline{y}\underline{z}\,$ ;

$(\widetilde{C^{\prime}})$

$\underline{x}y\,,\;\underline{x}z\,,\;\underline{y}z\,,\;xyz\,$ .

The four quasi-choices over $X$ agree on every menu, except on $X$ . For $S=\{y,z\}$ , we have $C(X)=\widetilde{C}(X)\Delta S$ and $C^{\prime}(X)=\widetilde{C^{\prime}}(X)\Delta S$ , and yet $d_{\operatorname{rat}}(C,C^{\prime})=8\neq 6=d_{\operatorname{rat}}(\widetilde{C},\widetilde{C^{\prime}})$ .

It would be interesting to axiomatically characterize the rational metric $d_{\operatorname{rat}}$ : we leave this as an open problem.

4 Distance-based degrees of irrationality

We finally give a formal definition of the measure of irrationality of a deterministic choice behavior with respect to a given metric, where the family of rationalizable quasi-choices acts as the benchmark of rationality. We provide two versions of it: (1) simple, and (2) weighted. The first applies to all quasi-.choices, whereas the second is only designed for choice correspondences.

4.1 A simple degree of irrationality

Definition 4.1.

Let $\rho\colon\textsf{Choice}(X)\to\textsf{Choice}(X)$ be a metric. Denote by $\textsf{Choice}_{\mathrm{rat}}(X)$ the subfamily of $\textsf{Choice}(X)$ comprising all quasi-choices that are rationalizable. For any quasi-choice $C$ over $X$ , the $\rho$ -degree of irrationality of $C$ is the integer defined by

[TABLE]

(This degree is well-defined, because $X$ is finite.)

Given a metric $\rho$ on $\textsf{Choice}(X)$ , the larger the $\rho$ -degree of irrationality of a quasi-choice $C$ is, the more irrational $C$ is considered from the point of view of $\rho$ . Note that if a quasi-choice is rationalizable, then its $\rho$ -degree of irrationality is zero for any metric $\rho$ . For instance, the choice function $c_{1}$ defined in Example 3.3 has a $d_{\Delta}$ -degree of irrationality equal to zero, whereas $c_{2}$ and $c_{3}$ have a $d_{\Delta}$ -degree of irrationality equal to two.

As already pointed out, the soundness of Definition 4.1 depends on the economic significance and the discernibility power of the metric used to determine the degree of irrationality. In this respect, the rational metric $d_{\operatorname{rat}}$ appears to be better suited than Klamler’s distance $d_{\Delta}$ . The next two examples witness this claim.

Example 4.2.

Consider the three choice functions $c_{1},c_{2},c_{3}$ defined in Example 3.3. It is easy to show that

[TABLE]

On the other hand, below we show that

[TABLE]

$\bullet$ ** ${\operatorname{irr}}_{d_{\operatorname{rat}}}(c_{1})=0$ :**

This is obvious, because $c_{1}$ is rationalizable.

$\bullet$ ** ${\operatorname{irr}}_{d_{\operatorname{rat}}}(c_{2})=2$ :**

As noted in Example 3.11, the choice function $c_{2}^{\prime}$ defined by

[TABLE]

is rationalizable by the linear order $\succ$ , with $y\succ x\succ z$ . We know that $d_{\operatorname{rat}}(c_{2},c_{2}^{\prime})=2$ . Therefore, to prove the claim, we show that $d_{\operatorname{rat}}(c_{2},D)\geq 2$ for all $D\in\textsf{Choice}_{\mathrm{rat}}(X)$ .

Hereafter we shall employ a simplified notation, which is also used – mutatis mutandis – in the proof of the equality ${\operatorname{irr}}_{d_{\operatorname{rat}}}(c_{3})=3$ . Specifically, for all $A\in\mathscr{X}$ , we denote $d^{A}_{\Delta}((c_{2})_{A},D_{A})$ by the less cumbersome $d^{A}_{\Delta}$ . Moreover, we drop brackets and set separators whenever clear from context, using $D(xz)=x$ instead of $D(\{x,z\})=\{x\}$ , $d^{xz}_{\Delta}$ instead of $d^{\{x,z\}}_{\Delta}$ , etc.

Now fix $D\in\textsf{Choice}_{\mathrm{rat}}(X)$ . Then, either (1) $y\in D(xy)$ , or (2) $y\notin D(xy)$ .

(1)

If $y\in D(xy)$ , then we separately consider two cases.

(1A)

If $x\notin D(xy)$ , then $D_{xy}(xy)=y$ . Since $(c_{2})_{xy}(xy)=x$ , we obtain $d_{\Delta}^{xy}\geq 2$ , hence $d_{\operatorname{rat}}(c_{2},D)\geq 2$ .

(1B)

If $x\in D(xy)$ , then we split the analysis in two subcases.

(1B1)

If $x\notin D(xz)$ , then $x\notin D_{xz}(xz)$ , while $x\in(c_{2})_{xz}(xz)$ . It follows that $d_{\Delta}^{xz}\geq 1$ . Note that $y\in D_{xy}(xy)$ and $y\notin(c_{2})_{xy}(xy)$ imply $d_{\Delta}^{xy}\geq 1$ . We conclude $d_{\operatorname{rat}}(c_{2},D)\geq 2$ .

(1B2)

If $x\in D(xz)$ , then $x\in D(xyz)$ by Axiom $\gamma$ , and so $x\in D_{xyz}(xyz)$ . Since $x\notin(c_{2})_{xyz}(xyz)$ , we get $d_{\Delta}^{xyz}\geq 1$ . As before, $y\in D_{xy}(xy)$ and $y\notin(c_{2})_{xy}(xy)$ imply $d_{\Delta}^{xy}\geq 1$ . We conclude $d_{\operatorname{rat}}(c_{2},D)\geq 2$ .

(2)

If $y\notin D(xy)$ , then $y\notin D(xyz)$ by Axiom $\alpha$ . Thus $y\notin D_{xyz}(xyz)\cup D_{xyz}(xy)$ and $y\in(c_{2})_{xyz}(xyz)\cap(c_{2})_{xyz}(xy)$ . We conclude $d_{\Delta}^{xyz}\geq 2$ , hence $d_{\operatorname{rat}}(c_{2},D)\geq 2$ .

$\bullet$ ** ${\operatorname{irr}}_{d_{\operatorname{rat}}}(c_{3})=3$ :**

Let $c_{3}^{\prime}$ be the choice rationalizable by the relation $\succ$ on $X$ defined by $z\succ x$ and $x\succ y$ , that is,

[TABLE]

(Note that $\succ$ is not transitive, because $y$ and $z$ are incomparable.) It is easy to check that $d_{\Delta}^{xz}((c_{3})_{xz},(c_{3}^{\prime})_{xz})=2$ and $d_{\Delta}^{yz}((c_{3})_{yz},(c_{3}^{\prime})_{yz})=1$ , whereas all other rational localizations of $c_{3}$ and $c_{3}^{\prime}$ coincide. It follows that $d_{\operatorname{rat}}(c_{3},c_{3}^{\prime})=3$ .

To complete the proof, we show that $d_{\operatorname{rat}}(c_{3},D)\geq 3$ for any $D\in\textsf{Choice}_{\mathrm{rat}}(X)$ .

(1)

If $z\notin D(xyz)$ , then either (1A) $z\notin D(xz)$ , or (1B) $z\notin D(yz)$ , using Axiom $\gamma$ .

(1A)

If $z\notin D(xz)$ , then we split the analysis into two subcases.

(1A1)

If $x\in D(xyz)$ , then $x\in D_{xyz}(xyz)$ , $z\notin D_{xyz}(xyz)$ , and $D_{xyz}(xz)=x$ . Therefore, from $z\in(c_{3})_{xyz}(xyz)\cap(c_{3})_{xyz}(xz)$ and $x\notin(c_{3})_{xyz}(xz)$ , we derive $d_{\Delta}^{xyz}\geq 3$ . We conclude $d_{\operatorname{rat}}(c_{3},D)\geq 3$ .

(1A2)

If $x\notin D(xyz)$ , then by Axiom $\gamma$ , $x\notin D(xy)$ or $x\notin D(xz)$ , and so $x\notin D_{xy}(xy)$ or $x\notin D_{xz}(xz)$ . Since $x\in(c_{3})_{xy}(xy)\cap(c_{3})_{xz}(xz)$ , we obtain $d_{\Delta}^{xy}\geq 1$ or $d_{\Delta}^{xz}\geq 1$ . Finally, since $z\in(c_{3})_{xyz}(xyz)\cap(c_{3})_{xyz}(xz)$ and $z\notin D_{xyz}(xyz)\cup D_{xyz}(xz)$ , we get $d_{\Delta}^{xyz}\geq 2$ , and so $d_{\operatorname{rat}}(c_{3},D)\geq 3$ .

(1B)

Suppose $z\notin D(yz)$ . We can assume that $z\in D(xz)$ , otherwise we would be done by case (1A). It follows that $z\in D_{xz}(xz)$ while $z\notin c_{3_{xz}}(xz)$ , hence $d_{\Delta}^{xz}\geq 1$ . Since $z\notin D_{xyz}(xyz)$ and $z\notin D_{xyz}(yz)$ , we conclude $d_{\Delta}^{xyz}\geq 2$ , and so $d_{{\operatorname{rat}}}(c_{3},D)\geq 3$ .

(2)

If $z\in D(xyz)$ , then $z\in D(xz)$ and $z\in D(yz)$ by Axiom $\alpha$ .

(2A)

If $x\notin D(xy)$ , then $d_{\Delta}^{xy}\geq 1$ , $d_{\Delta}^{xz}\geq 1$ , $d_{\Delta}^{yz}\geq 1$ , and so $d_{\operatorname{rat}}(c_{3},D)\geq 3$ .

(2B)

If $x\in D(xy)$ , we consider two subcases.

(2B1)

If $x\notin D(xyz)$ , then $x\notin D(xz)$ by Axiom $\gamma$ , hence $d_{\Delta}^{xz}\geq 2$ and $d_{\Delta}^{yz}\geq 1$ . We conclude $d_{\operatorname{rat}}(c_{3},D)\geq 3$ .

(2B2)

If $x\in D(xyz)$ , then $x\in D_{xyz}(xz)$ and $x\in D(xz)$ by Axiom $\alpha$ , hence $d_{\Delta}^{xyz}\geq 2$ and $d_{\Delta}^{xz}=1$ . Again, we conclude $d_{\operatorname{rat}}(c_{3},D)\geq 3$ .

Example 4.3.

Let $c_{1},c_{2},c_{3},c_{4}$ be the four choice correspondences over $X=\{x,y,z,w\}$ defined in Example 3.4. One can easily show that these three choices have the same $d_{\Delta}$ -degree of irrationality, being

[TABLE]

On the contrary, the metric $d_{\operatorname{rat}}$ agrees with the perception that $c_{2}$ is less irrational than $c_{3}$ , and $c_{4}$ is the most irrational of all, being

[TABLE]

The related computations are extremely long and tedious, so we omit them.202020However, they are available upon request.

4.2 A weighted degree of irrationality

The evaluation of the degree of irrationality of choice behavior described above can be refined, as long as the DM is able to provide additional pieces of information. Here we illustrate a possible refinement of it, which applies to the family of choice correspondences; in other words, we consider the special case of a decisive DM.

In a preliminary step, the DM is required to provide additional information about the ‘subjective desirability’ of all rational choice behaviors. Operationally, this is obtained by assigning weights to each rationalizable choice correspondence. According to intuition, very desirable rational behaviors should be given a weight less or equal than one, because this may produce the effect of contracting the rational distance of all choices close to them. On the contrary, less appealing rational behaviors should given a weight greater or equal than one, in order to possibly dilate the distance from rationalization. Once desirability is assessed, the irrational degree of a decisive choice behavior is then computed as the minimum weighted distance from the benchmark of rationality.

In the process of designing the weighting procedure, we adhere to some natural rules of conduct. We select the transitivity of the relation of revealed preference as our guiding parameter: the more transitive this relation is, the more desirable the associated behavior becomes, and the lower the corresponding weight must be. From this point of view, the most desirable choices will be those rationalized by weak orders (asymmetric and negatively transitive, hence transitive), which will be assigned the lowest weight among all rational behaviors. Less desirable levels are those of choices rationalized by semiorders (asymmetric, Ferrers, and semitransitive) (Luce, 1956), and by interval orders (asymmetric and Ferrers) (Fishburn, 1970, 1985). At an even lower desirability level lie all choices rationalized by transitive asymmetric relations that fail to be interval orders. At the bottom of the scale, we find those choices that are rationalized by asymmetric, acyclic and intransitive binary relations, which will be given the highest weight of all.

An even finer tuning of the weighting procedure can be achieved by employing the so-called strict and weak $(m,n)$ -Ferrers properties (Giarlotta and Watson, 2014, 2018), which provide a classification of all asymmetric and acyclic binary relations on a set according to their discrete level of transitivity.212121On the point, see also Cantone et al. (2016) for a classification of all rationalizable choices on the basis of the so-called axioms of $(m,n)$ -replacement consistency. The next definition provides a simplified version of these properties, which is however sufficient for our goal.

Definition 4.4 (Giarlotta and Watson, 2014).

Let $\succ$ be an asymmetric and acyclic binary relation over $X$ . Denote by $\succsim$ the canonical completion of $\succ$ , obtained by adding all $\succ$ -incomparable pairs to $\succ$ .222222Two (not necessarily distinct) elements $x,y\in X$ are $\succ$ -incomparable if nether $x\succ y$ nor $y\succ x$ holds. Technically, the canonical completion $\succsim$ is the extension of $\succ$ in which incomparability is transformed into indifference. In particular, the canonical completion $\succsim$ of $\succ$ is both reflexive (i.e., $x\succsim x$ for all $x\in X$ ) and complete (i.e., $x\succsim y$ or $y\succsim x$ for all distinct $x,y\in X$ ). For any integers $m\geq n\geq 1$ , we say that $\succ$ is $(m,n)$ -Ferrers if the joint satisfaction of $(x_{1}\succsim\ldots\succsim x_{m})$ and $(y_{1}\succsim\ldots\succsim y_{n})$ implies either $x_{1}\succsim y_{n}$ or $y_{1}\succsim x_{m}$ , for all (not necessarily distinct) $x_{1},\ldots,x_{m},y_{1},\ldots,y_{n}\in X$ .

It is easy to show that $(m,n)$ -Ferrers implies $(m^{\prime},n^{\prime})$ -Ferrers for any $1\leqslant m^{\prime}\leqslant m$ and $1\leqslant n^{\prime}\leqslant n$ (Giarlotta and Watson, 2014, Lemma 2.6). Furthermore, $(3,3)$ -Ferrers implies $(m,n)$ -Ferrers for any $m\geqslant n\geqslant 1$ (Giarlotta and Watson, 2014, Theorem 3.1(v)). Note also that $(3,3)$ -Ferrers relations are weak orders, $(3,1)$ - and $(2,2)$ -Ferrers relations are semiorders, $(2,2)$ -Ferrers relations are interval orders, $(2,1)$ -Ferrers relations are transitive, and $(1,1)$ -Ferrers are acyclic but intransitive.

Consequently, all asymmetric and acyclic binary relation on a given set of alternatives can be partitioned according to a lattice structure, which is induced by the satisfaction of $(m,n)$ -Ferrers properties. This lattice is composed of 14 pairwise disjoint sets, which in turn can be arranged into 9 desirability classes according to their discrete degree of transitivity: see Figure 1.232323This figure is a simple elaboration of Figure 6 in Giarlotta (2019). See also Giarlotta (2014), where the typical form of strong semiorders and strong interval orders is displayed in Figure 5. For instance, the most desirable class is that of weak orders, whereas the least desirable class comprises all intransitive preferences. We can finally define a weighted variation of the degree of irrationality.

Definition 4.5.

A feasible weighting map is a function $w\colon\{1,2,\ldots,9\}\to(0,2)$ , which assigns a positive weight to each desirability class in a way that

(monotonicity)

$i\leqslant j$ implies $w(i)\leqslant w(j)$ for all $i,j\in\{1,2,\ldots,9\}$ , and

(average property)

$\sum_{i=1}^{9}\frac{w(i)}{9}\in\left[1-\varepsilon,1+\varepsilon\right]$ for some $0\leqslant\varepsilon<1$ ,

where $\varepsilon$ is a discrimination threshold determined a priori by the DM.242424Here we do not dwell on the procedure to assess the discrimination threshold $\varepsilon$ . In fact, the sole purpose of this section is to illustrate a simple variant of our approach. Given a rationalizable choice $r$ over $X$ , denote by $i_{r}$ the desirability class of its relation of revealed preference $\succ_{r}$ . Then, for any $c\in\textsf{choice}(X)$ , the irrationality index of $c$ induced by $w$ is

[TABLE]

Definition 4.5 can be motivated as follows. The property of monotonicity ensures that the weight of rational choices decreases as the level of transitivity of the corresponding revealed preference increases. As a consequence, if, for instance, a choice behavior is close to a highly desirable rational choice, then its degree of irrationality will be accordingly contracted. Furthermore, the average property guarantees that the average weight given to a rational choice behavior belongs to a close neighborhood of $1$ according to a threshold established by the DM.

In the simplest case, all weights are the same, and the discrimination threshold is equal to [math]. This implies that the weighting function $w$ assigns weight equal to $1$ to all asymmetric and acyclic binary relations over $X$ . However, even in this very special case, it may happen that ${\operatorname{irr}}_{\operatorname{rat}}(c)\neq{\operatorname{irr}}_{\operatorname{rat}}^{w}(c)$ for some choice $c$ . The reason is that the weighted variant of our approach only applies to decisive choice behaviors, and so the computation of the minimum distance from the benchmark of rationality may give different results.

We conclude this section with an example, which showcases how a weighting procedure of rational choices yields a fine tuning of the results obtained in Example 3.4.

Example 4.6.

Let $c_{1},c_{2},c_{3},c_{4}$ be the four choice correspondences over $X=\{x,y,z,w\}$ defined in Example 3.4 (and further analyzed in Example 4.3). On a four-element set, the phenomenology of $(m,n)$ -Ferrers properties is quite poor, that is, many equivalence classes of the partition are empty. In fact, it suffices to assign weights to the following classes: (1) weak orders, (6) semiorders, (7) interval orders and semitransitive relations, and (9) intransitive relations. For the sake of illustration, first set $w(i):=1+0.1(i-5)$ for $i=1,\ldots,9$ . Clearly, $w$ is a feasible weighting map for any $0\leqslant\varepsilon<1$ . A computer-aided computation yields the following $d_{\operatorname{rat}}$ -degrees of irrationality induced by $w$ :

[TABLE]

Now define $w^{\prime}\colon\{1,2,\ldots,9\}\to(0,2)$ by $w^{\prime}(i):=0.8$ for $1\leqslant i\leqslant 4$ , $w^{\prime}(5)=0.9$ , and $w^{\prime}(i):=1.1$ for $6\leqslant i\leqslant 9$ . Again, $w^{\prime}$ is a feasible weighting map for any $0.2\leqslant\varepsilon<1$ . Now we get the $d_{\operatorname{rat}}$ -degrees of irrationality induced by $w^{\prime}$ become

[TABLE]

A sensitivity analysis connected to the weighting procedure and the threshold of discrimination may provide further insight into the DM’s preference system.

5 Measures of stochastic irrationality

In this last section we suggest how to adapt our approach to a stochastic environment. The underlying idea is to transform the search for a measure of irrationality into the formulation of a geometric problem (concerning polytopes).

For simplicity, we shall only consider the case of stochastic choice functions,252525Our approach can be extended to stochastic choice correspondences, too. as defined below.

Definition 5.1.

A stochastic choice function over $X$ is a map $p\colon X\times 2^{X}\setminus\{\varnothing\}\to[0,1]$ such that for all $a\in X$ and $A\in 2^{X}\setminus\{\varnothing\}$ , the following conditions hold:

•

$\sum_{a\in A}\,p(a,A)=1$ ,

•

$a\notin A$ implies $p(a,A)=0$ .

We denote by $\textsf{choice}^{*}(X)$ the family of all stochastic choice functions over $X$ .

As in the deterministic case, the first step in determining the degree of irrationality of a stochastic behavior consists of fixing a benchmark of rationality.

In their interesting approach, Apesteguia and Ballester (2015) essentially consider the finite family of deterministic rationalizable choices (which are in a one-to-one correspondence with linear orders) as the benchmark of rationality. Roughly speaking, the authors associate to a suitable stochastic choice behavior —a collection of observations— what they call a swap index, computed by using probabilities to weigh swaps in linear orders.

Our selection of the benchmark is instead an infinite family of stochastic choices, namely those that satisfy the following well-known model of rational behavior:

Definition 5.2 (Block and Marschak, 1960).

A stochastic choice function $p$ over $X$ satisfies the random utility model (for brevity, it is a RUM function) if there is a probability distribution $Pr$ on the set $\textsf{LO}(X)$ of all linear orders over $X$ such that for each $A\in 2^{X}\setminus\{\varnothing\}$ and $a\in A$ ,

[TABLE]

Hereafter, any RUM function will be called rational; accordingly, we shall denote by $\textsf{choice}^{*}_{\operatorname{rat}}(X)$ the family of all RUM functions over $X$ .

The selection of RUM as a prototype of stochastic rationality is statistically robust: see, among several related contributions, Marley and Regenwetter (2017) for a review of random utility models, McCausland et al. (2019) for a direct Bayesian testing of RUM, and Davis-Stober (2009, Section 8) for an application to axiomatic measurement theory.

Now an attempt to fully adapt our deterministic approach to a stochastic setting poses salient challenges. In fact, we need an economically significant metric —or, alternatively, a function that satisfies weaker properties, such as a ‘divergence’— which enables us to discern different levels of irrationality for different types of stochastic choice behaviors. However, none of the metrics/divergences considered in the literature appears to be a good fit for our goal,262626Some examples in Subsection 5.1 illustrate how different types of stochastic choice behaviors are not adequately distinguished by some well-known distances/divergences. and it seems not simple to design new metrics that do the job.

In view of the difficulties illustrated above, here we choose a different path to evaluate the level of irrationality of a stochastic choice behavior. Specifically, we take advantage of a known characterization of the RUM model to attach a vector with $|X|$ -many components to each stochastic behavior: the higher the entries in the vector, the most irrational the choice behavior. Then, to compare irrationality levels, we use a permutation-invariant version of the classical Pareto ordering of these vectors, which arranges all irrational stochastic choices into a preordered set (ties and incomparability being allowed).

As a preliminary step, we recall the known characterization of the RUM model.

Definition 5.3 (Block and Marschak, 1960; Falmagne, 1978).

Let $p$ be a stochastic choice function over $X$ . For any $T\in 2^{X}\setminus\{\varnothing\}$ and $a\in T$ , define

[TABLE]

The $q_{a,T}$ ’s are the Block-Marshak polynomials (BM polynomials, for brevity)272727‘Polynomial’ is the usual term, although $q_{a,T}$ is a linear expression in the $p(a,U)$ ’s of $p$ .

Block and Marschak (1960) show that having $q_{a,T}\geq 0$ for suitable menus $T\subseteq X$ is a necessary condition for having a RUM function. However, the general definition of the BM polynomials and the complete characterization of the random utility model came almost twenty years later:282828See Fiorini (2004) for an elegant and very short proof of this result, which involves Möbius inversion and network flow.

Theorem 5.4 (Falmagne, 1978).

A stochastic choice function is RUM if and only if all its Block-Marshak polynomials are nonnegative.

Theorem 5.4 allows us to derive a measure of irrationality for stochastic choices.

Definition 5.5.

Let $p$ be a stochastic choice function over $X=\{x_{1},\ldots,x_{n}\}$ , where $n\geqslant 2$ . For each $x_{i}\in X$ , let

[TABLE]

The $n$ -tuple $v_{p}=\big{(}v_{p}(x_{1}),\ldots,v_{p}(x_{n})\big{)}\in{\mathbb{R}}^{n}_{+}$ is the negativity vector of $p$ .

Clearly, the larger the entries in the negativity vector, the more irrational the corresponding stochastic choice. By Definition 5.5 and Theorem 5.4, all RUM functions —and, in particular, all deterministic rationalizable choices— have $(0,\ldots,0)$ as negativity vector. For all non-RUM functions, the next definition establishes a way to compare their (strictly positive) irrationality.

Definition 5.6.

Denote by $\mathscr{S}(X)$ the family of all permutations of $X$ . Define a binary relation $\precsim^{*}$ over $\textsf{choice}^{*}(X)$ as follows:

[TABLE]

for any $p,p^{\prime}\in\textsf{choice}^{*}(X)$ . Then, we say that

•

$p$ and $p^{\prime}$ are equally irrational if $p\sim^{*}p^{\prime}$ (i.e., $p\precsim^{*}p^{\prime}$ and $p^{\prime}\precsim^{*}p$ ),

•

$p$ is less irrational than $p^{\prime}$ if $p\prec^{*}p^{\prime}$ (i.e., $p\precsim^{*}p^{\prime}$ and $\neg(p^{\prime}\precsim^{*}p)$ ), and

•

$p$ and $p^{\prime}$ are incomparably irrational if $p\perp^{*}p^{\prime}$ (i.e., $\neg(p\precsim^{*}p^{\prime})$ and $\neg(p^{\prime}\precsim^{*}p)$ ).

The pair $\left(\textsf{choice}^{*}(X),\precsim^{*}\right)$ is a preordered set,292929Recall that a preorder is a reflexive and transitive (but possibly incomplete) binary relation. having all RUM functions as a minimum.

The next example presents two stochastic choice functions over a set of size four. We shall compute all related BM polynomials and the two associated negativity vectors, to finally conclude that one function is more irrational than the other.

Example 5.7.

Set $X=\{x,y,z,w\}$ . Let $p_{1}$ be the stochastic choice function over $X$ defined in Table 1. For the sake of illustration, we explicitly compute the first two BM polynomials of $p_{1}$ associated to the item $x$ :

[TABLE]

By Definition 5.5, summing all entries in the last four columns of Table 1 yields the negativity vector of $p_{1}$ , which is $v_{p_{1}}=(0.6,0.2,0.2,0.1)$ .

A different stochastic choice function $p_{2}$ over $X$ is given in Table 2.

Note that $p_{2}$ provides a minimal counterexample to the fact that the property of monotonicity303030A stochastic choice function $p$ over $X$ is monotonic (or regular) if for all $x\in X$ and $A,B\in 2^{X}$ , $A\subseteq B$ implies $p_{2}(x,B)\leqslant p_{2}(x,A)$ : see Block and Marschak (1960). does not characterize the random utility model: in fact, $p_{2}$ is monotonic but not RUM.

Since the negativity vector of $p_{2}$ is $v_{p_{2}}=(0,0.1,0,0)$ , we get $v_{p_{2}}(a)\leqslant_{{\operatorname{Par}}}v_{p_{1}}(a)$ for all $a\in X$ , and so we conclude that $p_{2}\prec^{*}p_{1}$ .

As possibly expected, isomorphic stochastic choice functions —in the sense clarified below— are equally irrational.

Definition 5.8.

Two stochastic choice functions $p,p^{\prime}$ over $X$ are isomorphic is there is a permutation $\sigma\colon X\to X$ such that

[TABLE]

for all $x\in X$ and $A\in 2^{X}$ . The bijection $\sigma$ is called an isomorphism between $p$ and $p^{\prime}$ .

The next result shows that our measure of stochastic irrationality is independent of the names of alternatives.

Lemma 5.9.

For any stochastic choices $p,p^{\prime}$ over $X$ , if $\sigma$ is an isomorphism between $p$ and $p^{\prime}$ , then $v_{p}(x)=v_{p^{\prime}}(\sigma(x))$ for all $x\in X$ . Thus, isomorphic stochastic choice functions always have the same level of irrationality.

Proof. Observe that

[TABLE]

where the last equality is given by the fact that there is a one-to-one correspondences between the family of all menus $U$ containing $T$ and the family of all menus $U^{\prime}$ containing $\sigma(T)$ . We conclude that the BM polynomial $q_{x,T}$ of $p$ is equal to the BM polynomial $q_{\sigma(x),\sigma(T)}$ of $p^{\prime}$ . The claim follows. $\Box$

It is currently under study the implementation of a geometric approach (based on polytopes) to the measure of the irrationality of a stochastic choice behavior.

5.1 Some related literature

Here we review some existing metrics/divergences that apply to stochastic choices, and point out some possible drawbacks in detecting different levels of irrationality.

Definition 5.10.

Let $\delta\colon\textsf{choice}^{*}(X)\times\textsf{choice}^{*}(X)\to{\mathbb{R}}$ be the map defined by

[TABLE]

for all $p,p^{\prime}\in\textsf{choice}^{*}(X)$ . Then $\delta$ is a metric, called the total variation distance.313131This name originates from the process of considering all differences between two objects (stochastic functions, in this case) and taking either the sum or the supremum (the maximum, in this case).

The metric $\delta$ may not be a good fit for our purpose, as the next example shows.

Example 5.11.

Define a stochastic choice function $p$ over $X=\{x,y,z\}$ by $p(a,A):=\frac{1}{|A|}$ for all $A\in 2^{X}\setminus\{\varnothing\}$ and $a\in A$ . Clearly, $p$ is RUM function. Next, we define two additional stochastic choice functions $p_{1}$ and $p_{2}$ over $X$ as follows:

•

$p_{1}(a,A):=p(a,A)$ for all $A\in 2^{X}\setminus\{\varnothing,X\}$ and $a\in A$ ,

•

$p_{1}(x,X):=0.6$ , $p_{1}(x,X):=0.2$ , and $p_{1}(z,X):=0.2$ ;

•

$p_{2}(x,\{x,y\}):=0.7$ and $p_{2}(y,\{x,y\}):=0.3$ ,

•

$p_{2}(x,\{x,z\}):=0.3$ and $p_{2}(z,\{x,z\}):=0.7$ ,

•

$p_{2}(y,\{y,z\}):=0.5$ and $p_{2}(z,\{y,z\}):=0.5$ ,

•

$p_{2}(x,X):=0.6$ , $p_{2}(y,X):=0.3$ , and $p_{2}(z,X):=0.1$ .

It can be checked that $\delta(p,p_{1})=\delta(p,p_{2})=\frac{4}{15}$ , that is, the metric $\delta$ puts $p_{1}$ and $p_{2}$ at the same total variation distance from the rational function $p$ . However, $v_{p_{1}}=(0.2,0,0)$ and $v_{p_{2}}=(0.3,0,0.1)$ , and so $p_{1}\prec^{*}p_{2}$ by Definition 5.6.

Next, we consider a weaker type of distance, namely a ‘divergence’, which only satisfies the non-negativity property A0.1 of a metric, but not necessarily symmetry A0.2 and the triangle inequality A0.3.

Definition 5.12 (Kullback and Leibler, 1951).

Let $D_{\mathrm{KL}}\colon\textsf{choice}^{*}(X)\times\textsf{choice}^{*}(X)\to{\mathbb{R}}$ the function defined by

[TABLE]

for all $p,p^{\prime}\in\textsf{choice}^{*}(X)$ . The map $D$ is called the Kullback-Leibler divergence.

Example 5.13.

Let $X$ , $p$ , $p_{1}$ , and $p_{2}$ be exactly as in Example 5.11. Define $p_{3}$ as follows:

•

$p_{3}(x,\{x,y\})=0.7$ and $p_{3}(y,\{x,y\})=0.3$ ,

•

$p_{3}(x,\{x,z\})=0.3$ and $p_{3}(z,\{x,z\})=0.7$ ,

•

$p_{3}(y,\{y,z\})=0.7$ and $p_{3}(z,\{y,z\})=0.3$ ,

•

$p_{3}(x,X)=\frac{1}{3}$ , $p_{3}(y,X)=\frac{1}{3}$ , and $p_{3}(z,X)=\frac{1}{3}$ .

Note that the negativity vector of $p_{3}$ is $(0.03,0.03,0.03)$ . One can check that $D_{\mathrm{KL}}(p_{1}||p)<D_{\mathrm{KL}}(p_{3}||p)<D_{\mathrm{KL}}(p_{2}||p)$ . On the other hand, according to Definition 5.6, we have $p_{3}\perp^{*}p_{1}$ and $p_{3}\perp^{*}p_{2}$ .

Examples 5.11 and 5.13 show that both the total variation distance and the Kullback-Leibler divergence may fail to capture some features of irrationality. Although one may argue that both examples only deal with one rational function —possibly the most emblematic—, a similar pathology is still present when calculating distances from other rational functions. These issues suggest that Definition 5.6 may provide a more adequate tool in assigning levels of irrationality to stochastic choices.

Bibliography41

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1)
2Aizerman and Aleskerov (1995) Aizerman, M., and Aleskerov, F. , 1995. Theory of Choice. North Holland .
3Aleskerov and Monjardet (2002) Aleskerov, F., and Monjardet, B. , 2002. Utility Maximization, Choice and Preference. Springer, Berlin .
4Alabayrak and Aleskerov (2000) Albayrak, S. R., and Aleskerov, F. , 2000. Convexity of choice function sets. In: Bogazici University Research Paper ISS/EC-2000-01.
5Alcantud et al. (2022) Alcantud, J. C. R., Cantone, D., Giarlotta, A., and Watson, S. , 2022. Rationalization of indecisive choice behavior by majoritarian ballots. Ar Xiv: 2210.16885 [econ.TH].
6Ambrus and Rozen (2014) Ambrus, A., and Rozen, K. , 2014. Rationalising choice with multi-self models. Economic Journal 125(585): 1136–1156.
7Apesteguia and Ballester (2015) Apesteguia, J., and Ballester, M. A. , 2015. A measure of rationality and welfare. Journal of Political Economy 123: 1278–1310.
8Arrow (1950) Arrow, K. J. , 1950. A difficulty in the concept of social welfare. Journal of Political Economy 58: 328–346.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Rational measures of irrationality

Abstract

1 Introduction

2 Measures of deterministic irrationality

2.1 Preliminaries

2.2 Related literature

3 Metrics on quasi-choices

Definition 3.1**.**

3.1 Klamler’s metric

Definition 3.2** (Klamler, 2008).**

Example 3.3**.**

Example 3.4**.**

Theorem 3.5** (Carpentiere et al., 2023).**

Definition 3.6**.**

Definition 3.7**.**

Lemma 3.8** (Elementary Decomposability).**

3.2 A rational variation of Klamler’s metric

Theorem 3.9** (Sen, 1971).**

Definition 3.10**.**

Example 3.11**.**

Lemma 3.12**.**

Remark 3.13**.**

4 Distance-based degrees of irrationality

4.1 A simple degree of irrationality

Definition 4.1**.**

Example 4.2**.**

Example 4.3**.**

4.2 A weighted degree of irrationality

Definition 4.4** (Giarlotta and Watson, 2014).**

Definition 4.5**.**

Example 4.6**.**

5 Measures of stochastic irrationality

Definition 5.1**.**

Definition 5.2** (Block and Marschak, 1960).**

Definition 5.3** (Block and Marschak, 1960; Falmagne, 1978).**

Theorem 5.4** (Falmagne, 1978).**

Definition 5.5**.**

Definition 5.6**.**

Example 5.7**.**

Definition 5.8**.**

Lemma 5.9**.**

5.1 Some related literature

Definition 5.10**.**

Example 5.11**.**

Definition 5.12** (Kullback and Leibler, 1951).**

Example 5.13**.**

Definition 3.1.

Definition 3.2 (Klamler, 2008).

Example 3.3.

Example 3.4.

Theorem 3.5 (Carpentiere et al., 2023).

Definition 3.6.

Definition 3.7.

Lemma 3.8 (Elementary Decomposability).

Theorem 3.9 (Sen, 1971).

Definition 3.10.

Example 3.11.

Lemma 3.12.

Remark 3.13.

Definition 4.1.

Example 4.2.

Example 4.3.

Definition 4.4 (Giarlotta and Watson, 2014).

Definition 4.5.

Example 4.6.

Definition 5.1.

Definition 5.2 (Block and Marschak, 1960).

Definition 5.3 (Block and Marschak, 1960; Falmagne, 1978).

Theorem 5.4 (Falmagne, 1978).

Definition 5.5.

Definition 5.6.

Example 5.7.

Definition 5.8.

Lemma 5.9.

Definition 5.10.

Example 5.11.

Definition 5.12 (Kullback and Leibler, 1951).

Example 5.13.