A rational measure of irrationality
Davide Carpentiere, Alfio Giarlotta, Stephen Watson

TL;DR
This paper introduces a systematic way to classify and measure the degree of irrationality in deterministic choice behaviors by defining a metric-based distance from rational benchmarks, including stochastic models.
Contribution
It proposes a novel metric for quantifying deviations from rational choice and extends this to stochastic irrationality using the random utility model as a benchmark.
Findings
Classifies all deterministic choice behaviors by their irrationality degree.
Introduces a metric to measure deviations from rationality.
Defines a stochastic irrationality measure using Block-Marschak polynomials.
Abstract
All possible types of deterministic choice behavior are classified by their degree of irrationality. This classification is performed in three steps: (1) select a benchmark of rationality, for which this degree is zero; (2) endow the set of choices with a metric to measure deviations from rationality; and (3) compute the distance of any choice behavior from the selected benchmark. The natural candidate for step 1 is the family of all rationalizable behaviors. A possible candidate for step 2 is a suitable variation of the metric described by Klamler (2008), which displays a sharp discerning power among different types of choice behaviors. In step 3 we use this new metric to establish the minimum distance of any choice behavior from the benchmark of rationality. Finally we describe a measure of stochastic irrationality, which employs the random utility model as a benchmark of rationality,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models · Decision-Making and Behavioral Economics
Rational measures of irrationality
Davide Carpentiere, Alfio Giarlotta , Stephen Watson Department of Mathematics and Computer Science, University of Catania, Italy.Department of Economics and Business, University of Catania, Italy. [email protected] *(corresponding author)*Department of Mathematics and Statistics, York University, Toronto, Canada.
Abstract
All possible types of deterministic choice behavior are classified by their degree of irrationality. This classification is performed in three steps: (1) select a benchmark of rationality, for which this degree is zero; (2) endow the set of choices with a metric to measure deviations from rationality; and (3) compute the distance of any choice behavior from the selected benchmark. The natural candidate for step 1 is the family of all rationalizable behaviors. A possible candidate for step 2 is a suitable variation of the metric described by Klamler (2008), which displays a sharp discerning power among different types of choice behaviors. In step 3 we use this new metric to establish the minimum distance of any choice behavior from the benchmark of rationality. Finally we describe a measure of stochastic irrationality, which employs the random utility model as a benchmark of rationality, and the Block-Marschak polynomials to measure deviations from it.
Keywords: Metric space; choice; rationalization; revealed preference; transitivity; -Ferrers property; choice localization; random utility model; Block-Marschak polynomials.
JEL Classification: D01, D81, C44.
1 Introduction
The goal of this paper is to evaluate the irrationality level of all possible choice behaviors on a finite set of alternatives. We perform this task in three successive steps:
- (1)
establish a benchmark of rational choice behavior;
- (2)
endow the set of all choice behaviors with a highly discerning metric;
- (3)
compute the distance of any behavior from the benchmark of rationality.
The output of this process is a rational degree of irrationality of any deterministic choice behavior. (The use of the term ‘rational’ is motivated by the fact that we compute a distance from rationality in order to measure irrationality.) Before addressing in detail each step of this approach, let us discuss the general domain of our analysis.
Classically, the literature on choice theory is exclusively concentrated on ‘decisive’ choice behaviors, intended as situations in which the decision maker (DM) selects at least one item from any nonempty subset of the ground set: see, among a large amount of relevant contributions, the seminal papers by Samuelson (1938), Arrow (1959), and Sen (1971). In other words, the domain of analysis is classically restricted to choice correspondences, which are functions mapping nonempty sets into nonempty subsets. In addition, most of the recent models of ‘bounded rationality in choice’ typically deal with the even more restricted case of choice functions, which are single-valued choices correspondences (i.e., a unique item is selected from any nonempty menu): see, among several papers on the topic, Manzini and Mariotti (2007) and Masatlioglu et al. (2012).111See Giarlotta et al. (2022a) for a list of many models of bounded rationality in choice and a common analysis of their features by a unified approach.
Despite the great abundance of literature on choice functions and choice correspondences, it appears more realistic to consider the general case of quasi-choices, which model the behavior of possibly indecisive DMs: in this situation, the agent is allowed to select all, some, or none of the items available in any menu. To justify the potential interest in this approach, very recently Costa-Gomes et al. (2022) mention some compelling experiments, which suggest that choice models rejecting decisiveness may offer a powerful tool to study revealed preferences.222See also Chapter 1 of the advanced textbook on microeconomic theory by Kreps (2013), as well as the arguments presented in Section 1 of the recent paper by Alcantud et al. (2022). That is why in this paper we evaluate the rationality level of any type of choice behavior, may it be decisive or not.
Now we describe the three stages of our approach.
(1)
The first step consists of the selection of the benchmark of rationality —the ‘zero’— from which deviations ought to be measured. We select the most natural candidate, namely the family of all quasi-choices over the given set that are considered ‘rational’ according to revealed preference theory (Samuelson, 1938). Technically, these are the quasi-choices that can be explained by the maximization of a binary relation.333A more restrictive benchmark of rationality may be the family of quasi-choices rationalizable by binary relations satisfying some desirable properties.
(2)
The selection of a metric is the key step: this distance should accurately discerns among different types of choice behavior in an economically significant way. A possible candidate for this goal is the distance on quasi-choices proposed by Klamler (2008), which is computed by summing the cardinalities of all symmetric differences between pairs of choice sets. Due to its decomposability into trivial metrics, Klamler’s distance is however not well-suited for our goals, due to its low discernibility power. Using a notion of local rationalization, we design a refinement of this metric, which displays a sharp level of discrimination among different choices.
(3)
To finally establish the degree of irrationality for any deterministic choice behavior, we use the metric selected at step 2 to compute the minimum distance of a quasi-choice from a rationalizable one. In this way, all quasi-choices belonging to the benchmark of rationality have a degree of irrationality equal to zero, whereas all the others display a degree with a strictly positive value. Moreover, the more irrational a choice behavior is, the higher the value of the index becomes. We also describe a weighted version of this approach. Formally, since each rationalizable choice is explained by the maximization of a unique asymmetric preference —the strict revealed preference (Samuelson, 1938)—, we measure the subjective desirability of each rational behavior by the ‘level of transitivity’ of this binary relation:444Both Mas-Colell et al. (1995) and Kreps (2013) consider transitivity and completeness the basic tenets of economic rationality. the more this preference is close to being fully transitive,555By ‘fully transitive’ we mean that both strict preference and associated incomparability are transitive. the higher the desirability of the choice becomes. Once subjective desirability is encoded, we measure the degree of irrationality of any behavior by taking a weighted distance from rational behavior.
Finally, we suggest a probabilistic extension of our approach, which applies to stochastic choice functions. Recall that a stochastic choice function assigns a real number to each pair formed by a menu and an item in it, evaluating the likelihood of that item being selected from that menu. Choice functions are special stochastic choices in which this likelihood is one for exactly one item in a menu and zero for all the others.
The steps to measure the irrationality of a stochastic choice behavior are, however, different from the ones of the deterministic setting. Specifically, the first step is again the selection of a benchmark of rationality, for which we take the family of all stochastic choice functions satisfying the random utility model (RUM) (Block and Marschak, 1960). On the other hand, since the second and the third step of the deterministic approach are hardly adaptable, we employ a different procedure. In fact, we take advantage of the characterization established by Falmagne (1978), who shows that a stochastic choice function satisfies RUM if and only if all its Block-Marschak polynomials are non-negative. Therefore, any choice that fails to satisfy RUM must have at least one negative Block-Marschak polynomial. Upon summing up all these negative polynomials for each element in the ground set, we obtain a negativity vector, which provides a discerning measure of the irrationality of a stochastic choice behavior. The comparison of these vectors is then performed by a permutation-invariant Pareto ordering, which in turn yields a partial classification of all stochastic choices according to their degree of irrationality.
The paper is organized as follows. Section 2 collects preliminary notions and presents a review of the related literature on deterministic choices. In Section 3 we describe the metric introduced by Klamler, and then a highly discerning variation of it. In Section 4 we formally define two distance-based degrees of irrationality of a choice behavior, and show the soundness of the novel metric for this task. In Section 5 we suggest an extension of our approach to a stochastic environment.
2 Measures of deterministic irrationality
First we recall some preliminary notions in choice theory. Then we suggest several ways to measures the irrationality of a deterministic choice behavior, and present a quick review of recent literature on the topic.
2.1 Preliminaries
A finite set of alternatives is fixed throughout. We use to denote the family of all nonempty subsets of .
A quasi-choice correspondence over is a function such that for all . A choice correspondence over is a quasi-choice that is never empty-valued on nonempty sets, that is, a function such that for all .666To emphasize decisiveness, we shall use upper case letters (, , etc.) to denote possibly indecisive choice behaviors, that is, quasi-choice correspondences. On the other hand, lower case letters (, , etc.) will be employed to denote decisive choice behaviors, that is, choice correspondences. Sets in are menus, elements of a nonempty menu are items, and the set (or ) is the choice set of the menu . Unless confusion may arise, hereafter we speak of quasi-choices and choices, respectively. Moreover, (resp. ) denotes the family of all quasi-choices (resp. choices) over .
A binary relation over is a subset of , which is:
-
asymmetric if implies for all ;
-
irreflexive if holds for no ;
-
acyclic if holds for no ();
-
transitive if implies for all ;
-
negatively transitive if implies for all .
Note that (i) asymmetry implies irreflexivity, (ii) transitivity and asymmetry implies acyclicity, and (iii) asymmetry and negative transitivity implies transitivity. We will often refer to an asymmetric binary relation as a (strict) preference.
Choices and preferences are closely related to each other. In fact, since the seminal work of Samuelson (1938), the ‘rationality’ of a decisive choice behavior is classically modeled by the notion of ‘binary rationalizability’, that is, the possibility to explain it by maximizing a suitable binary relation. Formally, a choice is rationalizable if there is an asymmetric binary relation over such that for any nonempty menu , the equality
[TABLE]
holds. The binary relation is called the (strict) preference revealed by . Note that must also be acyclic in order to rationalize the choice . Moreover, the asymmetric relation of revealed preference is unique for any rationalizable choice.777Here, we purposely avoid mentioning the symmetric part of the relation of reveled preference, because it is irrelevant to detect the rationalizability of a choice.
2.2 Related literature
In view of our goal to distinguish choice behaviors by their consistency features, the notion of rationalizability is the most popular in the literature. This notion was first introduced for choice functions (that is, single-valued choice correspondences), and then extended to choice correspondences. However, rationalizability can be naturally generalized to quasi-choices, provided that the rationalizing preference is allowed not to be irreflexive, asymmetric, or acyclic. Formally, we call a quasi-choice rationalizable if there is an arbitrary binary relation over —here denoted by ‘’ to emphasize its arbitrariness— such that the equality
[TABLE]
holds for all menus . Here the key fact is the possible lack of properties of , which follows from the necessity to model indecisive choice behaviors. For instance, since asymmetry is not guaranteed, we may have for some distinct elements and , in which case is empty.888To justify such a situation, imagine a political ballot in which the two remaining candidates are extremists, and my moderate political view suggests me to abstain from voting. Similarly, the possible lack of irreflexivity of permits situations of the type , which in turn yields .999For instance, if a restaurant only offers a chocolate cake as dessert and I am allergic to chocolate, then I shall avoid taking dessert. Note also that, contrary to the case of choices, the rationalizable preference —which is called a voter by Alcantud et al. (2022)— need not be unique for the general case of quasi-choices.101010On this point, see Section 2 in Alcantud et al. (2022). Here the authors extensively dwell on the reasons motivating the more general use of quasi-choices instead of choices, and the use of arbitrary binary relations to justify choice behavior.
All in all, according to this classical paradigm, any (decisive or indecisive) choice behavior is regarded irrational if it fails to be rationalizable. This yields a simple dichotomy rational/irrational or, equivalently, rationalizable/non-rationalizable. However, this dichotomy not very satisfactory in practice, because rationalizability fails to explain the overwhelming majority of observed choice behaviors.111111For a precise computation of the fraction of rationalizable choices over a set of fixed size, see Giarlotta et al. (2022a, Lemma 6)
Recently, following the inspiring analysis of Simon (1955), the notion of rationalizability has been amended by several forms of bounded rationality, which aim to explain a larger portion of choice behaviors by means of more flexible paradigms. To wit this trend, there are tens of models of bounded rationality in choice that have been proposed in the last twenty years: see Giarlotta et al. (2022a) for a vast account of them. The dichotomy boundedly rational/boundedly irrational is certainly more satisfactory than the rational/irrational one, allowing one to identify choice behaviors that obey some more relaxed (but still justifiable) constraints.121212The fraction of boundedly rational choice functions is definitively larger than that of rationalizable choices: compare Lemma 6 with Theorem 3 in Giarlotta et al. (2022a). However, this bounded rationality approach does not apply to most choice behaviors: in fact, it has essentially been proposed exclusively for choice functions, with very few cases of choice correspondences, moreover leaving completely out the case of quasi-choices.
A conceptually different modelization of rationality does not distinguish between (bounded) rationality and (bounded) irrationality. Rather, it creates a partition of the family of choices in several classes, each of which is assigned a degree of rationality. A seminal approach in this direction is the rationalization by multiple rationales (RMR) of Kalai et al. (2002). The RMR model yields a partition of the family of all choice functions over a set with items into equivalence classes of rationality, which are determined by the minimum number of linear orders that are necessary to explain decisive choice behavior: the larger this number, the less rational the behavior.131313Very recently, a structured version of the RMR model, called choice by salience, has been proposed by Giarlotta, et al. (2022b). Rationalizable choice functions obviously belong to the first class of rationality, since a unique linear order suffices. On the other side of the scale of rationality, we find those choice functions that require the maximum number of rationales (namely ) to be justified. Despite its conceptually appealing motivation, the RMR model displays some drawbacks: (i) the family of rationalizing linear orders only provides a ‘non-structured’ explanation of choice behavior; (ii) the class of maximally irrational choices (i.e., the ones requiring rationales) essentially collects all choices, even for very small sets of alternatives; and (iii) this model only applies to choice functions (but it could be naturally extended to choice correspondences).141414The choice model based on salience (Giarlotta, et al., 2022b) creates a partition into classes of rationality, and positively addresses the first two issues of the RMR approach. Specifically, concerning (1), a binary relation of salience restricts the application of rationales to those indexed by the maximally salient items of a menu. Concerning (2), the smallest choice function in the last class of rationality that the authors are able to exhibit is defined on a set of 39 elements.
Another approach devoted to identify the degree of irrationality of a deterministic choice function is due to Ambrus and Rozen (2014). As for the RMR model, also this approach is based on a counting technique. Specifically, the authors use a classical property of choice consistency —namely Independence of Irrelevant Alternatives (Arrow, 1950), which is equivalent to Axiom (Chernoff, 1954) for choice functions— to establish the degree of irrationality of a choice. They count the number of violations of Axiom that a choice behavior exhibits: the larger this number, the less rational the behavior. In particular, they introduce a notion of violations of Axiom , and accordingly define the index of irrationality of a choice by counting all menus that violate Axiom . The abstract idea of their approach is appealing: it accounts to measure irrationality by counting deviations from rationality according to an axiomatic parameter (Axiom ).
As we shall see, the approach developed in this paper measures the irrationality of choice behaviors in a way inspired by Ambrus and Rozen (2014). In fact, similarly to them, we analyze deviations from rationality according to axiomatic parameters, namely Axioms and (Sen, 1971), which are equivalent to the rationalizability of a quasi-choice. However, contrary to Ambrus and Rozen (2014), we do not directly count violations of properties of choice consistency. Instead, we use an indirect approach: first we establish a theoretical way to measure violations, that is, a metric, and only then we count deviations from rationality using this metric. Of course, the soundness of such a procedure boils down to the selection of a metric that is both economically significant and highly discerning. The next three sections will extensively address this issue.
3 Metrics on quasi-choices
This section is devoted to present ways to endow the family of all possible choice behaviors with metrics. Specifically, first we recall a metric due to Klamler (2008), and then describe a variation of it, which showcases a rather sharp discernibility power. In Section 4 we shall employ this novel metric as the measuring stick to evaluate deterministic deviations from rational behavior. To start, we recall the notion of distance between quasi-choices.
Definition 3.1**.**
A metric on is a map such that for all , the following properties hold:
- [A0.1]
, and equality holds if and only if ;
- [A0.2]
;
- [A0.3]
.
Property A0.1 is non-negativity, property A0.2 is symmetry, and property A0.3 is the triangle inequality.
3.1 Klamler’s metric
The symmetric difference of sets (Kemeny, 1959) induces a metric on quasi-choices:
Definition 3.2** (Klamler, 2008).**
Let be the function defined as follows for all :
[TABLE]
By Definition 3.2, the distance between two quasi-choices over the same set of alternatives is obtained by a simple and intuitive procedure: first count the number of items in a menu that are in one choice set but not in the other one, and then take the sum of these numbers over all menus. Usually, being simple and intuitive is regarded as a good feature of a notion. Unfortunately, here this fact translates into an oversimplified evaluation of the distance between two behaviors, which totally neglects their structural features. Specifically, by only looking at the ‘size’ of the disagreement of two quasi-choices over menus, Definition 3.2 fails to consider the ‘semantics’ of this disagreement, which lies in the very nature of the items selected by exactly one of them. This in turn produces some important shortcomings of this metric in the process of detecting deviations from rational behavior. The next two examples provide striking instances of this kind.
Example 3.3**.**
Consider the following three choice functions on (the unique item selected from each menu is underlined):
[TABLE]
The choices are equal on pairs of items but differ on the full menu . On pairs, the selection process is reproduced by maximizing the linear order . However, is rationalizable by , whereas and are not. Intuition suggests that should be further than from the rational choice . For instance, if we use the linear order to rationalize pairs of items, then selects the second-best item from , whereas ends up selecting the worst item of the three.151515Of course, one may always consider different scenarios, in which is regarded more rational than . However, these scenarios appear to be less likely to happen. On the other hand, the metric does regard and as equally distant from , because we have
[TABLE]
Example 3.4**.**
Let be four choice correspondences over , which are defined exactly in the same way for all menus distinct from and , namely
[TABLE]
However, select different elements from the two menus and , namely
,
,
,
.
Note that is rationalizable by the relation over such that , , , and . On the other hand, the three choices fail to be rationalizable, but they have exactly the same distance from the rationalizable choice :
[TABLE]
However, similarly to Example 3.3, it is reasonable to assume that is ‘semantically’ closer to than is: in fact, selects from the menus and some items that are better ranked (by ) than those selected by . There are also solid arguments to validate the opinion that should be the farthest choice from .
The low discernibility power of is due to (some of) the properties it satisfies. Carpentiere et al. (2023) —slightly correcting the findings of Klamler (2008)— prove that the following properties characterize (a universal quantification is implicit):
- [A1]
if and only if is between and ;161616The notion of ‘betweenness’ is due to Alabayrak and Aleskerov (2000): is between and if holds for any .
- [A2]
if and result from, respectively, and by the same permutation of alternatives, then ;
- [A3]
if and agree on all (nonempty) menus in except for a subfamily , then the distance is determined exclusively from the choice sets over ;
- [A4*′*]
if only disagree on a menu such that and for some , then ;
- [A5’]
for all and , there is with the property that , for all , and .
Axioms A1, A2, A3, A4’, and A5’ are rather intuitive requirements for a metric on the family of quasi-choices. In fact, Axiom A1 strengthens the triangle inequality A0.3 by requiring that equality holds exactly for cases of betweenness. Axiom A2 states a condition of invariance under permutations. Axiom A3 is a separability property, whereas Axiom A4’ is a condition of translation-invariance. The first four properties produce a unique metric, up to some multiplicative coefficients that only depend on the size of the menu: Axiom A5’ forces these coefficients to be unique.
As announced, we have:
Theorem 3.5** (Carpentiere et al., 2023).**
The unique metric on that satisfies Axioms A1, A2, A3, A4’, A5’ is .
Unfortunately, the discernibility power of among different choice behaviors is rather low, which is essentially due to the satisfaction of Axioms A1 and A3. To illustrate this fact, below we summarize some of the findings in Carpentiere et al. (2023).
Definition 3.6**.**
A quasi-choice on is elementary if there is at most one menu such that . For any such that , we denote by the elementary quasi-choice over defined as follows:
[TABLE]
Definition 3.7**.**
Let be a metric on , and . Denote by the family of all nonempty subsets of . Define a metric by
[TABLE]
for all . We call the characteristic metric induced by on .
Any metric on satisfying A1 and A3 —hence, in particular, — is a sum of characteristic metrics:
Lemma 3.8** (Elementary Decomposability).**
Let be a metric on satisfying Axioms A1 and A3. For all ,
[TABLE]
The metric defined in the next section satisfies neither A1 nor A3.
3.2 A rational variation of Klamler’s metric
We design a novel metric by suitably modifying Klamler’s distance. This variation is inspired by Ambrus and Rozen (2014), because we employ two axioms of choice consistency —in place of one— to guide its construction:
Axiom :
for all and , if and , then ;
Axiom :
for all and , if and , then .
Axiom is due to Chernoff (1954). In words, if an item is selected from a menu, then it is also chosen from any smaller menu containing it. This property is often referred to as Standard Contraction Consistency. Its role in abstract theories of individual and social choice is central. Nehring (1997, p. 407) even calls Axiom “the mother of all choice consistency conditions’’. Axiom , often referred to as Standard Expansion Consistency, is due to Sen (1971). It says that if an item is selected from two menus, then it is also chosen from the larger menu obtained as their union.
The connection between these two properties and rational behavior is well-known:
Theorem 3.9** (Sen, 1971).**
A choice correspondence is rationalizable if and only if both and hold.171717This characterization readily extends to quasi-choices: see Aizerman and Aleskerov (1995, Theorem 2.5). For a proof of this generalization, see Aleskerov and Monjardet (2002, Theorem 2.8).
We now proceed to define a suitable refinement of , which takes into account all ‘locally rational approximations’ of the original quasi-choice. Specifically, we consider all restrictions of the given correspondence to all subsets of any given menu, and modify them in order to obtain quasi-choices that locally satisfy Axioms and . Finally, we sum up all differences of these rational modifications.
Definition 3.10**.**
Let be a quasi-choice over , and a nonempty menu. Define a quasi-choice over , where is the family of comprising all nonempty subsets of , as follows for each :
[TABLE]
We call the rational localization of at . Then, for all , the rational distance between and is defined by
[TABLE]
where denotes the restriction .
Definition 3.10 employs -many restrictions of the given metric to compare standard modifications of two given quasi-choices: these modifications are a sort of rational closures of a given choice on a given menu. Note that the terminology of ‘local rationalization’ used for is motivated by the fact that any element is never responsible for a violation of Axioms or by .181818More formally, considering Axiom , this means that if , , and , then . Similarly, for Axiom , if , , and , then .
Example 3.11**.**
We illustrate how Definition 3.10 works in a very simple case. Consider the two choice functions and over defined by191919The choice function has already been considered in Example 3.3.
[TABLE]
Note that and are equal except on the menu , and is rationalizable by the linear order . To determine , we preliminary compute their rational localizations and at any nonempty subset of having size at least two:
(Note that all rational localizations at singletons are trivial choice functions in this particular case.) Since for each , whereas and , we conclude
[TABLE]
As possibly expected, Definition 3.10 is sound:
Lemma 3.12**.**
The function is a metric on .
Proof. For A0.1, clearly is nonnegative. If , then for all . It follows that for all , and so . This proves A0.1. Axiom A0.2 is obvious. For A0.3, observe that , because is the restriction of a metric. Thus the claim follows from summing over all .
The next remark shows that, despite being derived from , the rational metric does not satisfy several properties considered by Klamler; in particular, neither of the two axioms responsible for elementary decomposability —namely A1 and A3— hold for .
Remark 3.13**.**
We prove that satisfies neither A1 nor A3 nor A4’. All counterexamples will be quasi-choices over the set . Since in all cases the choice set of any singleton is nonempty, we only define them on menus having size two or three.
To prove the failure of A1, define by
;
;
.
Clearly, is between and . However, is different from .
For the failure of A3, define by
;
;
;
.
Let , and observe that , , , and . However, whereas .
For the failure of A4’, define by
;
;
;
.
The four quasi-choices over agree on every menu, except on . For , we have and , and yet .
It would be interesting to axiomatically characterize the rational metric : we leave this as an open problem.
4 Distance-based degrees of irrationality
We finally give a formal definition of the measure of irrationality of a deterministic choice behavior with respect to a given metric, where the family of rationalizable quasi-choices acts as the benchmark of rationality. We provide two versions of it: (1) simple, and (2) weighted. The first applies to all quasi-.choices, whereas the second is only designed for choice correspondences.
4.1 A simple degree of irrationality
Definition 4.1**.**
Let be a metric. Denote by the subfamily of comprising all quasi-choices that are rationalizable. For any quasi-choice over , the -degree of irrationality of is the integer defined by
[TABLE]
(This degree is well-defined, because is finite.)
Given a metric on , the larger the -degree of irrationality of a quasi-choice is, the more irrational is considered from the point of view of . Note that if a quasi-choice is rationalizable, then its -degree of irrationality is zero for any metric . For instance, the choice function defined in Example 3.3 has a -degree of irrationality equal to zero, whereas and have a -degree of irrationality equal to two.
As already pointed out, the soundness of Definition 4.1 depends on the economic significance and the discernibility power of the metric used to determine the degree of irrationality. In this respect, the rational metric appears to be better suited than Klamler’s distance . The next two examples witness this claim.
Example 4.2**.**
Consider the three choice functions defined in Example 3.3. It is easy to show that
[TABLE]
On the other hand, below we show that
[TABLE]
** :**
This is obvious, because is rationalizable.
** :**
As noted in Example 3.11, the choice function defined by
[TABLE]
is rationalizable by the linear order , with . We know that . Therefore, to prove the claim, we show that for all .
Hereafter we shall employ a simplified notation, which is also used – mutatis mutandis – in the proof of the equality . Specifically, for all , we denote by the less cumbersome . Moreover, we drop brackets and set separators whenever clear from context, using instead of , instead of , etc.
Now fix . Then, either (1) , or (2) .
(1)
If , then we separately consider two cases.
(1A)
If , then . Since , we obtain , hence .
(1B)
If , then we split the analysis in two subcases.
(1B1)
If , then , while . It follows that . Note that and imply . We conclude .
(1B2)
If , then by Axiom , and so . Since , we get . As before, and imply . We conclude .
(2)
If , then by Axiom . Thus and . We conclude , hence .
** :**
Let be the choice rationalizable by the relation on defined by and , that is,
[TABLE]
(Note that is not transitive, because and are incomparable.) It is easy to check that and , whereas all other rational localizations of and coincide. It follows that .
To complete the proof, we show that for any .
(1)
If , then either (1A) , or (1B) , using Axiom .
(1A)
If , then we split the analysis into two subcases.
(1A1)
If , then , , and . Therefore, from and , we derive . We conclude .
(1A2)
If , then by Axiom , or , and so or . Since , we obtain or . Finally, since and , we get , and so .
(1B)
Suppose . We can assume that , otherwise we would be done by case (1A). It follows that while , hence . Since and , we conclude , and so .
(2)
If , then and by Axiom .
(2A)
If , then , , , and so .
(2B)
If , we consider two subcases.
(2B1)
If , then by Axiom , hence and . We conclude .
(2B2)
If , then and by Axiom , hence and . Again, we conclude .
Example 4.3**.**
Let be the four choice correspondences over defined in Example 3.4. One can easily show that these three choices have the same -degree of irrationality, being
[TABLE]
On the contrary, the metric agrees with the perception that is less irrational than , and is the most irrational of all, being
[TABLE]
The related computations are extremely long and tedious, so we omit them.202020However, they are available upon request.
4.2 A weighted degree of irrationality
The evaluation of the degree of irrationality of choice behavior described above can be refined, as long as the DM is able to provide additional pieces of information. Here we illustrate a possible refinement of it, which applies to the family of choice correspondences; in other words, we consider the special case of a decisive DM.
In a preliminary step, the DM is required to provide additional information about the ‘subjective desirability’ of all rational choice behaviors. Operationally, this is obtained by assigning weights to each rationalizable choice correspondence. According to intuition, very desirable rational behaviors should be given a weight less or equal than one, because this may produce the effect of contracting the rational distance of all choices close to them. On the contrary, less appealing rational behaviors should given a weight greater or equal than one, in order to possibly dilate the distance from rationalization. Once desirability is assessed, the irrational degree of a decisive choice behavior is then computed as the minimum weighted distance from the benchmark of rationality.
In the process of designing the weighting procedure, we adhere to some natural rules of conduct. We select the transitivity of the relation of revealed preference as our guiding parameter: the more transitive this relation is, the more desirable the associated behavior becomes, and the lower the corresponding weight must be. From this point of view, the most desirable choices will be those rationalized by weak orders (asymmetric and negatively transitive, hence transitive), which will be assigned the lowest weight among all rational behaviors. Less desirable levels are those of choices rationalized by semiorders (asymmetric, Ferrers, and semitransitive) (Luce, 1956), and by interval orders (asymmetric and Ferrers) (Fishburn, 1970, 1985). At an even lower desirability level lie all choices rationalized by transitive asymmetric relations that fail to be interval orders. At the bottom of the scale, we find those choices that are rationalized by asymmetric, acyclic and intransitive binary relations, which will be given the highest weight of all.
An even finer tuning of the weighting procedure can be achieved by employing the so-called strict and weak -Ferrers properties (Giarlotta and Watson, 2014, 2018), which provide a classification of all asymmetric and acyclic binary relations on a set according to their discrete level of transitivity.212121On the point, see also Cantone et al. (2016) for a classification of all rationalizable choices on the basis of the so-called axioms of -replacement consistency. The next definition provides a simplified version of these properties, which is however sufficient for our goal.
Definition 4.4** (Giarlotta and Watson, 2014).**
Let be an asymmetric and acyclic binary relation over . Denote by the canonical completion of , obtained by adding all -incomparable pairs to .222222Two (not necessarily distinct) elements are -incomparable if nether nor holds. Technically, the canonical completion is the extension of in which incomparability is transformed into indifference. In particular, the canonical completion of is both reflexive (i.e., for all ) and complete (i.e., or for all distinct ). For any integers , we say that is -Ferrers if the joint satisfaction of and implies either or , for all (not necessarily distinct) .
It is easy to show that -Ferrers implies -Ferrers for any and (Giarlotta and Watson, 2014, Lemma 2.6). Furthermore, -Ferrers implies -Ferrers for any (Giarlotta and Watson, 2014, Theorem 3.1(v)). Note also that -Ferrers relations are weak orders, - and -Ferrers relations are semiorders, -Ferrers relations are interval orders, -Ferrers relations are transitive, and -Ferrers are acyclic but intransitive.
Consequently, all asymmetric and acyclic binary relation on a given set of alternatives can be partitioned according to a lattice structure, which is induced by the satisfaction of -Ferrers properties. This lattice is composed of 14 pairwise disjoint sets, which in turn can be arranged into 9 desirability classes according to their discrete degree of transitivity: see Figure 1.232323This figure is a simple elaboration of Figure 6 in Giarlotta (2019). See also Giarlotta (2014), where the typical form of strong semiorders and strong interval orders is displayed in Figure 5. For instance, the most desirable class is that of weak orders, whereas the least desirable class comprises all intransitive preferences. We can finally define a weighted variation of the degree of irrationality.
Definition 4.5**.**
A feasible weighting map is a function , which assigns a positive weight to each desirability class in a way that
(monotonicity)
implies for all , and
(average property)
for some ,
where is a discrimination threshold determined a priori by the DM.242424Here we do not dwell on the procedure to assess the discrimination threshold . In fact, the sole purpose of this section is to illustrate a simple variant of our approach. Given a rationalizable choice over , denote by the desirability class of its relation of revealed preference . Then, for any , the irrationality index of induced by is
[TABLE]
Definition 4.5 can be motivated as follows. The property of monotonicity ensures that the weight of rational choices decreases as the level of transitivity of the corresponding revealed preference increases. As a consequence, if, for instance, a choice behavior is close to a highly desirable rational choice, then its degree of irrationality will be accordingly contracted. Furthermore, the average property guarantees that the average weight given to a rational choice behavior belongs to a close neighborhood of according to a threshold established by the DM.
In the simplest case, all weights are the same, and the discrimination threshold is equal to [math]. This implies that the weighting function assigns weight equal to to all asymmetric and acyclic binary relations over . However, even in this very special case, it may happen that for some choice . The reason is that the weighted variant of our approach only applies to decisive choice behaviors, and so the computation of the minimum distance from the benchmark of rationality may give different results.
We conclude this section with an example, which showcases how a weighting procedure of rational choices yields a fine tuning of the results obtained in Example 3.4.
Example 4.6**.**
Let be the four choice correspondences over defined in Example 3.4 (and further analyzed in Example 4.3). On a four-element set, the phenomenology of -Ferrers properties is quite poor, that is, many equivalence classes of the partition are empty. In fact, it suffices to assign weights to the following classes: (1) weak orders, (6) semiorders, (7) interval orders and semitransitive relations, and (9) intransitive relations. For the sake of illustration, first set for . Clearly, is a feasible weighting map for any . A computer-aided computation yields the following -degrees of irrationality induced by :
[TABLE]
Now define by for , , and for . Again, is a feasible weighting map for any . Now we get the -degrees of irrationality induced by become
[TABLE]
A sensitivity analysis connected to the weighting procedure and the threshold of discrimination may provide further insight into the DM’s preference system.
5 Measures of stochastic irrationality
In this last section we suggest how to adapt our approach to a stochastic environment. The underlying idea is to transform the search for a measure of irrationality into the formulation of a geometric problem (concerning polytopes).
For simplicity, we shall only consider the case of stochastic choice functions,252525Our approach can be extended to stochastic choice correspondences, too. as defined below.
Definition 5.1**.**
A stochastic choice function over is a map such that for all and , the following conditions hold:
- •
,
- •
implies .
We denote by the family of all stochastic choice functions over .
As in the deterministic case, the first step in determining the degree of irrationality of a stochastic behavior consists of fixing a benchmark of rationality.
In their interesting approach, Apesteguia and Ballester (2015) essentially consider the finite family of deterministic rationalizable choices (which are in a one-to-one correspondence with linear orders) as the benchmark of rationality. Roughly speaking, the authors associate to a suitable stochastic choice behavior —a collection of observations— what they call a swap index, computed by using probabilities to weigh swaps in linear orders.
Our selection of the benchmark is instead an infinite family of stochastic choices, namely those that satisfy the following well-known model of rational behavior:
Definition 5.2** (Block and Marschak, 1960).**
A stochastic choice function over satisfies the random utility model (for brevity, it is a RUM function) if there is a probability distribution on the set of all linear orders over such that for each and ,
[TABLE]
Hereafter, any RUM function will be called rational; accordingly, we shall denote by the family of all RUM functions over .
The selection of RUM as a prototype of stochastic rationality is statistically robust: see, among several related contributions, Marley and Regenwetter (2017) for a review of random utility models, McCausland et al. (2019) for a direct Bayesian testing of RUM, and Davis-Stober (2009, Section 8) for an application to axiomatic measurement theory.
Now an attempt to fully adapt our deterministic approach to a stochastic setting poses salient challenges. In fact, we need an economically significant metric —or, alternatively, a function that satisfies weaker properties, such as a ‘divergence’— which enables us to discern different levels of irrationality for different types of stochastic choice behaviors. However, none of the metrics/divergences considered in the literature appears to be a good fit for our goal,262626Some examples in Subsection 5.1 illustrate how different types of stochastic choice behaviors are not adequately distinguished by some well-known distances/divergences. and it seems not simple to design new metrics that do the job.
In view of the difficulties illustrated above, here we choose a different path to evaluate the level of irrationality of a stochastic choice behavior. Specifically, we take advantage of a known characterization of the RUM model to attach a vector with -many components to each stochastic behavior: the higher the entries in the vector, the most irrational the choice behavior. Then, to compare irrationality levels, we use a permutation-invariant version of the classical Pareto ordering of these vectors, which arranges all irrational stochastic choices into a preordered set (ties and incomparability being allowed).
As a preliminary step, we recall the known characterization of the RUM model.
Definition 5.3** (Block and Marschak, 1960; Falmagne, 1978).**
Let be a stochastic choice function over . For any and , define
[TABLE]
The ’s are the Block-Marshak polynomials (BM polynomials, for brevity)272727‘Polynomial’ is the usual term, although is a linear expression in the ’s of .
Block and Marschak (1960) show that having for suitable menus is a necessary condition for having a RUM function. However, the general definition of the BM polynomials and the complete characterization of the random utility model came almost twenty years later:282828See Fiorini (2004) for an elegant and very short proof of this result, which involves Möbius inversion and network flow.
Theorem 5.4** (Falmagne, 1978).**
A stochastic choice function is RUM if and only if all its Block-Marshak polynomials are nonnegative.
Theorem 5.4 allows us to derive a measure of irrationality for stochastic choices.
Definition 5.5**.**
Let be a stochastic choice function over , where . For each , let
[TABLE]
The -tuple v_{p}=\big{(}v_{p}(x_{1}),\ldots,v_{p}(x_{n})\big{)}\in{\mathbb{R}}^{n}_{+} is the negativity vector of .
Clearly, the larger the entries in the negativity vector, the more irrational the corresponding stochastic choice. By Definition 5.5 and Theorem 5.4, all RUM functions —and, in particular, all deterministic rationalizable choices— have as negativity vector. For all non-RUM functions, the next definition establishes a way to compare their (strictly positive) irrationality.
Definition 5.6**.**
Denote by the family of all permutations of . Define a binary relation over as follows:
[TABLE]
for any . Then, we say that
- •
and are equally irrational if (i.e., and ),
- •
is less irrational than if (i.e., and ), and
- •
and are incomparably irrational if (i.e., and ).
The pair is a preordered set,292929Recall that a preorder is a reflexive and transitive (but possibly incomplete) binary relation. having all RUM functions as a minimum.
The next example presents two stochastic choice functions over a set of size four. We shall compute all related BM polynomials and the two associated negativity vectors, to finally conclude that one function is more irrational than the other.
Example 5.7**.**
Set . Let be the stochastic choice function over defined in Table 1. For the sake of illustration, we explicitly compute the first two BM polynomials of associated to the item :
[TABLE]
By Definition 5.5, summing all entries in the last four columns of Table 1 yields the negativity vector of , which is .
A different stochastic choice function over is given in Table 2.
Note that provides a minimal counterexample to the fact that the property of monotonicity303030A stochastic choice function over is monotonic (or regular) if for all and , implies : see Block and Marschak (1960). does not characterize the random utility model: in fact, is monotonic but not RUM.
Since the negativity vector of is , we get for all , and so we conclude that .
As possibly expected, isomorphic stochastic choice functions —in the sense clarified below— are equally irrational.
Definition 5.8**.**
Two stochastic choice functions over are isomorphic is there is a permutation such that
[TABLE]
for all and . The bijection is called an isomorphism between and .
The next result shows that our measure of stochastic irrationality is independent of the names of alternatives.
Lemma 5.9**.**
For any stochastic choices over , if is an isomorphism between and , then for all . Thus, isomorphic stochastic choice functions always have the same level of irrationality.
Proof. Observe that
[TABLE]
where the last equality is given by the fact that there is a one-to-one correspondences between the family of all menus containing and the family of all menus containing . We conclude that the BM polynomial of is equal to the BM polynomial of . The claim follows.
It is currently under study the implementation of a geometric approach (based on polytopes) to the measure of the irrationality of a stochastic choice behavior.
5.1 Some related literature
Here we review some existing metrics/divergences that apply to stochastic choices, and point out some possible drawbacks in detecting different levels of irrationality.
Definition 5.10**.**
Let be the map defined by
[TABLE]
for all . Then is a metric, called the total variation distance.313131This name originates from the process of considering all differences between two objects (stochastic functions, in this case) and taking either the sum or the supremum (the maximum, in this case).
The metric may not be a good fit for our purpose, as the next example shows.
Example 5.11**.**
Define a stochastic choice function over by for all and . Clearly, is RUM function. Next, we define two additional stochastic choice functions and over as follows:
- •
for all and ,
- •
, , and ;
- •
and ,
- •
and ,
- •
and ,
- •
, , and .
It can be checked that , that is, the metric puts and at the same total variation distance from the rational function . However, and , and so by Definition 5.6.
Next, we consider a weaker type of distance, namely a ‘divergence’, which only satisfies the non-negativity property A0.1 of a metric, but not necessarily symmetry A0.2 and the triangle inequality A0.3.
Definition 5.12** (Kullback and Leibler, 1951).**
Let the function defined by
[TABLE]
for all . The map is called the Kullback-Leibler divergence.
Example 5.13**.**
Let , , , and be exactly as in Example 5.11. Define as follows:
- •
and ,
- •
and ,
- •
and ,
- •
, , and .
Note that the negativity vector of is . One can check that . On the other hand, according to Definition 5.6, we have and .
Examples 5.11 and 5.13 show that both the total variation distance and the Kullback-Leibler divergence may fail to capture some features of irrationality. Although one may argue that both examples only deal with one rational function —possibly the most emblematic—, a similar pathology is still present when calculating distances from other rational functions. These issues suggest that Definition 5.6 may provide a more adequate tool in assigning levels of irrationality to stochastic choices.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1)
- 2Aizerman and Aleskerov (1995) Aizerman, M., and Aleskerov, F. , 1995. Theory of Choice. North Holland .
- 3Aleskerov and Monjardet (2002) Aleskerov, F., and Monjardet, B. , 2002. Utility Maximization, Choice and Preference. Springer, Berlin .
- 4Alabayrak and Aleskerov (2000) Albayrak, S. R., and Aleskerov, F. , 2000. Convexity of choice function sets. In: Bogazici University Research Paper ISS/EC-2000-01.
- 5Alcantud et al. (2022) Alcantud, J. C. R., Cantone, D., Giarlotta, A., and Watson, S. , 2022. Rationalization of indecisive choice behavior by majoritarian ballots. Ar Xiv: 2210.16885 [econ.TH].
- 6Ambrus and Rozen (2014) Ambrus, A., and Rozen, K. , 2014. Rationalising choice with multi-self models. Economic Journal 125(585): 1136–1156.
- 7Apesteguia and Ballester (2015) Apesteguia, J., and Ballester, M. A. , 2015. A measure of rationality and welfare. Journal of Political Economy 123: 1278–1310.
- 8Arrow (1950) Arrow, K. J. , 1950. A difficulty in the concept of social welfare. Journal of Political Economy 58: 328–346.
