Protecting the Protected Group: Circumventing Harmful Fairness
Omer Ben-Porat, Fedor Sandomirskiy, Moshe Tennenholtz

TL;DR
This paper introduces Welfare-Equalizing fairness constraints to address discrimination in ML, showing how they can improve protected group welfare and providing algorithms for optimal classifiers in self-interested scenarios.
Contribution
It proposes a unified Welfare-Equalizing fairness framework, generalizing existing notions like Demographic Parity and Equal Opportunity, and analyzes conditions for aiding disadvantaged groups.
Findings
Welfare-Equalizing constraints can improve protected group welfare.
The paper characterizes optimal classifiers under these fairness constraints.
Algorithms for computing optimal classifiers are provided.
Abstract
Machine Learning (ML) algorithms shape our lives. Banks use them to determine if we are good borrowers; IT companies delegate them recruitment decisions; police apply ML for crime-prediction, and judges base their verdicts on ML. However, real-world examples show that such automated decisions tend to discriminate against protected groups. This potential discrimination generated a huge hype both in media and in the research community. Quite a few formal notions of fairness were proposed, which take a form of constraints a "fair" algorithm must satisfy. We focus on scenarios where fairness is imposed on a self-interested party (e.g., a bank that maximizes its revenue). We find that the disadvantaged protected group can be worse off after imposing a fairness constraint. We introduce a family of \textit{Welfare-Equalizing} fairness constraints that equalize per-capita welfare of protected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBlockchain Technology Applications and Security · Ethics and Social Impacts of AI
Protecting the Protected Group: Circumventing Harmful Fairness
Omer Ben-Porat1, Fedor Sandomirskiy2,3, Moshe Tennenholtz2
Abstract
The recent literature on fair Machine Learning manifests that the choice of fairness constraints must be driven by the utilities of the population. However, virtually all previous work makes the unrealistic assumption that the exact underlying utilities of the population (representing private tastes of individuals) are known to the regulator that imposes the fairness constraint. In this paper we initiate the discussion of the mismatch, the unavoidable difference between the underlying utilities of the population and the utilities assumed by the regulator. We demonstrate that the mismatch can make the disadvantaged protected group worse off after imposing the fairness constraint and provide tools to design fairness constraints that help the disadvantaged group despite the mismatch.
1 Introduction
At first glance, algorithms may seem free of human biases such as sexism or racism. However, in many situations, they are not: the automated recruiting tool used by Amazon was favoring men (Doleac and Hansen 2016); judges in the US use the COMPAS algorithm to estimate the probability that the defendant will re-offend while this algorithm was accused of being biased against black people (Larson et al. 2016). See O’Neill (2016) for many more examples. These challenges call for imposing fairness constraints on algorithm design and, in particular, on machine-learned classifiers, which are the subjects of this paper. As a running example, consider a bank that gives loans to potential borrowers and is regulated by a policy-maker. The bank learns a decision rule (namely, a classifier) from historical data to decide for whom to approve or decline the loan to maximize its revenue that is increasing with the number of repaid loans. As repeatedly observed in the past, the resulting classifier may be biased against a protected group (e.g., ethnic minority). Hence, the regulator may wish to impose a fairness constraint on the bank.
Bias is deemed unjust. Beyond that, it affects the welfare of protected groups, as borrowers have preferences towards the different outcomes of the classification, captured by utility functions.111We use the term utility for the well-being of a single individual and welfare for the aggregated well-being of groups of individuals. Consequently, the goal of imposing fairness constraints is to improve the welfare of the disadvantaged group (the originally-discriminated one). A natural approach by which the regulator can achieve this goal is by assuming a utility function and requiring the bank’s classifier to equalize the protected groups’ welfare. The two most popular fairness constraints, Demographic Parity (Agarwal et al. 2018; Dwork et al. 2012) and Equal Opportunity (Hardt et al. 2016) (henceforth DP and EO, respectively) are special cases of this approach for particular utility functions. For instance, DP is recovered by assuming that every agent gets a utility of one when receiving the loan, and zero otherwise.
The possibility that fairness may harm the well-being of those it is designed to protect, i.e., that it can harm the disadvantaged group’s welfare, seems counter-intuitive. However, it is well known both theoretically and empirically that the most intuitive fairness constraint of Unawareness (which forbids using the sensitive attribute in classification) can be harmful (Corbett-Davies and Goel 2018; Dwork et al. 2018; Ustun, Liu, and Parkes 2019; Doleac and Hansen 2016). For example, Doleac and Hansen (2016) show that the “ban the box” policy, adopted by the United States and preventing employers from seeing applicants’ criminal background, decreased the welfare of discriminated minorities (the chances of getting a job). Additionally, Liu et al. (2018) discovered that DP and EO are not free of the same flaw if we consider long-term consequences. They show that fairness constraints may force the bank to give loans to those members of the disadvantaged group who, otherwise, would not have received the loans due to the high probability of default. Therefore, such unqualified borrowers are likely to have problems with paying back the loan. This increased default ratio would harm the disadvantaged group’s average credit score, thereby harming its welfare in the long run.
1.1 Our contribution
In this paper, we uncover another mechanism underlying harmful fairness even in static settings:
Imposing a fairness constraint can make the disadvantaged group worse off if the fairness constraint and the utilities of the population mismatch.
Following the recent trends of fair ML literature, we assume that agents may have different preferences over classification outcomes, which are captured by utility functions. For example, borrowers may differ in their value for getting the loan depending on their access to alternative sources of money and on the purpose of borrowing. With utilities in hand, we can use social welfare to evaluate a group’s well-being for any given classifier. To talk about the mismatch between utilities and fairness constraints, the latter has to be defined in utilitarian terms. As we described above, fairness constraints like DP and EO are naturally cast as equalizing some welfare functions of the protected groups. However, the difficulty in applying any welfare-based fairness constraint is that the utilities must be known to the designer, while these utilities represent individuals’ private tastes. Hence, even if domain experts determine the utilities used in a fairness constraint (henceforth, assumed utilities), they can only approximate the actual utilities of the population (underlying utilities in what follows). This discrepancy leads to the following conclusion.
In practice, the mismatch between the underlying utilities of the population and the utilities assumed by the regulator is unavoidable.
Together with the observation that the mismatch can make fairness harmful, this becomes a serious caution for regulators that design fairness constraints for a certain industry, e.g., banking. We complement this caution with a positive message. Naturally, a small discrepancy between underlying and assumed utilities must be innocuous. However, we characterize a much more applicable and promising connection; we show that
Fairness constraints help the disadvantaged group whenever the utilities assumed by the regulator and the underlying utilities of the population agree on which group is disadvantaged.
Finally, we suggest additional ways to deal with the mismatch if the underlying utilities can be approximated from data.
1.2 Related Work on Economic Ideas in Fair Classification
Welfare-Equalizing, our approach to fairness, has a long history in normative economics (Pazner and Schmeidler 1978; Roemer 1986) (where it is known under the name of egalitarianism) and political philosophy (Rawls 2009); it was used for fair resource allocation without money transfers (Li and Xue 2013), in the field of cooperative games (Dutta and Ray 1989) and bargaining problems (Kalai and Smorodinsky 1975). In contrast to recent papers on the utilitarian approach to fair classification (Heidari et al. 2018; Heidari, Gummadi, and Krause 2019), which suggest maximizing the minimal welfare among the protected groups, we strengthen this desideratum by making it a normative requirement: the welfare must be equal among the subgroups defined by a sensitive attribute. This normative condition allows one to separate the fairness constraint (which may be imposed by a regulator) from the selfish objective of the decision-maker (a revenue-maximizing bank in our running example) and thus allows one to analyze how decisions change after imposing the fairness constraint. Another advantage of the Welfare-Equalizing concept is the simple threshold structure of the optimal fair classifier (similar to the one for Demographic Parity or Equal Opportunity (Corbett-Davies et al. 2017)), which makes it efficiently computable.
This work joins recent attempts (Rambachan et al. 2020a, b; Hu and Chen 2020; Elzayn and Fish 2020; Hossain, Mladenovic, and Shah 2020) to bring better economic understanding to fairness in ML; we address some of them here and refer the reader to Finocchiaro et al. (2020) for a comprehensive survey. Hu and Chen (2020) propose an optimization framework for fair classification and welfare analysis. In their modeling, a learner executes a soft-margin SVM with the additional constraint of group fairness: limiting the two groups’ welfare discrepancy to a predefined quantity. They provide a sensitivity analysis, showing that applying stricter fairness constraints (decreasing the allowed discrepancy) can worsen welfare outcomes for both groups. Their findings are in line with ours, but our analysis is fundamentally different; in particular, they do not address the utility mismatch issue. Imposing fairness constraints on profit-maximizing entities, as we do in this paper, is an understudied point of view, as noted recently by Elzayn and Fish (2020).
There are also several recent attempts to harness economic principles to fair classification. For example, Gölz, Kahng, and Procaccia (2019) treat fair classification as an allocation of goods, where there is a fixed amount of resources to distribute. They examine the compatibility (or lack thereof) of Equalized Odds with axioms of fairness from the economic literature on fair division; see (Brandt et al. 2016) for a survey. Envy-freeness, the dominant fairness concept in economics, plays a crucial role in several recent papers at the intersection of economics and AI (Caragiannis et al. 2019; Cohler et al. 2011; Benade et al. 2018; Guruswami et al. 2005; Gal et al. 2016; Plaut and Roughgarden 2018), including several works on fair classification (Zafar et al. 2017; Balcan et al. 2019; Ustun, Liu, and Parkes 2019; Hossain, Mladenovic, and Shah 2020). However, these papers on fair classification focus on sample complexity and generalization (Balcan et al. 2019), or asserting that users favor treatment disparity (Ustun, Liu, and Parkes 2019; Zafar et al. 2017) in health applications.
The work most related to ours is the paper by Hossain, Mladenovic, and Shah (2020). Concurrently to and independently of our work, Hossain, Mladenovic, and Shah (2020) argue for group equability in fair classification, which coincides with our Welfare-Equalizing fairness constraint. They, too, show that their concept subsumes previously suggested fairness notions. However, there is a significant difference between the two works. First, Hossain, Mladenovic, and Shah (2020) are interested in learning the best classifier from data, and hence address issues of generalization from samples and differentiability. In contrast, we devote our paper to societal considerations of fair classification, and thus consider fairness as a post-processing step similarly to (Corbett-Davies et al. 2017; Hardt et al. 2016). Additionally, in contrast to Hossain, Mladenovic, and Shah (2020), our analysis focuses on the mismatch between the utilities assumed by the regulator and the actual, underlying utilities.
1.3 Paper structure
In Section 2, we present our formal model, define the Welfare-Equalizing fairness framework, and prove structural results for optimal fair classifiers in Subsection 2.2. Section 3 deals with the implications of a mismatch between the population’s underlying utilities and the utilities assumed by the regulator. Finally, Section 4 describes how to compute the bank-optimal fair classifier if the assumed utilities approximate the underlying utilities well enough.
2 Model
We consider a general classification problem, where agents have an ex-ante non-observable “quality” correlated with observable attributes. We keep using the metaphor of a bank that predicts the reliability of the population and makes lending decisions; however, the same setting captures student admissions, recruiting, assessing the recidivism risk for a criminal, etc.
There are three parties in the model: a heterogeneous population of potential borrowers; a bank that makes lending decisions based on the observable attributes of borrowers and cares only about its revenue; and a regulator that cares about fairness and can restrict the set of lending policies available to the bank by imposing a fairness constraint. We now present these parties formally.
Borrowers
We assume that each potential borrower (henceforth borrower) is associated with a pair of observable attributes . Here is a binary222The assumption of the dichotomy of is made for simplicity. Extending our results to the non-binary case (ethnicity) is straightforward. sensitive attribute (e.g., gender) and encodes all other characteristics of a borrower, e.g., employment history, salary, education, assets and so on. We do not impose any assumptions on . By and we denote the groups of all borrowers with the sensitive attribute equal to [math] or , respectively; we call a protected group. Furthermore, in addition to the observable attributes and , every borrower is also associated with an unobservable variable , which describes whether that borrower will pay back the loan or not. For brevity, we call borrowers with and , good and bad, respectively. The statistical characteristics of the population are described by a probability space ; so , and are random variables on . By small letters we denote realizations of , , and , i.e., generic elements of .
Each borrower obtains a utility when receiving the loan. This utility can depend on and in many ways, but what is more critical is that it must depend on the non-observable quality of the borrower, namely . To simplify the presentation, we assume that the utility from a rejected application is zero and that the average utility of a borrower with given and is non-negative,333Zero utility for not getting a loan is a normalization-condition: before borrowing money, everybody is at zero utility level. Non-negativity of can be regarded as a rationality assumption on borrowers: no rational agent would apply for a loan if she/he expects that getting the loan brings negative utility while not getting gives [math]. i.e., . However, both assumptions can be relaxed. We refer to as the underlying utility of the population.
The bank
We assume that the bank knows the joint distribution of from historical data. In particular, it knows the exact conditional probability of being a good borrower given the observable attributes; we denote it by .444Indeed, this is aligned with previous works that consider fairness as a post-processing step (Corbett-Davies et al. 2017; Hardt et al. 2016).
The bank makes lending decisions based on and but without observing . It uses a classifier where is the probability of giving a loan to a population of borrowers with and . Each loan given to a good borrower brings a revenue555In contrast to the rest of the literature, we allow the bank’s revenue to depend on the non-sensitive attribute . This becomes important if also encodes the type of loan a client is applying for, e.g., different borrowers may need a different amount of money and thus bring a different revenue/loss. of to the bank while each bad borrower leads to a loss of ; we assume that are bounded functions of . The bank’s revenue depends on the choice of a classifier , and is defined by
[TABLE]
To ease notation, we define and such that for every
[TABLE]
hence, we can rewrite Equation (1) by taking a conditional expectation with respect to as
[TABLE]
The goal of the bank is to maximize over the set of feasible classifiers, i.e., classifiers that satisfy the regulator’s constraints.
The regulator
The regulator evaluates the well-being of a group using its welfare: the expected utility of its members. For an underlying utility function and a classifier , the welfare of the subgroup is given by
[TABLE]
The regulator aims to equalize welfare among the protected groups. However, as we discuss in the introduction, the underlying utility is unknown to the regulator; thus, the regulator is forced to use a certain substitute instead. We refer to as the assumed utility; ideally, the assumed utility must be an approximation to the underlying one.
The objective of the regulator is captured by the following -Welfare-Equalizing constraint (-WE for abbreviation).
Definition 1** (-WE classifier).**
Given a utility-function , a classifier is -Welfare-Equalizing if
[TABLE]
i.e., if equalizes the welfare among the two protected groups. The set of all such classifiers is denoted by .
Note that Welfare-Equalizing constraint is defined with respect to the assumed utility.
2.1 Special cases of Welfare-Equalizing fairness
The framework of Welfare-Equalizing fairness allows one to analyze existing fairness constraints in a unified manner. For instance,
- •
The fairness constraint of Demographic Parity (DP) (e.g., (Agarwal et al. 2018; Dwork et al. 2012)) requires that the fraction of those who receive loans in the two groups must be the same. Formally, a classifier satisfies DP if . It is a special case of WE fairness with for any triplet .
- •
Motivated by drawbacks of DP, Hardt et al. (2016) concluded that good and bad borrowers within protected groups must be treated separately and introduced the concept of Equal Opportunity (EO). Under this fairness constraint, the fraction of good borrowers who get loans must be the same in the two subgroups. Formally, a classifier satisfies EO if
[TABLE]
We recover EO by setting . The coefficients normalize the maximal possible welfare in each group to . Such a rescaling is known under the name of “relative welfare” and is commonly used in economics to make welfare or utilities among groups comparable (Kalai and Smorodinsky 1975).
- •
Borrowers can differ in the amount of money they need. We can assume that information about is encoded in , so . Then, a straightforward generalization of EO is the following concept of Heterogeneous-EO given by with . We can capture any other heterogeneity similarly (e.g., different interest rates, time-period, and payment schedules).
2.2 Structural properties of the bank-optimal classifiers
In this subsection, we analyze classifiers that are optimal for the bank. We first state the result of Corbett-Davies et al. (2017), who characterize the structure of the bank-optimal unconstrained classifier. Then, we build upon their results for the constrained case. Namely, we assume that the regulator imposes the -WE fairness constraint on the bank and explore the structure of the bank-optimal classifiers. We show that the optimal classifiers have a generalized threshold structure, a fact that is extensively used in Sections 3 and 4.
Unconstrained classifier
If the regulator imposes no constraint on the bank, i.e., the bank is free to choose any classifier, then the revenue-maximizing classifier has a simple form (Corbett-Davies et al. 2017). Only borrowers with are profitable for the bank, which is equivalent to the probability of paying back being greater than (recall that and are defined by Equations (2) and (3)). Consequently, the optimal lending policy is given by the following threshold classifier : all borrowers with get loans () and all borrowers with are rejected ().666For definiteness, we assume that if the bank finds the two decisions equally profitable (the knife-edge case ), it chooses the one with fewer loans given (e.g., this policy minimizes paperwork).
Constrained classifier
Consider the general, constrained case, where the regulator imposes -WE fairness on the bank. For a fixed and given assumed utility , we denote by the classifier that maximizes the bank’s revenue (see Equation (1)) among all -WE classifiers . The set of -WE classifiers is non-empty since and, therefore, the bank’s optimization problem is well-defined. To ease notation, we denote by the assumed utility of a borrower associated with averaged over the possible values of ,
[TABLE]
Further, we denote by the maximal revenue that the bank could extract from the group at the (assumed) welfare level . Formally,
[TABLE]
where is given by Equation (3). The following Proposition 2 shows that the bank-optimal constrained classifier always exists and reveals its structure.
Proposition 2**.**
The bank-optimal -WE classifier exists. Furthermore, each optimal classifier has the following form:
[TABLE]
The group-dependent thresholds , belong to the super-gradient777For a concave function , the super-gradient is the set of all such that for all . If is continuous, then for any the super-gradient is non-empty, see Rockafellar (2015). of the subgroup-optimal revenue (a concave function of ) computed at the welfare level maximizing the total bank’s revenue . The functions are arbitrary888In particular, there always exists with constant , i.e., independent of . up to the constraint that provides the desired welfare level for both groups, .
Proof sketch of Proposition 2.
The revenue maximization over can be represented as a two-stage procedure. In the first stage, we find the revenue-maximizing classifier in each of the subgroups given the welfare level ; in the second stage, we optimize over . The welfare constraint in the first stage can be internalized using the Lagrangian approach; the corresponding Lagrange multipliers are equal to the “shadow prices”, i.e., the derivatives of the value functions with respect to . This internalization reduces finding the subgroup optimal classifier to the unconstrained problem; thus, the optimal classifier has a threshold structure similarly to . This structure is inherited by . Since the resulting linear program is infinite-dimensional and may be non-differentiable, the formal proof requires some functional-analytic arguments presented in the appendix. ∎
For the special cases of Demographic Parity and Equal Opportunity, the explicit form of the optimal classifiers was obtained by Corbett-Davies et al. (2017). Their result becomes an immediate corollary of Proposition 2; see Subsection A.3 in the appendix.
3 Mismatch of Fairness and Underlying Utilities
As we discussed in the introduction, the regulator aiming to equalize welfare among protected groups unavoidably assumes a certain approximation of the underlying utilities . For example, can be determined by the domain experts, while reflects the private tastes of the population and hence is not observable directly. We refer to the fact that is different from as a mismatch. This section explores how imposing the Welfare-Equalizing fairness constraint with respect to affects the underlying welfare, which is measured by .
We use the situation that exists before imposing the fairness constraint as the benchmark. We say that a group is -disadvantaged if under the bank-optimal unconstrained classifier, the welfare of is lower than the welfare of the other group Formally, the group is -disadvantaged if
[TABLE]
where is defined in Equation (5) and is the bank-optimal unconstrained classifier from Subsection 2.2. When the underlying utility is clear from the context, we say that the group is disadvantaged and omit the dependence on .
Ideally, imposing WE-fairness (or any other fairness constraint) should improve the welfare of the disadvantaged group. However, as we show next, this is not always the case. We say that a fairness constraint harms the group if , where is the bank-optimal classifier after imposing that fairness constraint. Put differently, the fairness constraint harms the group if the welfare of the group (measured with respect to underlying utilities) decreases after imposing the fairness constraint.
Harmful mismatch
We now demonstrate that the mismatch between assumed and underlying utilities can make the fairness constraint harmful to the disadvantaged group.
Example 3*.*
Let the underlying utility-function be , i.e., all borrowers equally benefit from receiving loans. However, the regulator does not know the underlying utilities and decides to impose the fairness constraint of EO. Equivalently, the regulator assumes the utility , for normalizing coefficients (as we explained in Subsection 2.1). Notice that the underlying utility is the one associated with DP, and the regulator’s assumed utility is the one associated with EO. Since , there is a mismatch.
Suppose that , and all the combinations of have the same probability of . Furthermore, assume that the fraction of good borrowers is given by the table
[TABLE]
In addition, let the revenue of the bank from a paid-back loan be , and the loss from a borrower’s default be for every .
We first want to determine which group is disadvantaged. The threshold for the bank-optimal unconstrained classifier equals . Hence, under , only borrowers with receive loans in the group . In , borrowers with receive loans since in such cases ; however, loans are not given to borrowers with since . Consequently, is disadvantaged: compared to .
Next, let us examine how imposing the fairness constraint of EO changes the outcome of the bank-optimal classifier. The proportion of loans given by to good borrowers in is equal to the welfare of this group with respect to the assumed utility , namely
[TABLE]
In contrast, for we have
[TABLE]
By imposing -WE-fairness, the regulator requires the bank to equalize these two quantities. To do so, the bank-optimal constrained classifier can either increase the amount of loans given to by approving some applications of or decrease the number of loans given to . However, giving loans to in is too costly: the cost is not compensated by the benefit from giving the same amount of loans to good borrowers with in . Therefore, the bank-optimal constrained classifier coincides with in and gives fewer loans to in the group : . Consequently, the bank equalizes the proportion of loans given to good borrowers in both groups: . However, since the underlying utility is measured by , not , we get that and the disadvantaged group is harmed by imposing the fairness constraint.
3.1 Can WE-fairness help the disadvantaged group despite the mismatch?
In this subsection, we examine when imposing WE-fairness could help the disadvantaged group, even in the presence of a mismatch.999We remind the reader that “helping” and “harming” is always with respect to the actual underlying utility. A natural observation is that if the assumed and underlying utilities “almost” match, the results of imposing -WE-fairness or -WE-fairness should be roughly the same. We postpone such quantitative statements to the end of this subsection and first address easy-to-check general conditions guaranteeing that fairness is not harmful. Theorem 4 below shows that only a lenient condition is required to assure that imposing WE-fairness benefits the disadvantaged group.
Theorem 4**.**
If and agree on which group is disadvantaged, then the -WE classifier weakly increases -welfare of the disadvantaged group, i.e.,
[TABLE]
At first glance, this theorem may look rather intuitive. However, the claim is non-trivial even if there is no mismatch, i.e., when . To see this, recall that the WE-fairness constraint is imposed on the bank: a self-interested party, which is going to find the revenue-optimal way to satisfy the constraint. One possible way to achieve welfare equality is to give no loans to both protected groups thus harming both of them. As we show in Section C in the appendix, such an undesired behavior is possible when Unawareness is imposed. However, it never happens under the WE-fairness; we use this inherent property of WE-fairness as a tool to prove Theorem 4. Momentarily, let us assume that there is no mismatch, i.e., that the assumed utility and the underlying one are exactly the same. In such a case, the following Lemma 5 suggests that not only imposing WE-fairness always improves the welfare of the disadvantaged group, but also that every individual in the disadvantaged group is weakly better off.
Lemma 5** (Matching utilities).**
The bank-optimal -WE classifier makes the -disadvantaged protected group weakly better off at the expense of the advantaged group. Formally,
[TABLE]
Moreover, any borrower from the -disadvantaged group who receives a loan under the unconstrained classifier, receives it under the bank-optimal -WE one.101010While our paper is focused on group notions of fairness, we stress that this result provides stronger “individual” guarantees. Formally, for every it holds that
[TABLE]
Proof of Lemma 5.
By Proposition 2, the welfare level achieved by the -WE classifier maximizes the revenue as a function of welfare level . The sub-group revenues , are concave functions; thus the welfare level lies between their maxima. These maxima are attained at the welfare levels of the bank-optimal unconstrained classifier; therefore, the welfare level is between and . See Figure 1 for illustration.
The second part of the lemma, the individual guarantees, follow from the threshold structure of the bank-optimal constrained classifier , which we identified in Proposition 2. Since the welfare level is above the maximum of (for being the disadvantaged group), its super-gradient contains ; therefore, , which corresponds to , is below ∎
Equipped with Lemma 5, we are ready to prove Theorem 4.
Proof of Theorem 4.
We apply Lemma 5 to the assumed utility function . By the second part of the lemma, after imposing the -WE constraint every borrower from the -disadvantaged group who received loans under the unconstrained classifier still receives them. Namely, for all . Multiplying both sides by the actual underlying utility , substituting and , and taking expectation, we get In other words, the bank-optimal classifier for the utilities assumed by the regulator improves the welfare of the group with respect to the underlying utilities . Since and agree on the disadvantaged group’s identity, this classifier improves the disadvantaged group’s underlying welfare. ∎
In the presence of a mismatch, it is easy to see that a weak converse to Theorem 4 holds, i.e., fairness always makes the disadvantaged group weakly worse off. To avoid a mismatch, one can use a simple quantitative criterion. We say that is an -approximation of for some if for all , and , Theorem 4 implies the following quantitative criterion on how well the regulator should approximate the underlying utilities to help the disadvantaged group.
Corollary 6**.**
If the utility assumed by the regulator is an -approximation of the underlying utility and the gap between the welfare of the groups with respect to is big enough, namely, , then -WE classifier helps the -disadvantaged group.
4 Computing Bank-Optimal Welfare-Equalizing Classifiers
In this section, we provide evidence for the applicability of WE-fairness by developing tools for computing bank-optimal WE classifiers. Our goal is to show how the bank can use the assumed utility proposed by the regulator to compute approximately optimal classifiers. We first discuss the case where the underlying and assumed utilities match (i.e., ) and the other objects are fully known (revenues and losses , and the probability of paying back). Later on, we relax this assumption. Due to space considerations and our desire to focus on the conceptual assets of the paper, we defer most of the analysis to the appendix, as well as an elaborated version of formal statements.
Consider the case where , and both and are known. If the data is tabular, i.e., is relatively small (say several thousand different borrower types), we can compute the bank-optimal -WE-classifier explicitly by standard LP-methods. For large sets of attributes, e.g., multidimensional or continuous, the size of the LP “explodes”, and a different approach should be taken. In this case, we use the structural insights from Proposition 2: the bank-optimal WE-classifier is parameterized by the two thresholds for ; therefore, to compute it we can restrict our attention to a finite-dimensional parametric family of classifiers. Due to the large-scale nature of the problem, we shall seek efficient algorithms for computing approximately bank-optimal WE classifiers, where these approximated solutions are defined as follows.
Definition 7**.**
A classifier is bank-optimal -WE if and .
Notice that such classifiers are doubly approximated: they approximate the revenue of the (exact) bank-optimal WE classifier, and also approximately equalize the welfare of the two classes. In the appendix, we demonstrate how the bank can efficiently find a classifier that approximates the revenue and (exactly) equalizes the welfare. Namely, we show how one can apply the ternary search method (as in Hardt et al. (2016)) to find -WE-classifier in run-time.
Next, we get rid of the full information assumption. Recent papers (Balcan et al. 2019; Hossain, Mladenovic, and Shah 2020) propose convex relaxations for imposing fairness constraints in settings like ours, which includes generalization bounds. However, since an extensive body of literature deals with estimating real-valued functions (ranging from linear regression to deep learning), we take a different approach. We suggest that the bank employs the assumed utility given by the regulator, and describe its performance guarantees in terms of the “quality” of . This perspective has been adopted recently for several other ML problems (Medina and Vassilvitskii 2017; Lykouris and Vassilvitskii 2018; Purohit, Svitkina, and Kumar 2018).
For simplicity, we assume that , , and are all upper-bounded by .
Proposition 8**.**
Fix a small and assume that the bank has access to a sample of and to estimators and such that and for small enough and . Then, a bank-optimal -WE classifier with
[TABLE]
can be computed with probability on a sample of size
[TABLE]
5 Conclusions
Our paper draws on the economic approach to fair classification. It initiates the discussion of the impact that the regulator’s misconception about the population characteristics may have on the protected groups’ well-being. Beyond that, we believe that our WE-fairness can serve as an anchor for grounding other fairness stances.
We have prioritized clarity over generality and focused on binary classification. Nevertheless, our results are more general. The key technical Proposition 2 can be extended to multiple classes (e.g., several loans with different periods and interest rates).111111As in the binary case, there is one threshold per group , but now the revenue and the utilities depend on the class. The optimal WE-classifier selects a class with the maximal . All other results (the mismatch analysis and the algorithms) are, essentially, corollaries of Proposition 2 and extend straightforwardly. Noisy utilities and revenues can be assumed for free if we interpret , , and in all formulas as the conditional expectations for a given triplet . Allowing for utilities with negative expectations also does not alter the statements. We see considerable scope for follow-up work. One prominent direction is to understand how the “price of fairness” is distributed among the parties, e.g., by how much the bank’s revenue and the advantaged group’s welfare drop.
Acknowledgements
We are grateful to Lily Hu, Nikita Kalinin, Alexander Nesterov, Margarita Niyazova, and Ivan Susin for insightful discussions and pointers to the relevant literature. We also thank the anonymous reviewers for their helpful comments and clarifications and Lillian Bluestein for proofreading.
The work of M. Tennenholtz is funded by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement n 740435). F. Sandomirskiy is partially supported by the Lady Davis Foundation, by Grant 19-01-00762 of the Russian Foundation for Basic Research, and by Basic Research Program of the National Research University Higher School of Economics. This work was done while O. Ben-Porat and F. Sandomirskiy were at the Technion—Israel Institute of Science and were partially funded by the European Union’s Horizon 2020 research and innovation programme (grant agreement n 740435).
Appendix A Threshold Structure of the Bank-optimal WE-classifier
Proof of Proposition 2.
The revenue maximization under the WE constraint can be split into two subsequent maximization problems:
[TABLE]
First, the bank finds the revenue-maximizing classifier that maximizes the revenue in the subgroup given some welfare-level (we refer to these classifiers as optimal marginal classifiers). Then, the bank finds the bank-optimal level of by maximizing the total revenue , where as in the statement of the proposition, denotes .
Thus, the bank-optimal WE classifier equals provided that the optimization problems for and have a solution. The threshold representation in Equation (7) will follow from a similar representation for the optimal marginal classifiers . Existence and the threshold representation of is the subject of the next two subsections.
A.1 Existence of and , concavity of
For a given welfare-level of a group , the set of feasible marginal classifiers is non-empty if and only if . Indeed, for any we have ; therefore, the set is empty outside of this interval. For any inside, the constant classifier and thus is non-empty. Therefore, for , is finite; outside this interval we assume . Let us show that is concave and continuous and that supremum is, in fact, maximum, i.e., that the bank-optimal (marginal) classifier exists.
For any and the convex combination for belongs to , where . Therefore, . Taking supremum over and we obtain . Thus is concave in .
Next, we prove that the maximum is attained. Consider , a sequence of classifiers such that . This sequence is a bounded set in the Hilbert space ; thus, by the Banach-Alaoglu theorem (John 1985, Section 5), this sequence contains a weakly-convergent subsequence . Since is a scalar product of and in , we get . Thus the maximum is attained.
A similar argument proves continuity of . By concavity, continuity on follows from upper-semi-continuity: for any sequence Consider the sequence of bank-optimal classifiers . There is a weakly-convergent subsequence with some as the weak limit. We have and similarly . Thus and .
The existence of the bank-optimal , where
[TABLE]
follows from the continuity of for .
A.2 Threshold structure of the bank-optimal marginal classifiers
By concavity and continuity of , the super-gradient at is non-empty. Pick an element from it.
By the definition of super-gradient, for any it holds that . Equivalently, for the bank-optimal classifier and an arbitrary classifier we have
[TABLE]
In other words, the bank-optimal classifier maximizes over all . The converse is also true: any maximizer gives a bank-optimal classifier provided that it belongs to .
Since
[TABLE]
the maximizer equals when and equals [math] if the inequality has the strict opposite sign. The condition imposes the constraint on otherwise arbitrary values of for with . ∎
A.3 Example: the bank-optimal classifiers for DP and EO
Proposition 2 immediately provides the bank-optimal classifiers under DP and EO: for the former, it is given by the “additive perturbation” of the optimal unconstrained classifier, while the latter is obtained by the “multiplicative perturbation.”
More precisely, for the optimal classifier satisfying DP we recover the result of Corbett-Davies et al. (2017): there exist group-dependent constants , such that
[TABLE]
where is the probability that a borrower with attributes and pays back, and the threshold is defined by (2).
For the bank-optimal EO classifier , there exist , such that
[TABLE]
Appendix B Unconstrained bank-optimal classifier is unfair
Here we show that if the bank is free to pick any classifier (i.e., the regulator imposes no constraints), the bank-optimal unconstrained classifier (formally defined in Subsection 2.2) can discriminate against one of the groups even if the groups are of equal size and contain the same fraction of good borrowers. We illustrate this claim by the following example.
Example 9* (Bank-optimal unconstrained classifier is unfair).*
Let and be binary, and let all four combinations have the probability . The probability of being a good borrower is given by the matrix
[TABLE]
In the group , the attribute poorly separates good and bad borrowers, while it is a perfect predictor of creditworthiness for . If losses from a defaulting client are equal to the revenue from two borrowers paying back, e.g. , then the revenue-maximizing classifier has the following form: the bank gives the loans to those borrowers that have the probability of paying back above the threshold . Thus, the bank gives loans to applicants with only. As a result, no loans are given to the group , although the prior distribution shows that in total of borrowers from each of the groups and are good. This contradicts the intuitive understanding of fairness. Moreover, for any utility function , the welfare of the group is below the welfare of the group , i.e., the optimal unconstrained classifier is not -Welfare-Equalizing for any .
Appendix C Unawareness Can Harm Both Groups and the Bank
Unawareness is perhaps the most intuitive fairness criterion. A classifier satisfies unawareness if for all . Informally, to deliver a “fair” outcome to both groups and , the classifier must ignore the protected attribute .
This natural idea is not innocuous: and can be dependent and thus information contained in can be used as a proxy for , making the disadvantaged group even worse off; see Doleac and Hansen (2016). Despite its flaws, unawareness is promoted by layers in the labor market, insurance market121212By the decision of the Court of Justice of the European Union, insurance premiums must be determined in a gender-blind way starting from December 2012. This initiative eventually increased the gap between premiums paid by females and males (Collinson 14 January 2017)., and also by GDPR.131313The General Data Protection Regulation is a law adopted by the European Union in 2016 that regulates data protection and privacy.
The bank-optimal unaware classifier can be constructed by the same logic as the optimal unconstrained one; see Subsection 2 and Corbett-Davies et al. (2017): an applicant receives money if the probability of being a good borrower given the observed attribute is above the threshold . Formally, for borrowers such that and if .
We now show that Unawareness can make all three parties strictly worse off: both groups and the bank, regardless of the underlying utility function . The following example shows this phenomenon when information losses caused by Unawareness are significant: the interpretation of the non-protected attribute depends on the protected141414For example, having children correlates with spending more time at work for men and has negative correlation for women (Parker and Wang 2013). attribute and without knowing the classifier cannot achieve good separation of good and bad borrowers; thus giving loans becomes too risky.
Example 10*.*
Suppose that and all the four combinations of attributes have an equal probability of . The fraction of good borrowers is given by the following table
[TABLE]
Notice that the fraction of good and bad borrowers is the same in both groups. Further, is a positive signal about the quality of a borrower in the group and a negative one for .
The bank-optimal unconstrained classifier gives loans to agents with ; thus, if the threshold is between and (for example, if the revenue from giving money to a good borrower equals of the losses from a bad one), then agents with equal to or get loans under . So one half of the members in each group receives loans and the bank gets a positive revenue.
After imposing Unawareness, the bank-optimal classifier compares the threshold with the average fraction of good borrowers for a given , namely . In our example, for every , and thus no loans are given for . Thus, if for all , Unawareness pushes the welfare of both groups to zero for any underlying utility as well as the bank’s revenue.
Appendix D Proof of Corollary 6
Proof of Corollary 6.
By Theorem 4, it is enough to show that, and agree on which group is disadvantaged. Without loss of generality, the group is -disadvantaged. We shall show that is also -disadvantaged. We get the following chain of inequalities:
[TABLE]
The first and the last steps follow from the definition of -approximation, and the second step from the condition in the corollary. Consequently, is -disadvantaged, which completes the proof. ∎
Appendix E Algorithms for Approximately Bank-optimal WE-Classifiers
This section is devoted to finding approximately bank-optimal WE-classifiers. It contains a formal version of the results discussed in Section 4 together with their proofs.
We first assume complete information about the problem (the utilities , revenues/losses , and the probability distribution are known), and then relax this assumption.
Throughout this section we assume that the bank-optimal unconstrained classifier provides non-zero utility to both groups. This positivity condition allows us to avoid degenerate cases.
E.1 Complete information
In this subsection we assume that the underlying utility coincides with the utility assumed by the regulator. Furthermore, these utilities are known to the bank as well as the distribution and functions . In particular, the bank can compute and , namely, the average revenue and the average utility for a given pair.
LP-based approach
As we mentioned in Section 4, the bank-optimal -WE-classifier is the solution to the linear program (LP) given in Figure 2. It maximizes the revenue under the constraint of equal welfare, and hence for a medium size set of attributes (say several thousands) we can compute explicitly by standard LP-methods.
Threshold-based approach
For large sets of attributes , e.g., multidimensional or continuous, solving the LP in Figure 2 is no longer feasible in practice. In this case, we harness the threshold structure of the bank-optimal classifier (see Proposition 2) and use binary search over the thresholds to compute approximately optimal solutions. The approximation notion is captured by Definition 7.
The next proposition shows that the bank can efficiently compute a classifier that approximates the optimal revenue and achieves an exact equality of welfare. To simplify the presentation, in this subsection we assume that the group is disadvantaged.
Proposition 11**.**
Assume that , and are known. Further, assume that the bank has access to an oracle that computes expectations w.r.t. in constant time. Then, for any positive such that
[TABLE]
an (, 0) bank-optimal -WE-classifier can be computed in , uniformly over all other parameters.
As we show below, the assumption on the expectation oracle can be relaxed by sampling, which leads to an approximate welfare equality.
Proof of Proposition 11.
The high-level structure of the proof is as follows. Similarly to the proof of Proposition 2, we represent the optimization problem as finding an approximately bank-optimal marginal classifier for a given welfare level in each subgroup, and then determining the revenue-maximizing . For a given welfare level , the revenue-maximizing classifiers and the revenue itself can be approximated by the binary search over thresholds . Since is a concave function, the revenue-maximizing can be determined by the Golden ratio or the ternary search as in Hardt et al. (2016). The technical difficulty is to bound the number of steps needed for the binary search to converge even though a small change in can result in a massive change both in welfare and revenue.
Now we discuss all the steps in detail. To simplify formulas, we can assume w.l.o.g. that by changing the scale.
Proposition 12 below guarantees that for each subgroup and welfare level , a marginal classifier with welfare and revenue at least can be computed in time by binary search over thresholds . We verify below that the upper bound on ensures that none of exceeds (a technical condition of Proposition 12).
Therefore, for a given we can also compute the optimal total revenue up to in . The total revenue is a continuous concave function of the welfare level (see the proof of Proposition 2) and thus it can be approximately maximized by the Golden-ratio search or by the ternary search as in Hardt et al. (2016).
For the moment, assume that we can compute exactly. Then iterations of the search will find such that , where is the revenue-maximizing welfare level. Indeed, the function is defined on the interval , where , and takes values within by the assumption on the magnitude of . On any interval with containing the bank-optimal welfare level , the value is within of (otherwise, by convexity, there exists such that ); thus, steps of the search are enough.
If, along the way, the values of are computed with precision , the output of the Golden-ratio search satisfies and, therefore, provides an -bank-optimal WE-classifier. Since all iterations of the search algorithm involve computing at a new point, the total run-time is .
In order to satisfy the technical condition of Proposition 12, we need one more twist: the search should start on the smaller interval
[TABLE]
where denotes a point such that the super-gradient of contains (a point can be easily computed as in the proof of Proposition 12). By the definition of , any satisfies the condition of Proposition 12 and it remains to check that the Golden-ratio search, restricted to , still approximates the optimum . Put differently, it is enough to show that . At , First-order conditions imply . Therefore, there exist such that . Pick that is non-negative. Then ; otherwise, which contradicts the normalization of . By Lemma 5, ; hence, . The upper bound on implies that both and are bounded by and thus belongs to . ∎
The next proposition is an auxiliary technical result used in the proof of Proposition 11. It shows that approximately bank-optimal marginal classifier for a group with a given welfare level can be found by binary search over thresholds. The threshold can also be thought as a super-gradient of the marginal revenue . The super-gradient of a concave function may “explode” if the point is close to the boundary. This is why we need an additional technical assumption that the super-gradient of at is not too big.
Proposition 12**.**
Under the assumptions of Proposition 11, for a given , , and w\in\big{[}0,\,\mathbb{E}[u|A=a]\big{]} such that the super-gradient contains with , a marginal classifier for a subgroup with welfare level and revenue of at least can be computed in .
Proof of Proposition 12.
Similarly to the proof of Proposition 11, we can assume w.l.o.g. that . Consider the threshold classifier such that if and equals zero otherwise. For any , this classifier generates the optimal subgroup revenue among all the classifiers having the same welfare level , i.e., equals ; moreover, belongs to the super-gradient of the concave function at (see the proof of Proposition 2).
Note that is a decreasing function of and thus we can use binary search in order to find a such that is close to . By the assumption on we can restrict the search to and thus after steps find and such that and . We can treat as the convex combination , , and consider a classifier . By the construction, has the right welfare level of .
Let us check that . For any concave function , , and , it holds that
[TABLE]
where is any super-gradient of at . The revenue of is given by the convex combination of values , while is the value at the convex combination . Applying the above inequality, we get ; hence, . ∎
E.2 Incomplete information
The assumption of expectation oracle from Proposition 11, as well as the exact knowledge of revenues/losses and can be relaxed. Moreover, in this subsection we allow for a mismatch between the unknown underlying utility function and the known utility assumed by the regulator. We assume that the mismatch is small and interpret as the estimator of .
The following proposition mirrors Proposition 8.
Proposition 13** (Full version of Proposition 8).**
Fix and assume that the bank has access to a sample of and to estimators of and of such that and . Let be such that that also satisfies the upper bound in Inequality (8). Then, an \Big{(}\varepsilon\cdot\max_{x,a}|{r}(x,a)|,\ \ \varepsilon\cdot\max_{x,a}\overline{v}(x,a)\Big{)} bank-optimal -WE classifier can be computed with probability of at least on a sample of size .
The proof of the proposition is deferred to the end of the subsection, and it relies on three auxiliary propositions. First, in Proposition 16 we prove that having a sample access to the distribution is enough for constructing an bank-optimal -WE classifier of a particular functional form.
Definition 14**.**
A classifier is a classifier in the reduced form151515Note that both bank-optimal and -bank-optimal classifiers from Propositions 2 and 11 have a reduced form. if (i.e., depends on and only through and ).
Note that in order to compute the outcome of a reduced form classifier, one must know the exact values of and ; we use their approximations instead. However, the classifier from Proposition 16 turns out to be sensitive to small errors in and . In particular, using the estimator instead of the exact underlying utility can dramatically change the outcome. In Proposition 17, we describe the smoothing technique that allows us to ensure that the outcome of classification is robust to small perturbations of and in the following sense.
Definition 15**.**
A reduced-form classifier is -robust if
[TABLE]
for all numbers and .
Finally, Proposition 18 bounds the losses in revenue and welfare of a -robust classifier that relies on estimators of and instead of the exact values.
Proposition 16**.**
For and satisfying Inequality (8), a reduced form of an bank-optimal -WE classifier can be computed with probability of at least on a sample of size .
Proof of Proposition 16.
We mimic the algorithm from Proposition 11 but change all the exact expectations to empirical averages.
By re-scaling, we assume . The algorithm from Proposition 11 computes the following conditional expectations times: welfares and revenues . By the Chernoff bound, for i.i.d. random variables in we have . Hence, we ensure that each particular expectation is computed with accuracy with probability using a sample of size conditional on . In order to have such a conditional sample, the unconditional sample must be -times bigger. By the union bound, selecting such that we guarantee that with probability of at least all expectations are computed with precision throughout the run of the algorithm. ∎
Proposition 17**.**
The result of Proposition 16 remain true if we additionally ask the computed classifier to be -robust.
Proof of Proposition 17.
We describe the smoothing technique as usual assuming . To compute an -robust classifier , we consider an auxiliary “smoothed” classification problem with , where are uniformly distributed on and are independent from each other and from , and define and .
Next, we compute the reduced form of an bank-optimal -WE classifier in the new problem and define a reduced-form classifier in the original problem (under ) by c(r,\overline{v})=\mathbb{E}_{(\xi_{v},\xi_{r})}c^{\circ}\big{(}r+\xi_{r},\overline{v}+\xi_{v}\big{)}. If we slightly perturb and , then is integrated over almost-coinciding squares in the -plane and hence , i.e., we get robustness. The constructed classifier equalizes welfare up to since and differ by at most .
It remains to check that generates -optimal revenue. The bank-optimal classifier in the original problem induces a classifier that equalizes welfare up to and has revenue in the new problem. We get . Since , we obtain that generates -optimal revenue in the original problem. ∎
Proposition 18**.**
Suppose we have estimators and such that and . Further, let be a reduced form of a -robust bank-optimal WE classifier and define as the version of that uses the estimators and , i.e., . Then, is bank-optimal with
[TABLE]
and
[TABLE]
Proof of Proposition 18.
By -robustness we get
[TABLE]
thus, by applying the triangle inequality we get
[TABLE]
The argument for the revenue is similar but simpler since we need unconditional expectations only. ∎
We are now ready to prove Proposition 13.
Proof of Proposition 13.
As usual assume . By Proposition 17, we can compute an -robust bank-optimal WE classifier in the reduced form on a sample of required size. The lower bound on is chosen in such a way that and thus, by Proposition 18, the classifier is an bank-optimal WE classifier. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Agarwal et al. (2018) Agarwal, A.; Beygelzimer, A.; Dudík, M.; Langford, J.; and Wallach, H. 2018. A reductions approach to fair classification. ar Xiv preprint ar Xiv:1803.02453 .
- 2Balcan et al. (2019) Balcan, M.-F. F.; Dick, T.; Noothigattu, R.; and Procaccia, A. D. 2019. Envy-free classification. In Advances in Neural Information Processing Systems , 1238–1248.
- 3Benade et al. (2018) Benade, G.; Kazachkov, A. M.; Procaccia, A. D.; and Psomas, C.-A. 2018. How to make envy vanish over time. In Proceedings of the 2018 ACM Conference on Economics and Computation , 593–610. ACM.
- 4Brandt et al. (2016) Brandt, F.; Conitzer, V.; Endriss, U.; Lang, J.; and Procaccia, A. D. 2016. Handbook of computational social choice . Cambridge University Press.
- 5Caragiannis et al. (2019) Caragiannis, I.; Kurokawa, D.; Moulin, H.; Procaccia, A. D.; Shah, N.; and Wang, J. 2019. The unreasonable fairness of maximum Nash welfare. ACM Transactions on Economics and Computation (TEAC) .
- 6Cohler et al. (2011) Cohler, Y. J.; Lai, J. K.; Parkes, D. C.; and Procaccia, A. D. 2011. Optimal envy-free cake cutting. In Twenty-Fifth AAAI Conference on Artificial Intelligence .
- 7Collinson (14 January 2017) Collinson, P. 14 January 2017. How an EU gender equality ruling widened inequality. The Guardian .
- 8Corbett-Davies and Goel (2018) Corbett-Davies, S.; and Goel, S. 2018. The measure and mismeasure of fairness: A critical review of fair machine learning. ar Xiv preprint ar Xiv:1808.00023 .
