36:[["$","audio",null,{"id":"tts"}],["$","$L3b",null,{"paperID":"1803.04383","publisher":"arxiv","paperJSON":{"title":"Delayed Impact of Fair Machine Learning","paperID":"1803.04383","avgLineHeight":13.55,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"text":"Fairness in machine learning has predominantly been studied in static classification settings without concern for how decisions change the underlying population over time. Conventional wisdom suggests that fairness criteria promote the long-term well-being of those groups they aim to protect.","element":"span"}],[{"text":"We study how static fairness criteria interact with temporal indicators of well-being, such as long-term improvement, stagnation, and decline in a variable of interest. We demonstrate that even in a one-step feedback model, common fairness criteria in general do not promote improvement over time, and may in fact cause harm in cases where an unconstrained objective would not. We completely characterize the delayed impact of three standard criteria, contrasting the regimes in which these exhibit qualitatively different behavior. In addition, we find that a natural form of measurement error broadens the regime in which fairness criteria perform favorably.","element":"span"}],[{"text":"Our results highlight the importance of measurement and temporal modeling in the evaluation of fairness criteria, suggesting a range of new challenges and trade-offs.","element":"span"}]]},{"heading":"1 Introduction","paragraphs":[[{"text":"Machine learning commonly considers static objectives defined on a snapshot of the population at one instant in time; consequential decisions, in contrast, reshape the population over time. Lending practices, for example, can shift the distribution of debt and wealth in the population. Job advertisements allocate opportunity. ","element":"span"},{"text":"School admissions shape the level of education in a community.","element":"span"}],[{"text":"Existing scholarship on fairness in automated decision-making criticizes unconstrained machine learning for its potential to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"harm ","element":"span"},{"text":"historically underrepresented or disadvantaged groups in the population [","element":"span"},{"href":"#id-0","referenceIndex":5,"text":"Executive Office of the President","element":"a"},{"text":", ","element":"span"},{"href":"#id-0","referenceIndex":5,"text":"2016","element":"a"},{"text":", ","element":"span"},{"href":"#id-1","referenceIndex":1,"text":"Barocas and Selbst","element":"a"},{"text":", ","element":"span"},{"href":"#id-1","referenceIndex":1,"text":"2016","element":"a"},{"text":"]. Consequently, a variety of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"fairness criteria ","element":"span"},{"text":"have been proposed as constraints on standard learning objectives. Even though, in each case, these constraints are clearly intended to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"protect ","element":"span"},{"text":"the disadvantaged group by an appeal to intuition, a rigorous argument to that effect is often lacking.","element":"span"}],[{"text":"In this work, we formally examine under what circumstances fairness criteria do indeed promote the long-term well-being of disadvantaged groups measured in terms of a temporal variable of interest. Going beyond the standard classification setting, we introduce a one-step feedback model of decision-making that exposes how decisions change the underlying population over time.","element":"span"}],[{"text":"Our running example is a hypothetical lending scenario. There are two groups in the population with features described by a summary statistic, such as a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"credit score","element":"span"},{"text":", whose distribution differs between the two groups. The bank can choose thresholds for each group at which loans are offered. While group-dependent thresholds may face legal challenges [","element":"span"},{"href":"#id-2","referenceIndex":18,"text":"Ross and Yinger","element":"a"},{"text":", ","element":"span"},{"href":"#id-2","referenceIndex":18,"text":"2006","element":"a"},{"text":"], they are generally inevitable for some of the criteria we examine. The impact of a lending decision has multiple facets. A default event not only diminishes profit for the bank, it also worsens the financial situation of the borrower as reflected in a subsequent decline in credit score. A successful lending outcome leads to profit for the bank and also to an increase in credit score for the borrower.","element":"span"}],[{"text":"When thinking of one of the two groups as disadvantaged, it makes sense to ask what lending policies (choices of thresholds) lead to an expected improvement in the score distribution within that group. An unconstrained bank would maximize profit, choosing thresholds that meet a breakeven point above which it is profitable to give out loans. One frequently proposed fairness criterion, sometimes called demographic parity, requires the bank to lend to both groups at an equal rate. Subject to this requirement the bank would continue to maximize profit to the extent possible. Another criterion, originally called equality of opportunity, equalizes the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"true positive rates ","element":"span"},{"text":"between the two groups, thus requiring the bank to lend in both groups at an equal rate among individuals who repay their loan. Other criteria are natural, but for clarity we restrict our attention to these three.","element":"span"}],[{"text":"Do these fairness criteria benefit the disadvantaged group? When do they show a clear advantage over unconstrained classification? Under what circumstances does profit maximization work in the interest of the individual? These are important questions that we begin to address in this work.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"1.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Contributions","element":"span"}],[{"text":"We introduce a one-step feedback model that allows us to quantify the long-term impact of classi-fication on different groups in the population. We represent each of the two groups ","element":"span"},{"text":"A ","element":"span"},{"text":"and ","element":"span"},{"text":"B ","element":"span"},{"text":"by a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"score ","element":"span"},{"text":"distribution ","element":"span"},{"style":{"height":15.6},"width":220.73,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/1-0.png","element":"img","alt":" πA and πB,","inline":true,"padRight":true},{"text":"respectively. The support of these distributions is a finite set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"corresponding to the possible values that the score can assume. We think of the score as highlighting one variable of interest in a specific domain such that higher score values correspond to a higher probability of a positive outcome. An ","element":"span"},{"style":{"fontStyle":"italic"},"text":"institution ","element":"span"},{"text":"chooses selection policies ","element":"span"},{"style":{"height":17.6},"width":443.83,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/1-1.png","element":"img","alt":" τ A, τ B : X → [0, 1] that","inline":true,"padRight":true},{"text":"assign to each value in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"a number representing the rate of selection for that value. In our example, these policies specify the lending rate at a given credit score within a given group. The institution will always maximize their utility (defined formally later) subject to either (a) no constraint, or (b) equality of selection rates, or (c) equality of true positive rates.","element":"span"}],[{"text":"We assume the availability of a function ","element":"span"},{"style":{"height":17.6},"width":531.65,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/1-2.png","element":"img","alt":" ∆: X → R such that ∆(x","inline":true},{"text":") provides the expected change in score for a selected individual at score ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":". The central quantity we study is the expected difference in the mean score in group ","element":"span"},{"style":{"height":17.6},"width":213.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/1-3.png","element":"img","alt":" j ∈ {A, B}","inline":true,"padRight":true},{"text":"that results from an institutions policy, ∆","element":"span"},{"style":{"height":14.59},"width":42.89,"height":36.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/1-4.png","element":"img","alt":"µj","inline":true,"padRight":true},{"text":"defined formally in Equation (","element":"span"},{"href":"#id-3","text":"2","element":"a"},{"text":"). When modeling the problem, the expected mean difference can also absorb external factors such as “reversion to the mean” so long as they are mean-preserving. Qualitatively, we distinguish between ","element":"span"},{"style":{"height":19.79},"width":1133.06,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/1-5.png","element":"img","alt":" long-term improvement (∆µj > 0), stagnation (∆µj = 0),","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":19.79},"width":288.54,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/1-6.png","element":"img","alt":" decline (∆µj <","inline":true,"padRight":true},{"text":"0). Our findings can be summarized as follows:","element":"span"}],[{"text":"1. Both fairness criteria (equal selection rates, equal true positive rates) can lead to all possible outcomes (improvement, stagnation, and decline) in natural parameter regimes. We provide a complete characterization of when each criterion leads to each outcome in Section ","element":"span"},{"text":"3","element":"span"},{"text":".","element":"span"}],[{"text":"• ","element":"span"},{"text":"There are a class of settings where equal selection rates cause decline, whereas equal true positive rates do not (Corollary ","element":"span"},{"href":"#id-4","text":"3.5","element":"a"},{"text":"),","element":"span"}],[{"text":"• ","element":"span"},{"text":"Under a mild assumption, the institution’s optimal unconstrained selection policy can never lead to decline (Proposition ","element":"span"},{"href":"#id-5","text":"3.1","element":"a"},{"text":").","element":"span"}],[{"text":"2. We introduce the notion of an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"outcome curve ","element":"span"},{"text":"(Figure ","element":"span"},{"href":"#id-6","text":"1","element":"a"},{"text":") which succinctly describes the different regimes in which one criterion is preferable over the others.","element":"span"}],[{"text":"3. We perform experiments on FICO credit score data from 2003 and show that under various models of bank utility and score change, the outcomes of applying fairness criteria are in line with our theoretical predictions.","element":"span"}],[{"text":"4. We discuss how certain types of measurement error (e.g., the bank underestimating the repayment ability of the disadvantaged group) affect our comparison. We find that measurement error narrows the regime in which fairness criteria cause decline, suggesting that measurement should be a factor when motivating these criteria.","element":"span"}],[{"text":"5. We consider alternatives to hard fairness constraints.","element":"span"}],[{"text":"• ","element":"span"},{"text":"We evaluate the optimization problem where fairness criterion is a regularization term in the objective. Qualitatively, this leads to the same findings.","element":"span"}],[{"text":"• ","element":"span"},{"text":"We discuss the possibility of optimizing for group score improvement ∆","element":"span"},{"style":{"height":14.59},"width":42.89,"height":36.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/2-0.png","element":"img","alt":"µj","inline":true,"padRight":true},{"text":"directly subject to institution utility constraints. The resulting solution provides an interesting possible alternative to existing fairness criteria.","element":"span"}],[{"text":"We focus on the impact of a selection policy over a single epoch. The motivation is that the designer of a system usually has an understanding of the time horizon after which the system is evaluated and possibly redesigned. Formally, nothing prevents us from repeatedly applying our model and tracing changes over multiple epochs. In reality, however, it is plausible that over greater time periods, economic background variables might dominate the effect of selection.","element":"span"}],[{"text":"Reflecting on our findings, we argue that careful temporal modeling is necessary in order to accurately evaluate the impact of different fairness criteria on the population. Moreover, an understanding of measurement error is important in assessing the advantages of fairness criteria relative to unconstrained selection. Finally, the nuances of our characterization underline how intuition may be a poor guide in judging the long-term impact of fairness constraints.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"1.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Related work","element":"span"}],[{"text":"Recent work by ","element":"span"},{"href":"#id-7","referenceIndex":9,"text":"Hu and Chen ","element":"a"},{"text":"[","element":"span"},{"href":"#id-7","referenceIndex":9,"text":"2018","element":"a"},{"text":"] considers a model for long-term outcomes and fairness in the labor market. They propose imposing the demographic parity constraint in a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"temporary ","element":"span"},{"text":"labor market in order to provably achieve an equitable long-term equilibrium in the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"permanent ","element":"span"},{"text":"labor market, reminiscent of economic arguments for affirmative action [","element":"span"},{"href":"#id-8","referenceIndex":6,"text":"Foster and Vohra","element":"a"},{"text":", ","element":"span"},{"href":"#id-8","referenceIndex":6,"text":"1992","element":"a"},{"text":"]. The equilibrium analysis of the labor market dynamics model allows for specific conclusions relating fairness criteria to long term outcomes. Our general framework is complementary to this type of domain specific approach.","element":"span"}],[{"href":"#id-9","referenceIndex":7,"text":"Fuster et al. ","element":"a"},{"text":"[","element":"span"},{"href":"#id-9","referenceIndex":7,"text":"2017","element":"a"},{"text":"] consider the problem of fairness in credit markets from a different perspective. Their goal is to study the effect of machine learning on interest rates in different groups at an equilibrium, under a static model without feedback.","element":"span"}],[{"href":"#id-10","referenceIndex":4,"text":"Ensign et al. ","element":"a"},{"text":"[","element":"span"},{"href":"#id-10","referenceIndex":4,"text":"2017","element":"a"},{"text":"] consider feedback loops in predictive policing, where the police more heavily monitor high crime neighborhoods, thus further increasing the measured number of crimes in those neighborhoods. While the work addresses an important temporal phenomenon using the theory of urns, it is rather different from our one-step feedback model both conceptually and technically.","element":"span"}],[{"text":"Demographic parity and its related formulations have been considered in numerous papers [e.g. ","element":"span"},{"href":"#id-11","referenceIndex":2,"text":"Calders et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-11","referenceIndex":2,"text":"2009","element":"a"},{"text":", ","element":"span"},{"href":"#id-12","referenceIndex":20,"text":"Zafar et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-12","referenceIndex":20,"text":"2017","element":"a"},{"text":"]. ","element":"span"},{"href":"#id-13","referenceIndex":8,"text":"Hardt et al. ","element":"a"},{"text":"[","element":"span"},{"href":"#id-13","referenceIndex":8,"text":"2016","element":"a"},{"text":"] introduced the equality of opportunity constraint that we consider and demonstrated limitations of a broad class of criteria. ","element":"span"},{"href":"#id-14","referenceIndex":14,"text":"Kleinberg ","element":"a"},{"href":"#id-14","referenceIndex":14,"text":"et al. ","element":"a"},{"text":"[","element":"span"},{"href":"#id-14","referenceIndex":14,"text":"2017","element":"a"},{"text":"] and ","element":"span"},{"href":"#id-15","referenceIndex":3,"text":"Chouldechova ","element":"a"},{"text":"[","element":"span"},{"href":"#id-15","referenceIndex":3,"text":"2016","element":"a"},{"text":"] point out the tension between “calibration by group” and equal true/false positive rates. These trade-offs carry over to some extent to the case where we only equalize true positive rates [","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"Pleiss et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"2017","element":"a"},{"text":"].","element":"span"}],[{"text":"A growing literature on fairness in the “bandits” setting of learning [see ","element":"span"},{"href":"#id-17","referenceIndex":10,"text":"Joseph et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-17","referenceIndex":10,"text":"2016","element":"a"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"et sequelae","element":"span"},{"text":"] deals with online decision making that ought not to be confused with our one-step feedback setting. Finally, there has been much work in the social sciences on analyzing the effect of affirmative action [see e.g., ","element":"span"},{"href":"#id-18","referenceIndex":12,"text":"Keith et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-18","referenceIndex":12,"text":"1985","element":"a"},{"text":", ","element":"span"},{"href":"#id-19","referenceIndex":11,"text":"Kalev et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-19","referenceIndex":11,"text":"2006","element":"a"},{"text":"].","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"1.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Discussion","element":"span"}],[{"text":"In this paper, we advocate for a view toward long-term outcomes in the discussion of “fair” machine learning. We argue that without a careful model of delayed outcomes, we cannot foresee the impact a fairness criterion would have if enforced as a constraint on a classification system. However, if such an accurate outcome model is available, we show that there are more direct ways to optimize for positive outcomes than via existing fairness criteria. We outline such an outcome-based solution in Section ","element":"span"},{"href":"#id-20","text":"4.3","element":"a"},{"text":". Specifically, in the credit setting, the outcome-based solution corresponds to giving out more loans to the protected group in a way that reduces profit for the bank compared to unconstrained profit maximization, but avoids loaning to those who are unlikely to benefit, resulting in a maximally improved group average credit score. The extent to which such a solution could form the basis of successful regulation depends on the accuracy of the available outcome model.","element":"span"}],[{"text":"This raises the question if our model of outcomes is rich enough to faithfully capture realistic phenomena. By focusing on the impact that selection has on individuals at a given score, we model the effects for those ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not ","element":"span"},{"text":"selected as zero-mean. ","element":"span"},{"text":"For example, not getting a loan in our model has no negative effect on the credit score of an individual.","element":"span"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/3-0.png","element":"img","alt":"1 ","inline":true,"padRight":true},{"text":"This does not mean that wrongful rejection (i.e., a false negative) has no visible manifestation in our model. If a classifier has a higher false negative rate in one group than in another, we expect the classifier to increase the disparity between the two groups (under natural assumptions). In other words, in our outcome-based model, the harm of denied opportunity manifests as growing disparity between the groups. The cost of a false negative could also be incorporated directly into the outcome-based model by a simple modification (see Footnote ","element":"span"},{"text":"2","element":"span"},{"text":"). This may be fitting in some applications where the immediate impact of a false negative to the individual is not zero-mean, but significantly reduces their future success probability.","element":"span"}],[{"text":"In essence, the formalism we propose requires us to understand the two-variable causal mechanism that translates decisions to outcomes. This can be seen as relaxing the requirements compared with recent work on avoiding discrimination through causal reasoning that often required stronger assumptions [","element":"span"},{"href":"#id-21","referenceIndex":15,"text":"Kusner et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-21","referenceIndex":15,"text":"2017","element":"a"},{"text":", ","element":"span"},{"href":"#id-22","referenceIndex":16,"text":"Nabi and Shpitser","element":"a"},{"text":", ","element":"span"},{"href":"#id-22","referenceIndex":16,"text":"2017","element":"a"},{"text":", ","element":"span"},{"href":"#id-23","referenceIndex":13,"text":"Kilbertus et al.","element":"a"},{"text":", ","element":"span"},{"href":"#id-23","referenceIndex":13,"text":"2017","element":"a"},{"text":"]. In particular, these works required knowledge of how sensitive attributes (such as gender, race, or proxies thereof) causally relate to various other variables in the data. Our model avoids the delicate modeling step involving the sensitive attribute, and instead focuses on an arguably more tangible economic mechanism. Nonetheless, depending on the application, such an understanding might necessitate greater domain knowledge and additional research into the specifics of the application. This is consistent with much scholarship that points to the context-sensitive nature of fairness in machine learning.","element":"span"}]]},{"heading":"2 Problem Setting","paragraphs":[[{"text":"We consider two ","element":"span"},{"style":{"fontStyle":"italic"},"text":"groups ","element":"span"},{"text":"A ","element":"span"},{"text":"and ","element":"span"},{"text":"B","element":"span"},{"text":", which comprise a ","element":"span"},{"style":{"height":16.4},"width":404.66,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-0.png","element":"img","alt":" gA and gB = 1 − gA","inline":true,"padRight":true},{"text":"fraction of the total population, and an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"institution ","element":"span"},{"text":"which makes a binary decision for each individual in each group, called ","element":"span"},{"style":{"fontStyle":"italic"},"text":"selection","element":"span"},{"text":". Individuals in each group are assigned ","element":"span"},{"style":{"fontStyle":"italic"},"text":"scores ","element":"span"},{"text":"in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":":= [","element":"span"},{"style":{"fontStyle":"italic"},"text":"C","element":"span"},{"text":"], and the scores for group ","element":"span"},{"style":{"height":17.6},"width":207.85,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-1.png","element":"img","alt":" j ∈ {A, B}","inline":true,"padRight":true},{"text":"are distributed according ","element":"span"},{"style":{"height":21},"width":352.41,"height":52.5,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-2.png","element":"img","alt":" πj ∈ SimplexC−1.","inline":true,"padRight":true},{"text":"The institution selects a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"policy ","element":"span"},{"style":{"height":20.26},"width":731.36,"height":50.65,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-3.png","element":"img","alt":"τ := (τ A, τ B) ∈ [0, 1]2C, where τ j(x","inline":true},{"text":") corresponds to the probability the institution selects an individual in group ","element":"span"},{"text":"j ","element":"span"},{"text":"with score ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":". One should think of a score as an abstract quantity which summarizes how well an individual is suited to being selected; examples are provided at the end of this section.","element":"span"}],[{"text":"We assume that the institution is utility-maximizing, but may impose certain constraints to ensure that the policy ","element":"span"},{"style":{"height":16.4},"width":157.99,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-4.png","element":"img","alt":" τ is fair","inline":true},{"text":", in a sense described in Section ","element":"span"},{"href":"#id-24","text":"2.2","element":"a"},{"text":". We assume that there exists a function ","element":"span"},{"style":{"height":12.8},"width":200.27,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-5.png","element":"img","alt":" u : C → R","inline":true},{"text":", such that the institution’s expected utility for a policy ","element":"span"},{"style":{"height":8},"width":27,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-6.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"is given by","element":"span"}],[{"id":"id-29","style":{"width":"73%"},"width":1370,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-7.png","element":"img"}],[{"text":"Novel to this work, we focus on the effect of the selection policy ","element":"span"},{"style":{"height":8},"width":27,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-8.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"on the groups ","element":"span"},{"text":"A ","element":"span"},{"text":"and ","element":"span"},{"text":"B","element":"span"},{"text":". We quantify these ","element":"span"},{"style":{"fontStyle":"italic"},"text":"outcomes ","element":"span"},{"text":"in terms of an average effect that a policy ","element":"span"},{"style":{"height":13.13},"width":40.6,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-9.png","element":"img","alt":" τ j","inline":true,"padRight":true},{"text":"has on group ","element":"span"},{"text":"j","element":"span"},{"text":". Formally, for a function ","element":"span"},{"style":{"height":17.6},"width":274.46,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-10.png","element":"img","alt":" ∆(x) : X → R","inline":true},{"text":", we define the average change of the mean score ","element":"span"},{"style":{"height":18.99},"width":262,"height":47.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-11.png","element":"img","alt":" µj for group j","inline":true}],[{"id":"id-3","style":{"width":"69%"},"width":1300,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-12.png","element":"img"}],[{"text":"We remark that many of our results also go through if ∆","element":"span"},{"style":{"height":19.79},"width":85.87,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-13.png","element":"img","alt":"µj(τ","inline":true},{"text":") simply refers to an abstract change in well-being, not necessarily a change in the mean score. Furthermore, it is possible to modify the definition of ∆","element":"span"},{"style":{"height":19.79},"width":85.88,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-14.png","element":"img","alt":"µj(τ","inline":true},{"text":") such that it directly considers outcomes of those who are not selected.","element":"span"},{"style":{"height":18.73},"width":164.36,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-15.png","element":"img","alt":"2 Lastly,","inline":true,"padRight":true},{"text":"we assume that the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"success ","element":"span"},{"text":"of an individual is independent of their group given the score; that is, the score summarizes all relevant information about the success event, so there exists a function ","element":"span"},{"style":{"height":17.6},"width":214.39,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-16.png","element":"img","alt":"ρ : X → [0,","inline":true,"padRight":true},{"text":"1] such that individuals of score ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"succeed with probability ","element":"span"},{"style":{"height":17.6},"width":97.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-17.png","element":"img","alt":" ρ(x).","inline":true}],[{"text":"We now introduce the specific domain of credit scores as a running example in the rest of the paper, after which we present two more examples showing the general applicability of our formulation to many domains.","element":"span"}],[{"id":"id-27","style":{"fontWeight":"bold"},"text":"Example 2.1 ","element":"span"},{"text":"(Credit scores)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"In the setting of loans, scores ","element":"span"},{"style":{"height":17.6},"width":124.39,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-18.png","element":"img","alt":" x ∈ [C","inline":true},{"text":"] represent credit scores, and the bank serves as the institution. The bank chooses to grant or refuse loans to individuals according to a policy ","element":"span"},{"style":{"height":8},"width":27,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/4-19.png","element":"img","alt":" τ","inline":true},{"text":". Both bank and personal utilities are given as functions of loan repayment, and therefore depend on the success probabilities ","element":"span"},{"style":{"height":17.6},"width":68.67,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-0.png","element":"img","alt":" ρ(x","inline":true},{"text":"), representing the probability that any individual with credit score ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"can repay a loan within a fixed time frame. The expected utility to the bank is given by the expected return from a loan, which can be modeled as an affine function of ","element":"span"},{"style":{"height":17.6},"width":97.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-1.png","element":"img","alt":" ρ(x):","inline":true},{"style":{"height":17.6},"width":783.37,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-2.png","element":"img","alt":"u(x) = u+ρ(x) + u−(1 − ρ(x)), where u+","inline":true,"padRight":true},{"text":"denotes the profit when loans are repaid and ","element":"span"},{"style":{"height":15.02},"width":209.65,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-3.png","element":"img","alt":" u− the loss","inline":true,"padRight":true},{"text":"when they are defaulted on. Individual outcomes of being granted a loan are based on whether or not an individual repays the loan, and a simple model for ","element":"span"},{"style":{"height":17.6},"width":83.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-4.png","element":"img","alt":" ∆(x","inline":true},{"text":") may also be affine in ","element":"span"},{"style":{"height":17.6},"width":97.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-5.png","element":"img","alt":" ρ(x):","inline":true},{"style":{"height":17.6},"width":546.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-6.png","element":"img","alt":"∆(x) = c+ρ(x) + c−(1 − ρ(x","inline":true},{"text":")), modified accordingly at boundary states. The constant ","element":"span"},{"style":{"height":16.22},"width":204.03,"height":40.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-7.png","element":"img","alt":" c+ denotes","inline":true,"padRight":true},{"text":"the gain in credit score if loans are repaid and ","element":"span"},{"style":{"height":10.62},"width":44.88,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-8.png","element":"img","alt":" c−","inline":true,"padRight":true},{"text":"is the score penalty in case of default.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Example 2.2 ","element":"span"},{"text":"(Advertising)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"A second illustrative example is given by the case of advertising agencies making decisions about which groups to target. An individual with product interest score ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"responds positively to an ad with probability ","element":"span"},{"style":{"height":17.6},"width":68.66,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-9.png","element":"img","alt":" ρ(x","inline":true},{"text":"). The ad agency experiences utility ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") related to click-through rates, which increases with ","element":"span"},{"style":{"height":17.6},"width":68.67,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-10.png","element":"img","alt":" ρ(x","inline":true},{"text":"). Individuals who see the ad but are uninterested may react negatively (becoming less interested in the product), and ","element":"span"},{"style":{"height":17.6},"width":83.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-11.png","element":"img","alt":" ∆(x","inline":true},{"text":") encodes the interest change. If the product is a positive good like education or employment opportunities, interest can correspond to well-being. Thus the advertising agency’s incentives to only show ads to individuals with extremely high interest may leave behind groups whose interest is lower on average. A related historical example occurred in advertisements for computers in the 1980s, where male consumers were targeted over female consumers, arguably contributing to the current gender gap in computing.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Example 2.3 ","element":"span"},{"text":"(College Admissions)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"The scenario of college admissions or scholarship allotments can also be considered within our framework. Colleges may select certain applicants for acceptance according to a score ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":", which could be thought encode a “college preparedness” measure. The students who are admitted might “succeed” (this could be interpreted as graduating, graduating with honors, finding a job placement, etc.) with some probability ","element":"span"},{"style":{"height":17.6},"width":68.67,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-12.png","element":"img","alt":" ρ(x","inline":true},{"text":") depending on their preparedness. The college might experience a utility ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") corresponding to alumni donations, or positive rating when a student succeeds; they might also show a drop in rating or a loss of invested scholarship money when a student is unsuccessful. The student’s success in college will affect their later success, which could be modeled generally by ","element":"span"},{"style":{"height":17.6},"width":83.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-13.png","element":"img","alt":" ∆(x","inline":true},{"text":"). In this scenario, it is challenging to ensure that a single summary statistic ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"captures enough information about a student; it may be more appropriate to consider ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"as a vector as well as more complex forms of ","element":"span"},{"style":{"height":17.6},"width":97.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-14.png","element":"img","alt":" ρ(x).","inline":true}],[{"text":"While a variety of applications are modeled faithfully within our framework, there are limitations to the accuracy with which real-life phenomenon can be measured by strictly binary decisions and success probabilities. Such binary rules are necessary for the definition and execution of existing fairness criteria, (see Sec. ","element":"span"},{"href":"#id-24","text":"2.2","element":"a"},{"text":") and as we will see, even modeling these facets of decision making as binary allows for complex and interesting behavior.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"2.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"The Outcome Curve","element":"span"}],[{"text":"We now introduce important outcome regimes, stated in terms of the change in average group score. A policy (","element":"span"},{"style":{"height":11.2},"width":124.18,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-15.png","element":"img","alt":"τ A, τ B","inline":true},{"text":") is said to cause ","element":"span"},{"style":{"height":19.79},"width":1044.34,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-16.png","element":"img","alt":" active harm to group j if ∆µj(τ j) < 0, stagnation if","inline":true,"padRight":true},{"text":"∆","element":"span"},{"style":{"height":19.79},"width":815.6,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-17.png","element":"img","alt":"µj(τ j) = 0, and improvement if ∆µj(τ j) >","inline":true,"padRight":true},{"text":"0. Under our model, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"policies can be chosen in a standard fashion which applies the same threshold ","element":"span"},{"style":{"height":14.33},"width":147.18,"height":35.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-18.png","element":"img","alt":" τ MaxUtil ","inline":true,"padRight":true},{"text":"for both groups, and is agnostic to the distributions ","element":"span"},{"style":{"height":15.24},"width":209.76,"height":38.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-19.png","element":"img","alt":" πA and πB","inline":true},{"text":". Hence, if we define","element":"span"}],[{"style":{"width":"65%"},"width":1230,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/5-20.png","element":"img"}],[{"id":"id-6","style":{"width":"98%"},"width":1845,"height":877,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-0.png","element":"img"}],[{"text":"Figure 1: The above figure shows the ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"outcome curve","element":"figcaption","subtype":"caption"},{"text":". The horizontal axis represents the selection rate for the population; the vertical axis represents the mean change in score. (a) depicts the full spectrum of outcome regimes, and colors indicate regions of active harm, relative harm, and no harm. In (b): a group that has much potential for gain, in (c): a group that has no potential for gain.","element":"figcaption","subtype":"caption"}],[{"text":"we say that a policy causes ","element":"span"},{"style":{"height":21.93},"width":1307.46,"height":54.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-1.png","element":"img","alt":" relative harm to group j if ∆µj(τ j) < ∆µMaxUtilj , and relative im-","inline":true},{"style":{"height":21.93},"width":652.51,"height":54.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-2.png","element":"img","alt":"provement if ∆µj(τ j) > ∆µMaxUtilj","inline":true,"padRight":true},{"text":". In particular, we focus on these outcomes for a disadvantaged group, and consider whether imposing a fairness constraint improves their outcomes relative to the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"strategy. From this point forward, we take ","element":"span"},{"text":"A ","element":"span"},{"text":"to be disadvantaged or protected group.","element":"span"}],[{"text":"Figure ","element":"span"},{"href":"#id-6","text":"1 ","element":"a"},{"text":"displays the important outcome regimes in terms of ","element":"span"},{"style":{"height":19.16},"width":720.56,"height":47.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-3.png","element":"img","alt":" selection rates βj := �x∈X πj(x)τ j(x).","inline":true,"padRight":true},{"text":"This succinct characterization is possible when considering decision rules based on (possibly randomized) score thresholding, in which all individuals with scores above a threshold are selected. In Section ","element":"span"},{"text":"5","element":"span"},{"text":", we justify the restriction to such ","element":"span"},{"style":{"fontStyle":"italic"},"text":"threshold policies ","element":"span"},{"text":"by showing it preserves optimality. In Section ","element":"span"},{"href":"#id-25","text":"5.1","element":"a"},{"text":", we show that the outcome curve is concave, thus implying that it takes the shape depicted in Figure ","element":"span"},{"href":"#id-6","text":"1","element":"a"},{"text":". To explicitly connect selection rates to decision policies, we define the rate function ","element":"span"},{"style":{"height":18.33},"width":103.71,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-4.png","element":"img","alt":" rπ(τ j","inline":true},{"text":") which returns the proportion of group ","element":"span"},{"text":"j ","element":"span"},{"text":"selected by the policy. We show that this function is invertible for a suitable class of threshold policies, and in fact the outcome curve is precisely the graph of the map from selection rate to outcome ","element":"span"},{"style":{"height":21.3},"width":312.95,"height":53.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-5.png","element":"img","alt":" β �→ ∆µA(r−1πA(β","inline":true},{"text":")). Next, we define ","element":"span"},{"text":"the values of ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-6.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"that mark boundaries of the outcome regions.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition 2.1 ","element":"span"},{"text":"(Selection rates of interest)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"Given the protected group ","element":"span"},{"text":"A","element":"span"},{"text":", the following selection rates are of interest in distinguishing between qualitatively different classes of outcomes (Figure ","element":"span"},{"href":"#id-6","text":"1","element":"a"},{"text":"). We define ","element":"span"},{"style":{"height":17.93},"width":145.56,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-7.png","element":"img","alt":" βMaxUtil ","inline":true,"padRight":true},{"text":"as the selection rate for ","element":"span"},{"style":{"height":16.4},"width":401.88,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-8.png","element":"img","alt":" A under MaxUtil; β0","inline":true,"padRight":true},{"text":"as the harm threshold, such that ∆","element":"span"},{"style":{"height":21.3},"width":412.29,"height":53.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-9.png","element":"img","alt":"µA(r−1πA(β0)) = 0; β∗","inline":true,"padRight":true},{"text":"as the selection rate such that ∆","element":"span"},{"style":{"height":12.19},"width":53.89,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-10.png","element":"img","alt":"µA ","inline":true,"padRight":true},{"text":"is maximized; ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-11.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"as the outcome- ","element":"span"},{"text":"complement of the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"selection rate, ∆","element":"span"},{"style":{"height":21.3},"width":1030.16,"height":53.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/6-12.png","element":"img","alt":"µAr−1πA(β)) = ∆µA(r−1πA(βMaxUtil)) with β > βMaxUtil.","inline":true}],[{"id":"id-24","style":{"fontWeight":"bold"},"text":"2.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Decision Rules and Fairness Criteria","element":"span"}],[{"text":"We will consider policies that maximize the institution’s total expected utility, potentially subject to a constraint: ","element":"span"},{"style":{"height":19.53},"width":291,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-0.png","element":"img","alt":" τ ∈ C ∈ [0, 1]2C ","inline":true,"padRight":true},{"text":"which enforces some notion of “fairness”. Formally, the institution selects ","element":"span"},{"style":{"height":17.6},"width":542.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-1.png","element":"img","alt":" τ∗ ∈ argmax U(τ) s.t. τ ∈ C","inline":true},{"text":". We consider the three following constraints:","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition 2.2 ","element":"span"},{"text":"(Fairness criteria)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"The ","element":"span"},{"style":{"fontStyle":"italic"},"text":"maximum utility ","element":"span"},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":") policy corresponds to the nullconstraint ","element":"span"},{"style":{"height":19.53},"width":221.52,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-2.png","element":"img","alt":" C = [0, 1]2C","inline":true},{"text":", so that the institution is free to focus solely on utility. The ","element":"span"},{"style":{"fontStyle":"italic"},"text":"demographic parity ","element":"span"},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":") policy results in equal selection rates between both groups. ","element":"span"},{"text":"Formally, the constraint is ","element":"span"},{"style":{"height":20.9},"width":1619.11,"height":52.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-3.png","element":"img","alt":" C =�(τ A, τ B) : �x∈X πA(x)τ A = �x∈X πB(x)τ B�. The equal opportunity (EqOpt)","inline":true,"padRight":true},{"text":"policy results in equal true positive rates (TPR) between both group, where TPR is defined as TPR","element":"span"},{"style":{"height":18.33},"width":137.63,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-4.png","element":"img","alt":"j(τ) :=","inline":true}],[{"style":{"height":31.27},"width":478.22,"height":78.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-5.png","element":"img","alt":"�x∈X πj(x)ρ(x)τ(x)�x∈X πj(x)ρ(x) . EqOpt","inline":true,"padRight":true},{"text":"ensures that the conditional probability of selection given that the individual will be successful is independent of the population, formally enforced by the constraint ","element":"span"},{"style":{"height":17.6},"width":818.18,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-6.png","element":"img","alt":" C = {(τ A, τ B) : TPRA(τ A) = TPRB(τ B)} .","inline":true}],[{"text":"Just as the expected outcome ∆","element":"span"},{"style":{"height":11.6},"width":31,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-7.png","element":"img","alt":"µ","inline":true,"padRight":true},{"text":"can be expressed in terms of selection rate for threshold policies, so can the total utility ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U","element":"span"},{"text":". In the unconstrained cause, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U ","element":"span"},{"text":"varies independently over the selection rates for group ","element":"span"},{"text":"A ","element":"span"},{"text":"and ","element":"span"},{"text":"B","element":"span"},{"text":"; however, in the presence of fairness constraints the selection rate for one group determines the allowable selection rate for the other. The selection rates must be equal for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":", but for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"we can define a ","element":"span"},{"style":{"height":19.93},"width":485.02,"height":49.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-8.png","element":"img","alt":" transfer function, G(A→B)","inline":true},{"text":", which for every loan rate ","element":"span"},{"style":{"height":16.8},"width":245.3,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-9.png","element":"img","alt":"β in group A","inline":true,"padRight":true},{"text":"gives the loan rate in group ","element":"span"},{"text":"B ","element":"span"},{"text":"that has the same true positive rate. Therefore, when considering threshold policies, decision rules amount to maximizing functions of single parameters. This idea is expressed in Figure ","element":"span"},{"href":"#id-26","text":"2","element":"a"},{"text":", and underpins the results to follow.","element":"span"}]]},{"heading":"3 Results","paragraphs":[[{"text":"In order to clearly characterize the outcome of applying fairness constraints, we make the following assumption.","element":"span"}],[{"id":"id-28","style":{"fontWeight":"bold"},"text":"Assumption 1 ","element":"span"},{"text":"(Institution utilities)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The institution’s individual utility function is more stringent than the expected score changes, ","element":"span"},{"style":{"height":17.6},"width":528.13,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-10.png","element":"img","alt":" u(x) > 0 =⇒ ∆(x) > 0","inline":true},{"style":{"fontStyle":"italic"},"text":". (For the linear form presented in Example ","element":"span"},{"href":"#id-27","style":{"fontStyle":"italic"},"text":"2.1","element":"a"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"style":{"height":23.4},"width":150.05,"height":58.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-11.png","element":"img","alt":"u−u+ < c−c+ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is necessary and sufficient.)","element":"span"}],[{"text":"This simplifying assumption quantifies the intuitive notion that institutions take a greater risk by accepting than the individual does by applying. For example, in the credit setting, a bank loses the amount loaned in the case of a default, but makes only interest in case of a payback. Using Assumption ","element":"span"},{"href":"#id-28","text":"1","element":"a"},{"text":", we can restrict the position of ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"on the outcome curve in the following sense.","element":"span"}],[{"id":"id-5","style":{"fontWeight":"bold"},"text":"Proposition 3.1 ","element":"span"},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"does not cause active harm)","element":"span"},{"href":"#id-28","style":{"height":17.94},"width":783.39,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-12.png","element":"img","alt":". Under Assumption 1, 0 ≤ ∆µMaxUtil ≤","inline":true,"padRight":true},{"text":"∆","element":"span"},{"style":{"height":15.93},"width":62.82,"height":39.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/7-13.png","element":"img","alt":"µ∗.","inline":true}],[{"text":"We direct the reader to Appendix ","element":"span"},{"text":"C ","element":"span"},{"text":"for the proof of the above proposition, and all subsequent results presented in this section. The results are corollaries to theorems presented in Section ","element":"span"},{"text":"6","element":"span"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"3.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Prospects and Pitfalls of Fairness Criteria","element":"span"}],[{"text":"We begin by characterizing general settings under which fairness criteria act to improve outcomes over unconstrained ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"strategies. For this result, we will assume that group ","element":"span"},{"text":"A ","element":"span"},{"text":"is disadvantaged","element":"span"}],[{"id":"id-26","style":{"width":"38%"},"width":721,"height":795,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-0.png","element":"img"}],[{"text":"Figure 2: Both outcomes ∆","element":"figcaption","subtype":"caption"},{"style":{"height":11.6},"width":31,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-1.png","element":"img","alt":"µ","inline":true,"padRight":true},{"text":"and institution utilities ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"U ","element":"figcaption","subtype":"caption"},{"text":"can be plotted as a function of selection rate for one group. The maxima of the utility curves determine the selection rates resulting from various decision rules.","element":"figcaption","subtype":"caption"}],[{"text":"in the sense that the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"acceptance rate for ","element":"span"},{"text":"B ","element":"span"},{"text":"is large compared to relevant acceptance rates for ","element":"span"},{"text":"A","element":"span"},{"text":".","element":"span"}],[{"id":"id-33","style":{"fontWeight":"bold"},"text":"Corollary 3.2 ","element":"span"},{"text":"(Fairness Criteria can cause Relative Improvement)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(a) Under the assumption that ","element":"span"},{"style":{"height":19.65},"width":690.07,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-2.png","element":"img","alt":"βMaxUtilA < β and βMaxUtilB > βMaxUtilA","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":", there exist population proportions ","element":"span"},{"style":{"height":15.6},"width":222.35,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-3.png","element":"img","alt":" g0 < g1 < 1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that, for all ","element":"span"},{"style":{"height":22.48},"width":1129.01,"height":56.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-4.png","element":"img","alt":" gA ∈ [g0, g1], βMaxUtilA < βDemParityA < β. That is, DemParity","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes relative improvement.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"(b) Under the assumption that there exist ","element":"span"},{"style":{"height":19.65},"width":940.44,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-5.png","element":"img","alt":" βMaxUtilA < β < β′ < β such that βMaxUtilB >","inline":true},{"style":{"height":20.33},"width":435.82,"height":50.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-6.png","element":"img","alt":"G(A→B)(β), G(A→B)(β′)","inline":true},{"style":{"fontStyle":"italic"},"text":", there exist population proportions ","element":"span"},{"style":{"height":15.6},"width":255.41,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-7.png","element":"img","alt":" g2 < g3 < 1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that, for all ","element":"span"},{"style":{"height":13.6},"width":95.9,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-8.png","element":"img","alt":" gA ∈","inline":true,"padRight":true},{"text":"[","element":"span"},{"style":{"height":22.48},"width":858.79,"height":56.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-9.png","element":"img","alt":"g2, g3], βMaxUtilA < βEqOptA < β. That is, EqOpt","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes relative improvement.","element":"span"}],[{"text":"This result gives the conditions under which we can guarantee the existence of settings in which fairness criteria cause improvement relative to ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":". Relying on machinery proved in Section ","element":"span"},{"text":"6","element":"span"},{"text":", the result follows from comparing the position of optima on the utility curve to the outcome curve. Figure ","element":"span"},{"href":"#id-26","text":"2 ","element":"a"},{"text":"displays a illustrative example of both the outcome curve and the institutions’ utility ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U ","element":"span"},{"text":"as a function of the selection rates in group ","element":"span"},{"text":"A","element":"span"},{"text":". In the utility function (","element":"span"},{"href":"#id-29","text":"1","element":"a"},{"text":"), the contributions of each group are weighted by their population proportions ","element":"span"},{"style":{"height":13.13},"width":32.81,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-10.png","element":"img","alt":" gj","inline":true},{"text":", and thus the resulting selection rates are sensitive to these proportions.","element":"span"}],[{"text":"As we see in the remainder of this section, fairness criteria can achieve nearly any position along the outcome curve under the right conditions. This fact comes from the potential mismatch between the outcomes, controlled by ","element":"span"},{"style":{"height":12.4},"width":42,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-11.png","element":"img","alt":" ∆","inline":true},{"text":", and the institution’s utility ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":".","element":"span"}],[{"text":"The next theorem implies that ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"can be bad for long term well-being of the protected group by being over-generous, under the mild assumption that ∆","element":"span"},{"style":{"height":19.65},"width":358.03,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-12.png","element":"img","alt":"µA(βMaxUtilB ) < 0:","inline":true}],[{"id":"id-30","style":{"fontWeight":"bold"},"text":"Corollary 3.3 ","element":"span"},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"can cause harm by being over-eager)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Fix a selection rate ","element":"span"},{"style":{"height":16.4},"width":210.05,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-13.png","element":"img","alt":" β. Assume","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"that ","element":"span"},{"style":{"height":19.65},"width":524.1,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-14.png","element":"img","alt":" βMaxUtilB > β > βMaxUtilA","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":". Then, there exists a population proportion ","element":"span"},{"style":{"height":12},"width":37.81,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-15.png","element":"img","alt":" g0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that, for all ","element":"span"},{"style":{"height":22.48},"width":501.48,"height":56.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-16.png","element":"img","alt":"gA ∈ [0, g0], βDemParityA > β","inline":true},{"style":{"fontStyle":"italic"},"text":". In particular, when ","element":"span"},{"style":{"height":16.8},"width":365.76,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-17.png","element":"img","alt":" β = β0, DemParity","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes active harm, and when ","element":"span"},{"style":{"height":16.8},"width":347.42,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/8-18.png","element":"img","alt":"β = β, DemParity","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes relative harm.","element":"span"}],[{"text":"The assumption ∆","element":"span"},{"style":{"height":19.65},"width":296.36,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-0.png","element":"img","alt":"µA(βMaxUtilB ) <","inline":true,"padRight":true},{"text":"0 implies that a policy which selects individuals from group ","element":"span"},{"text":"A ","element":"span"},{"text":"at the selection rate that ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"would have used for group ","element":"span"},{"text":"B ","element":"span"},{"text":"necessarily lowers average score in ","element":"span"},{"text":"A","element":"span"},{"text":". This is one natural notion of protected group ","element":"span"},{"text":"A","element":"span"},{"text":"’s ‘disadvantage’ relative to group ","element":"span"},{"text":"B","element":"span"},{"text":". In this case, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"penalizes the scores of group ","element":"span"},{"text":"A ","element":"span"},{"text":"even more than a naive ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"policy, as long as group proportion ","element":"span"},{"style":{"height":12},"width":43.81,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-1.png","element":"img","alt":" gA","inline":true,"padRight":true},{"text":"is small enough. Again, small ","element":"span"},{"style":{"height":12},"width":43.82,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-2.png","element":"img","alt":" gA","inline":true,"padRight":true},{"text":"is another notion of group disadvantage.","element":"span"}],[{"text":"Using credit scores as an example, Corollary ","element":"span"},{"href":"#id-30","text":"3.3 ","element":"a"},{"text":"tells us that an overly aggressive fairness criterion will give too many loans to people in a protected group who cannot pay them back, hurting the group’s credit scores on average. In the following theorem, we show that an analogous result holds for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":".","element":"span"}],[{"id":"id-31","style":{"fontWeight":"bold"},"text":"Corollary 3.4 ","element":"span"},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"can cause harm by being over-eager)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose that ","element":"span"},{"style":{"height":21.25},"width":438.66,"height":53.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-3.png","element":"img","alt":" βMaxUtilB > G(A→B)(β)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":19.65},"width":276.99,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-4.png","element":"img","alt":" β > βMaxUtilA","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":". Then, there exists a population proportion ","element":"span"},{"style":{"height":12},"width":37.82,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-5.png","element":"img","alt":" g0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that, for all ","element":"span"},{"style":{"height":17.6},"width":226.62,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-6.png","element":"img","alt":" gA ∈ [0, g0],","inline":true},{"style":{"height":22.48},"width":235.42,"height":56.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-7.png","element":"img","alt":"βEqOptA > β","inline":true},{"style":{"fontStyle":"italic"},"text":". In particular, when ","element":"span"},{"style":{"height":16.8},"width":311.41,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-8.png","element":"img","alt":" β = β0, EqOpt","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes active harm, and when ","element":"span"},{"style":{"height":16.8},"width":294.78,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-9.png","element":"img","alt":" β = β, EqOpt","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes relative harm.","element":"span"}],[{"text":"We remark that in Corollary ","element":"span"},{"href":"#id-31","text":"3.4","element":"a"},{"text":", we rely on the ","element":"span"},{"style":{"height":19.93},"width":481.96,"height":49.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-10.png","element":"img","alt":" transfer function, G(A→B)","inline":true},{"text":", which for every loan rate ","element":"span"},{"style":{"height":16.8},"width":245.65,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-11.png","element":"img","alt":" β in group A","inline":true,"padRight":true},{"text":"gives the loan rate in group ","element":"span"},{"text":"B ","element":"span"},{"text":"that has the same true positive rate. Notice that if ","element":"span"},{"style":{"height":16.33},"width":139.52,"height":40.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-12.png","element":"img","alt":" G(A→B) ","inline":true,"padRight":true},{"text":"were the identity function, Corollary ","element":"span"},{"href":"#id-30","text":"3.3 ","element":"a"},{"text":"and Corollary ","element":"span"},{"href":"#id-31","text":"3.4 ","element":"a"},{"text":"would be exactly the same. Indeed, our framework (detailed in Section ","element":"span"},{"text":"6 ","element":"span"},{"text":"and Appendix ","element":"span"},{"text":"B","element":"span"},{"text":") unifies the analyses for a large class of fairness constraints that includes ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"as specific cases, and allows us to derive results about impact on ∆","element":"span"},{"style":{"height":11.6},"width":31,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-13.png","element":"img","alt":"µ","inline":true,"padRight":true},{"text":"using general techniques. In the next section, we present further results that compare the fairness criteria, demonstrating the usefulness of our technical framework.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"3.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Comparing ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"style":{"fontWeight":"bold"},"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"}],[{"text":"Our analysis of the acceptance rates of ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"in Section ","element":"span"},{"text":"6 ","element":"span"},{"text":"suggests that it is difficult to compare ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"without knowing the full distributions ","element":"span"},{"style":{"height":15.6},"width":311.44,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-14.png","element":"img","alt":" πA, πB, which is","inline":true,"padRight":true},{"text":"necessary to compute the transfer function ","element":"span"},{"style":{"height":16.33},"width":139.52,"height":40.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-15.png","element":"img","alt":" G(A→B)","inline":true},{"text":". In fact, we have found that settings exist both in which ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"causes harm while ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"causes improvement and in which ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"causes improvement while ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"causes harm. There cannot be one general rule as to which fairness criteria provides better outcomes in all settings. We now present simple sufficient conditions on the geometry of the distributions for which ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"is always better than ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"in terms of ∆","element":"span"},{"style":{"height":12.19},"width":67.46,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-16.png","element":"img","alt":"µA.","inline":true}],[{"id":"id-4","style":{"fontWeight":"bold"},"text":"Corollary 3.5 ","element":"span"},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"may avoid active harm where ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"fails)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Fix a selection rate ","element":"span"},{"style":{"height":16.4},"width":39.99,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-17.png","element":"img","alt":" β.","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Suppose ","element":"span"},{"style":{"height":11.2},"width":129.76,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-18.png","element":"img","alt":" πA, πB","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are identical up to a translation with ","element":"span"},{"style":{"height":17.6},"width":863.24,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-19.png","element":"img","alt":" µA < µB, i.e. πA(x) = πB(x+(µB −µA)).","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"For simplicity, take ","element":"span"},{"style":{"height":17.6},"width":85.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-20.png","element":"img","alt":" ρ(x)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"to be linear in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"style":{"fontStyle":"italic"},"text":". Suppose","element":"span"}],[{"style":{"width":"13%"},"width":245,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-21.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Then there exists an interval ","element":"span"},{"text":"[","element":"span"},{"style":{"height":17.6},"width":286.83,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-22.png","element":"img","alt":"g1, g2] ⊆ [0, 1]","inline":true},{"style":{"fontStyle":"italic"},"text":", such that ","element":"span"},{"style":{"height":18.33},"width":793.52,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-23.png","element":"img","alt":" ∀gA > g1, βEqOpt < β while ∀gA < g2,","inline":true},{"style":{"height":17.93},"width":290.4,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-24.png","element":"img","alt":"βDemParity > β","inline":true},{"style":{"fontStyle":"italic"},"text":". In particular, when ","element":"span"},{"style":{"height":16.4},"width":151.65,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-25.png","element":"img","alt":" β = β0","inline":true},{"style":{"fontStyle":"italic"},"text":", this implies ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"style":{"fontStyle":"italic"},"text":"causes active harm but ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"style":{"fontStyle":"italic"},"text":"causes improvement for ","element":"span"},{"style":{"height":17.6},"width":263.53,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-26.png","element":"img","alt":" gA ∈ [g1, g2]","inline":true},{"style":{"fontStyle":"italic"},"text":", but for any ","element":"span"},{"style":{"height":16.4},"width":461.38,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-27.png","element":"img","alt":" gA such that DemParity","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes improvement, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"style":{"fontStyle":"italic"},"text":"also causes improvement.","element":"span"}],[{"text":"To interpret the conditions under which Corollary ","element":"span"},{"href":"#id-4","text":"3","element":"a"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"5 ","element":"span"},{"text":"holds, consider when we might have ","element":"span"},{"style":{"height":21.53},"width":302.87,"height":53.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-28.png","element":"img","alt":"β0 > �x>µA πA","inline":true},{"text":". This is precisely when ∆","element":"span"},{"style":{"height":21.64},"width":336.65,"height":54.09,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-29.png","element":"img","alt":"µA(�x>µA πA) >","inline":true,"padRight":true},{"text":"0, that is, ∆","element":"span"},{"style":{"height":13.79},"width":103.12,"height":34.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/9-30.png","element":"img","alt":"µA >","inline":true,"padRight":true},{"text":"0 for a policy that ","element":"span"},{"text":"selects every individual whose score is above the group ","element":"span"},{"text":"A ","element":"span"},{"text":"mean, which is reasonable in reality. Indeed, the converse would imply that group ","element":"span"},{"text":"A ","element":"span"},{"text":"has such low scores that even selecting all above average individuals in ","element":"span"},{"text":"A ","element":"span"},{"text":"would hurt the average score. In such a case, Corollary ","element":"span"},{"href":"#id-4","text":"3.5 ","element":"a"},{"text":"suggests that ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"is better than ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"at avoiding active harm, because it is more conservative. A natural question then is: can ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"cause relative harm by being too stingy?","element":"span"}],[{"id":"id-32","style":{"fontWeight":"bold"},"text":"Corollary 3.6 ","element":"span"},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"never loans less than ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":", but ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"might)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Recall the definition of the TPR functions ","element":"span"},{"text":"TPR","element":"span"},{"style":{"height":11.2},"width":12,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-0.png","element":"img","alt":"j","inline":true},{"style":{"fontStyle":"italic"},"text":", and suppose that the ","element":"span"},{"style":{"height":18.33},"width":446.93,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-1.png","element":"img","alt":" MaxUtil policy τ MaxUtil ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is such that","element":"span"}],[{"style":{"width":"81%"},"width":1528,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-2.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Then ","element":"span"},{"style":{"height":22.48},"width":941.61,"height":56.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-3.png","element":"img","alt":" βEqOptA < βMaxUtilA < βDemParityA . That is, EqOpt","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"causes relative harm by selecting at a rate lower than ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"text":"The above theorem shows that ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"is never stingier than ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"to the protected group ","element":"span"},{"text":"A","element":"span"},{"text":", as long as a ","element":"span"},{"text":"A ","element":"span"},{"text":"is disadvantaged in the sense that ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"selects a larger proportion of ","element":"span"},{"text":"B ","element":"span"},{"text":"than ","element":"span"},{"text":"A","element":"span"},{"text":". On the other hand, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"can select less of group ","element":"span"},{"text":"A ","element":"span"},{"text":"than ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":", and by definition, cause relative harm. This is a surprising result about ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":", and this phenomenon arises from high levels of in-group inequality for group ","element":"span"},{"text":"A","element":"span"},{"text":". Moreover, we show in Appendix ","element":"span"},{"text":"C ","element":"span"},{"text":"that there are parameter settings where the conditions in Corollary ","element":"span"},{"href":"#id-32","text":"3.6 ","element":"a"},{"text":"are satisfied even under a stringent notion of disadvantage we call CDF domination, described therein.","element":"span"}]]},{"heading":"4 Relaxations of Constrained Fairness","paragraphs":[[{"style":{"fontWeight":"bold"},"text":"4.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Regularized fairness","element":"span"}],[{"text":"In many cases, it may be unrealistic for an institution to ensure that fairness constraints are met exactly. However, one can consider “soft” formulations of fairness constraints which either penalized the differences in acceptance rate (","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":") or the differences in TPR (","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":"). In Appendix ","element":"span"},{"text":"B","element":"span"},{"text":", we formulate these soft constraints as regularized objectives. For example, a soft-","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"can be rendered as","element":"span"}],[{"style":{"width":"73%"},"width":1371,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-4.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":13.2},"width":72.48,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-5.png","element":"img","alt":" λ >","inline":true,"padRight":true},{"text":"0 is a regularization parameter, and Φ(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":") is a convex regularization function. We show that the solutions to these objectives are threshold policies, and can be fully characterized in terms of the group-wise selection rate. We also make rigorous the notion that policies which solve the softconstraint objective interpolate between ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"policies at ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-6.png","element":"img","alt":" λ","inline":true,"padRight":true},{"text":"= 0 and hard-constrained policies (","element":"span"},{"style":{"height":17.6},"width":607.74,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-7.png","element":"img","alt":"DemParity or EqOpt) as λ → ∞","inline":true},{"text":". This fact is clearly demonstrated by the form of the solutions in the special case of the regularization function Φ(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":") = ","element":"span"},{"style":{"fontStyle":"italic"},"text":"|","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"style":{"fontStyle":"italic"},"text":"|","element":"span"},{"text":", provided in the appendix.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"4.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Fairness Under Measurement Error","element":"span"}],[{"text":"Next, consider the implications of an institution with imperfect knowledge of scores. ","element":"span"},{"text":"Under a simple model in which the estimate of an individual’s score ","element":"span"},{"style":{"height":12},"width":136.02,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-8.png","element":"img","alt":" X ∼ π","inline":true,"padRight":true},{"text":"is prone to errors ","element":"span"},{"style":{"fontStyle":"italic"},"text":"e","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":") such that ","element":"span"},{"style":{"height":17.6},"width":543.35,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-9.png","element":"img","alt":" X + e(X) := �X ∼ �π.","inline":true,"padRight":true},{"text":"Constraining the error to be negative results in the setting that scores are systematically ","element":"span"},{"style":{"fontStyle":"italic"},"text":"underestimated","element":"span"},{"text":". ","element":"span"},{"text":"In this setting, it is equivalent to consider the CDF of underestimated distribution ","element":"span"},{"style":{"height":12.8},"width":364.3,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-10.png","element":"img","alt":" �π to be dominated","inline":true,"padRight":true},{"text":"by the CDF true distribution ","element":"span"},{"style":{"height":15.6},"width":187.64,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/10-11.png","element":"img","alt":" π, that is","inline":true}],[{"style":{"height":20.76},"width":759.22,"height":51.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-0.png","element":"img","alt":"�x≥c �π(x) ≤ �x≥c π(x) for all c ∈ [C","inline":true},{"text":"]. Then we can compare the institution’s behavior under ","element":"span"},{"text":"this estimation to its behavior under the truth.","element":"span"}],[{"id":"id-76","style":{"fontWeight":"bold"},"text":"Proposition 4.1 ","element":"span"},{"text":"(Underestimation causes underselection)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Fix the distribution of ","element":"span"},{"style":{"height":15.24},"width":298.38,"height":38.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-1.png","element":"img","alt":" B as πB and let","inline":true},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-2.png","element":"img","alt":"β","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the acceptance rate of ","element":"span"},{"text":"A ","element":"span"},{"style":{"fontStyle":"italic"},"text":"when the institution makes the decision using perfect knowledge of the distribution ","element":"span"},{"style":{"height":16.4},"width":265.81,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-3.png","element":"img","alt":" πA. Denote �β","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"as the acceptance rate when the group is instead taken as ","element":"span"},{"style":{"height":15.24},"width":188.88,"height":38.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-4.png","element":"img","alt":" �πA. Then","inline":true},{"style":{"height":22.48},"width":913.4,"height":56.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-5.png","element":"img","alt":"βMaxUtilA > �βMaxUtilA and βDemParityA > �βDemParityA","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":". If the errors are further such that the true TPR dominates the estimated TPR, it is also true that ","element":"span"},{"style":{"height":22.48},"width":329.73,"height":56.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-6.png","element":"img","alt":" βEqOptA > �βEqOptA .","inline":true}],[{"text":"Because fairness criteria encourage a higher selection rate for disadvantaged groups (Corollary ","element":"span"},{"href":"#id-33","text":"3.2","element":"a"},{"text":"), systematic underestimation widens the regime of their applicability. Furthermore, since the estimated ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"policy underloans, the region for relative improvement in the outcome curve (Figure ","element":"span"},{"href":"#id-6","text":"1","element":"a"},{"text":") is larger, corresponding to more regimes under which fairness criteria can yield favorable outcomes. Thus the potential for measurement error should be a factor when motivating these criteria.","element":"span"}],[{"id":"id-20","style":{"fontWeight":"bold"},"text":"4.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Outcome-based alternative","element":"span"}],[{"text":"As explained in the preceding sections, fairness criteria may actively harm disadvantaged groups. It is thus natural to consider a modified decision rule which involves the explicit maximization of ∆","element":"span"},{"style":{"height":12.19},"width":53.89,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-7.png","element":"img","alt":"µA","inline":true},{"text":". In this case, imagine that the institution’s primary goal is to aid the disadvantaged group, subject to a limited profit loss compared to the maximum possible expected profit ","element":"span"},{"style":{"height":15.13},"width":264.4,"height":37.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-8.png","element":"img","alt":" UMaxUtil. The","inline":true,"padRight":true},{"text":"corresponding problem is as follows.","element":"span"}],[{"style":{"width":"70%"},"width":1324,"height":70,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-9.png","element":"img"}],[{"text":"Unlike the fairness constrained objective, this objective no longer depends on group ","element":"span"},{"text":"B ","element":"span"},{"text":"and instead depends on our model of the mean score change in group ","element":"span"},{"style":{"height":16.99},"width":159.58,"height":42.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-10.png","element":"img","alt":" A, ∆µA.","inline":true}],[{"id":"id-77","style":{"fontWeight":"bold"},"text":"Proposition 4.2 ","element":"span"},{"text":"(Outcome-based solution)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"In the above setting, the optimal bank policy ","element":"span"},{"style":{"height":14.84},"width":139.66,"height":37.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-11.png","element":"img","alt":" τ A is a","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"threshold policy with selection rate ","element":"span"},{"style":{"height":17.6},"width":560.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-12.png","element":"img","alt":" β = min{β∗, βmax}, where β∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the outcome-optimal loan rate and ","element":"span"},{"style":{"height":16.4},"width":90.13,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-13.png","element":"img","alt":" βmax ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the maximum loan rate under the bank’s “budget”.","element":"span"}],[{"text":"The above formulation’s advantage over fairness constraints is that it directly optimizes the outcome of ","element":"span"},{"text":"A ","element":"span"},{"text":"and can be approximately implemented given reasonable ability to predict outcomes. Importantly, this objective shifts the focus to outcome modeling, highlighting the importance of domain specific knowledge. Future work can consider strategies that are robust to outcome model errors.","element":"span"}]]},{"heading":"5 Optimality of Threshold Policies","paragraphs":[[{"text":"Next, we move towards statements of the main theorems underlying the results presented in Section ","element":"span"},{"text":"3","element":"span"},{"text":". We begin by establishing notation which we shall use throughout. Recall that ","element":"span"},{"style":{"height":12.8},"width":181.84,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-14.png","element":"img","alt":" ◦ denotes","inline":true,"padRight":true},{"text":"the Hadamard product between vectors. We identify functions mapping ","element":"span"},{"style":{"height":12.4},"width":146.59,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-15.png","element":"img","alt":" X → R","inline":true,"padRight":true},{"text":"with vectors in ","element":"span"},{"style":{"height":15.13},"width":57.52,"height":37.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-16.png","element":"img","alt":"RC","inline":true},{"text":". We also define the group-wise utilities","element":"span"}],[{"style":{"width":"67%"},"width":1261,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/11-17.png","element":"img"}],[{"text":"so that for ","element":"span"},{"style":{"height":17.6},"width":871.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-0.png","element":"img","alt":" τ = (τ A, τ B), U(τ) := gAUA(τ A) + gBUB(τ B).","inline":true,"padRight":true},{"text":"First, we formally describe threshold policies, and rigorously justify why we may always assume without loss of generality that the institution adopts policies of this form.","element":"span"}],[{"id":"id-59","style":{"fontWeight":"bold"},"text":"Definition 5.1 ","element":"span"},{"text":"(Threshold selection policy)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"A single group selection policy ","element":"span"},{"style":{"height":19.53},"width":195.21,"height":48.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-1.png","element":"img","alt":" τ ∈ [0, 1]C ","inline":true,"padRight":true},{"text":"is called a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"threshold policy ","element":"span"},{"text":"if it has the form of a randomized threshold on score:","element":"span"}],[{"style":{"width":"77%"},"width":1445,"height":184,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-2.png","element":"img"}],[{"text":"As a technicality, if no members of a population have a given score ","element":"span"},{"style":{"height":12.8},"width":133.86,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-3.png","element":"img","alt":" x ∈ X","inline":true},{"text":", there may be multiple threshold policies which yield equivalent selection rates for a given population. To avoid redundancy, we introduce the notation ","element":"span"},{"style":{"height":18.81},"width":177.61,"height":47.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-4.png","element":"img","alt":" τ j ∼=πj τ ′j ","inline":true,"padRight":true},{"text":"to mean that the set of scores on which ","element":"span"},{"style":{"height":20},"width":181.66,"height":49.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-5.png","element":"img","alt":" τ j and τ ′j","inline":true,"padRight":true},{"text":"differ has probability 0 under ","element":"span"},{"style":{"height":13.13},"width":43.4,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-6.png","element":"img","alt":" πj","inline":true},{"text":"; formally, ","element":"span"},{"style":{"height":22.52},"width":347.44,"height":56.3,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-7.png","element":"img","alt":"�x:τ j(x)̸=τ j(x) πj(x","inline":true},{"text":") = 0. For any distribution ","element":"span"},{"style":{"height":17.99},"width":138.89,"height":44.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-8.png","element":"img","alt":" πj, ∼=πj","inline":true,"padRight":true},{"text":"is an equivalence relation. Moreover, we see that if ","element":"span"},{"style":{"height":20},"width":508.8,"height":49.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-9.png","element":"img","alt":" τ j ∼=πj τ ′j, then τ j and τ ′j ","inline":true,"padRight":true},{"text":"both provide the ","element":"span"},{"text":"same utility for the institution, induce the same outcomes for individuals in group ","element":"span"},{"text":"j","element":"span"},{"text":", and have the same selection and true positive rates. Hence, if (","element":"span"},{"style":{"height":11.2},"width":124.18,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-10.png","element":"img","alt":"τ A, τ B","inline":true},{"text":") is an optimal solution to any of ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":", ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":", or ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":", so is any (","element":"span"},{"style":{"height":13.32},"width":124.18,"height":33.29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-11.png","element":"img","alt":"τ ′A, τ ′B","inline":true},{"text":") for which ","element":"span"},{"style":{"height":17.72},"width":531.23,"height":44.29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-12.png","element":"img","alt":" τ A ∼=πA τ ′A and τ B ∼=πB τ ′B.","inline":true}],[{"text":"For threshold policies in particular, their equivalence class under ","element":"span"},{"style":{"height":17.99},"width":67.39,"height":44.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-13.png","element":"img","alt":"∼=πj","inline":true,"padRight":true},{"text":"is uniquely determined by the selection rate function,","element":"span"}],[{"style":{"width":"65%"},"width":1223,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-14.png","element":"img"}],[{"text":"which denotes the fraction of group ","element":"span"},{"text":"j ","element":"span"},{"text":"which is selected. Indeed, we have the following lemma (proved in Appendix ","element":"span"},{"href":"#id-34","text":"A.1","element":"a"},{"text":"):","element":"span"}],[{"id":"id-58","style":{"height":20},"width":539.95,"height":49.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-15.png","element":"img","alt":"Lemma 5.1. Let τ j and τ ′j ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be threshold policies. Then ","element":"span"},{"style":{"height":18.81},"width":180.69,"height":47.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-16.png","element":"img","alt":" τ j ∼=πj τ ′j ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"if and only if ","element":"span"},{"style":{"height":20.8},"width":331.33,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-17.png","element":"img","alt":" rπj(τ j) = rπj(τ ′j).","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Further, ","element":"span"},{"style":{"height":19.98},"width":128.48,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-18.png","element":"img","alt":" rπj(τ j)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a bijection from ","element":"span"},{"style":{"height":18.33},"width":705.42,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-19.png","element":"img","alt":" Tthresh(πj) to [0, 1], where Tthresh(πj)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the set of equivalence classes between threshold policies under ","element":"span"},{"style":{"height":23.21},"width":485.48,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-20.png","element":"img","alt":"∼=πj. Finally, πj ◦ r−1πj (βj)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is well defined.","element":"span"}],[{"text":"Remark that ","element":"span"},{"style":{"height":23.21},"width":119.82,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-21.png","element":"img","alt":" r−1πj (βj","inline":true},{"text":") is an equivalence class rather than a single policy. However, ","element":"span"},{"style":{"height":23.21},"width":258.28,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-22.png","element":"img","alt":" πj ◦r−1πj (τ j) is","inline":true,"padRight":true},{"text":"well defined, meaning that ","element":"span"},{"style":{"height":15.6},"width":295.43,"height":38.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-23.png","element":"img","alt":" πj ◦τ j = πj ◦τ ′j ","inline":true,"padRight":true},{"text":"for any two policies in the same equivalence class. Since ","element":"span"},{"text":"all quantities of interest will only depend on policies ","element":"span"},{"style":{"height":17.53},"width":347.18,"height":43.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-24.png","element":"img","alt":" τ j through πj ◦ τ j","inline":true},{"text":", it does not matter ","element":"span"},{"style":{"fontStyle":"italic"},"text":"which ","element":"span"},{"text":"representative of ","element":"span"},{"style":{"height":23.21},"width":119.82,"height":58.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-25.png","element":"img","alt":" r−1πj (βj","inline":true},{"text":") we pick. Hence, abusing notation slightly, we shall represent ","element":"span"},{"style":{"height":18.33},"width":194.56,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-26.png","element":"img","alt":" Tthresh(πj)","inline":true,"padRight":true},{"text":"by choosing one representative from each equivalence class under ","element":"span"},{"style":{"height":21.51},"width":100.07,"height":53.78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-27.png","element":"img","alt":"∼=πj3.","inline":true}],[{"text":"It turns out the policies which arise in this away are always optimal in the sense that, for a given loan rate ","element":"span"},{"style":{"height":17.82},"width":39.68,"height":44.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-28.png","element":"img","alt":" βj","inline":true},{"text":", the threshold policy ","element":"span"},{"style":{"height":23.21},"width":122.82,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-29.png","element":"img","alt":" r−1πj (βj","inline":true},{"text":") is the (essentially unique) policy which maximizes ","element":"span"},{"text":"both the institution’s utility and the utility of the group. Defining the group-wise utility,","element":"span"}],[{"style":{"width":"67%"},"width":1261,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/12-30.png","element":"img"}],[{"text":"we have the following result:","element":"span"}],[{"id":"id-35","style":{"fontWeight":"bold"},"text":"Proposition 5.1 ","element":"span"},{"text":"(Threshold policies are preferable)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose that ","element":"span"},{"style":{"height":17.6},"width":300.15,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-0.png","element":"img","alt":" u(x) and ∆(x)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are strictly increasing in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"style":{"fontStyle":"italic"},"text":". Given any loaning policy ","element":"span"},{"style":{"height":13.13},"width":40.61,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-1.png","element":"img","alt":" τ j","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for population with distribution ","element":"span"},{"style":{"height":13.02},"width":46.39,"height":32.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-2.png","element":"img","alt":" πj","inline":true},{"style":{"fontStyle":"italic"},"text":", then the policy","element":"span"}],[{"style":{"width":"99%"},"width":1867,"height":156,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-3.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Moreover, both inequalities hold with equality if and only if ","element":"span"},{"style":{"height":22.73},"width":269.03,"height":56.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-4.png","element":"img","alt":" τ j ∼=πj τ threshj .","inline":true}],[{"text":"The map ","element":"span"},{"style":{"height":23.21},"width":326.83,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-5.png","element":"img","alt":" τ j �→ r−1πj (rπj(τ j","inline":true},{"text":")) can be thought of transforming an arbitrary policy ","element":"span"},{"style":{"height":16.73},"width":178.89,"height":41.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-6.png","element":"img","alt":" τ j into a","inline":true,"padRight":true},{"text":"threshold policy with the same selection rate. In this language, the above proposition states that this map never reduces institution utility or individual outcomes. We can also show that optimal ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"policies are threshold policies, as well as all ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"policies under an additional assumption:","element":"span"}],[{"id":"id-39","style":{"fontWeight":"bold"},"text":"Proposition 5.2 ","element":"span"},{"text":"(Existance of optimal threshold policies under fairness constraints)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose that ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is strictly increasing in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"style":{"fontStyle":"italic"},"text":". Then all optimal ","element":"span"},{"style":{"height":19.98},"width":783.81,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-7.png","element":"img","alt":" MaxUtil policies (τ A, τ B) satisfy τ j ∼=πj","inline":true},{"style":{"height":23.41},"width":513.39,"height":58.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-8.png","element":"img","alt":"r−1πj �rπj(τ j)�for j ∈ {A, B}","inline":true},{"style":{"fontStyle":"italic"},"text":". The same holds for all optimal ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"style":{"fontStyle":"italic"},"text":"policies, and if in addition ","element":"span"},{"style":{"height":17.6},"width":196.02,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-9.png","element":"img","alt":"u(x)/ρ(x)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is increasing, the same is true for all optimal ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"style":{"fontStyle":"italic"},"text":"policies.","element":"span"}],[{"text":"To prove proposition ","element":"span"},{"href":"#id-35","text":"5.1","element":"a"},{"text":", we invoke the following general lemma which is proved using standard convex analysis arguments (in Appendix ","element":"span"},{"href":"#id-36","text":"A.2","element":"a"},{"text":"):","element":"span"}],[{"id":"id-37","style":{"height":20.61},"width":843.74,"height":51.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-10.png","element":"img","alt":"Lemma 5.2. Let v ∈ RC, and let w ∈ RC>0","inline":true},{"style":{"fontStyle":"italic"},"text":", and suppose either that ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is increasing in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"style":{"fontStyle":"italic"},"text":", and ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is increasing or, ","element":"span"},{"style":{"height":21.84},"width":1324.14,"height":54.59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-11.png","element":"img","alt":" ∀x ∈ X, w(x) = 0. Let π ∈ SimplexC−1 and fix t ∈ [0, �x∈X π(x) ·","inline":true}],[{"id":"id-38","style":{"width":"99%"},"width":1867,"height":151,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"satisfies ","element":"span"},{"style":{"height":19.13},"width":359.92,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-13.png","element":"img","alt":" τ ∗ ∼=π r−1π (rπ(τ ∗))","inline":true},{"style":{"fontStyle":"italic"},"text":". Moreover, at least one maximizer ","element":"span"},{"style":{"height":17.6},"width":416.9,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-14.png","element":"img","alt":" τ ∗ ∈ Tthresh(π) exists.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-35","style":{"fontStyle":"italic"},"text":"5.1","element":"a"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"We will first prove Proposition ","element":"span"},{"href":"#id-35","text":"5.1 ","element":"a"},{"text":"for the function ","element":"span"},{"style":{"height":17.13},"width":39.31,"height":42.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-15.png","element":"img","alt":" Uj","inline":true},{"text":". Given our nominal policy ","element":"span"},{"style":{"height":19.98},"width":413.64,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-16.png","element":"img","alt":" τ j, let βj = rπj(τ j).","inline":true,"padRight":true},{"text":"We now apply Lemma ","element":"span"},{"href":"#id-37","text":"5.2 ","element":"a"},{"text":"with ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") = ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") and ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") = 1. ","element":"span"},{"text":"For this choice of ","element":"span"},{"style":{"height":19.98},"width":1431.53,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-17.png","element":"img","alt":" v and w, ⟨v, τ⟩ = Uj(τ) and that ⟨πj ◦ w, τ = rπj(τ). Then, if τ j ∈","inline":true,"padRight":true},{"text":"arg max","element":"span"},{"href":"#id-38","style":{"height":19.98},"width":657.46,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-18.png","element":"img","alt":"τ Uj(τ) s.t. rπj(τ) = βj, Lemma 12","inline":true,"padRight":true},{"text":"implies that ","element":"span"},{"style":{"height":23.21},"width":373.58,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-19.png","element":"img","alt":" τ j ∼=πj r−1πj (rπj(τ j)).","inline":true}],[{"style":{"width":"96%"},"width":1800,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-20.png","element":"img"}],[{"text":"which will imply that ","element":"span"},{"style":{"height":13.13},"width":40.6,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-21.png","element":"img","alt":" τ j","inline":true,"padRight":true},{"text":"is a maximizer since ","element":"span"},{"style":{"height":23.21},"width":342.46,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-22.png","element":"img","alt":" τ j ∼=πj r−1πj (rπj(τ j","inline":true},{"text":")) implies that ","element":"span"},{"style":{"height":19.98},"width":309.97,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-23.png","element":"img","alt":" Uj(τ j) = τ j ∼=πj","inline":true},{"style":{"height":23.21},"width":195.61,"height":58.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-24.png","element":"img","alt":"r−1πj (rπj(τ j","inline":true},{"text":")). By Lemma ","element":"span"},{"href":"#id-37","text":"5.2 ","element":"a"},{"text":"there exists a maximizer ","element":"span"},{"style":{"height":20.8},"width":279.71,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-25.png","element":"img","alt":" τ ∗j ∈ Tthresh(π","inline":true},{"text":"), which means that ","element":"span"},{"style":{"height":19.93},"width":100.5,"height":49.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-26.png","element":"img","alt":" τ ∗j =","inline":true},{"style":{"height":23.21},"width":441.01,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-27.png","element":"img","alt":"r−1πj (rπj(τ ∗j )). Since τ ∗j ","inline":true,"padRight":true},{"text":"is feasible, we must have ","element":"span"},{"style":{"height":20.8},"width":318.24,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-28.png","element":"img","alt":" rπj(τ ∗j ) = rπj(τ j","inline":true},{"text":"), and thus ","element":"span"},{"style":{"height":23.21},"width":357.52,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-29.png","element":"img","alt":" τ ∗j = r−1πj (rπj(τ j)),","inline":true,"padRight":true},{"text":"as needed. The same argument follows verbatim if we instead choose ","element":"span"},{"style":{"height":17.6},"width":231.28,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-30.png","element":"img","alt":" v(x) = ∆(x","inline":true},{"text":"), and compute ","element":"span"},{"style":{"height":19.79},"width":319.28,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-31.png","element":"img","alt":"⟨v, τ⟩ = ∆µj(τ).","inline":true}],[{"text":"We now argue Proposition ","element":"span"},{"href":"#id-39","text":"5.2 ","element":"a"},{"text":"for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":", as it is a straightforward application of Lemma ","element":"span"},{"href":"#id-37","text":"5.2","element":"a"},{"text":". We will prove Proposition ","element":"span"},{"href":"#id-39","text":"5.2 ","element":"a"},{"text":"for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"separately in Sections ","element":"span"},{"href":"#id-40","text":"6.1 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-41","text":"6.2","element":"a"},{"text":".","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-39","style":{"fontStyle":"italic"},"text":"5.2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"follows from lemma ","element":"span"},{"href":"#id-37","text":"5.2 ","element":"a"},{"text":"with ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") = ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":"), and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"= 0 and ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontWeight":"bold"},"text":"0","element":"span"},{"text":".","element":"span"}],[{"style":{"width":"1%"},"width":30,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/13-32.png","element":"img"}],[{"id":"id-25","style":{"fontWeight":"bold"},"text":"5.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Quantiles and Concavity of the Outcome Curve","element":"span"}],[{"text":"To further our analysis, we now introduce left and right quantile functions, allowing us to specify thresholds in terms of both selection rate and score cutoffs.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition 5.2 ","element":"span"},{"text":"(Upper quantile function)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"Define Q to be the upper quantile function corresponding to ","element":"span"},{"style":{"height":14.8},"width":113.7,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-0.png","element":"img","alt":" π, i.e.","inline":true}],[{"style":{"width":"90%"},"width":1696,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-1.png","element":"img"}],[{"text":"Crucially Q(","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-2.png","element":"img","alt":"β","inline":true},{"text":") is continuous from the right, and Q","element":"span"},{"style":{"height":18.73},"width":71.31,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-3.png","element":"img","alt":"+(β","inline":true},{"text":") is continuous from the left. Further, Q(","element":"span"},{"style":{"height":18.73},"width":222.92,"height":46.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-4.png","element":"img","alt":"·) and Q+(·","inline":true},{"text":") allow us to compute derivatives of key functions, like the mapping from selection rate ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-5.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"to the group outcome associated with a policy of that rate, ∆","element":"span"},{"style":{"height":19.13},"width":157,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-6.png","element":"img","alt":"µ(r−1π (β","inline":true},{"text":")). Because we take ","element":"span"},{"style":{"height":8},"width":30,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-7.png","element":"img","alt":"π","inline":true,"padRight":true},{"text":"to have discrete support, all functions in this work are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"piecewise linear","element":"span"},{"text":", so we shall need to distinguish between the left and right derivatives, defined as follows","element":"span"}],[{"style":{"width":"88%"},"width":1661,"height":95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-8.png","element":"img"}],[{"text":"For ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"supported on [","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b","element":"span"},{"text":"], we say that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"is left- (resp. right-) differentiable if ","element":"span"},{"style":{"height":17.6},"width":119.54,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-9.png","element":"img","alt":" ∂−f(x","inline":true},{"text":") exists for all ","element":"span"},{"style":{"height":17.6},"width":458.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-10.png","element":"img","alt":" x ∈ (a, b] (resp. ∂+f(y","inline":true},{"text":") exists for all ","element":"span"},{"style":{"height":17.6},"width":164.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-11.png","element":"img","alt":" y ∈ [a, b","inline":true},{"text":")). We now state the fundamental derivative computation which underpins the results to follow:","element":"span"}],[{"id":"id-43","style":{"height":14.62},"width":405.12,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-12.png","element":"img","alt":"Lemma 5.3. Let ex","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denote the vector such that ","element":"span"},{"style":{"height":17.6},"width":898.98,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-13.png","element":"img","alt":" ex(x) = 1, and ex(x′) = 0 for x′ ̸= x. Then","inline":true},{"style":{"height":23.61},"width":515.5,"height":59.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-14.png","element":"img","alt":"πj ◦ r−1πj (β) : [0, 1] → [0, 1]C","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is continuous, and has left and right derivatives","element":"span"}],[{"style":{"width":"82%"},"width":1552,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-15.png","element":"img"}],[{"text":"The above lemma is proved in Appendix ","element":"span"},{"href":"#id-42","text":"A.3","element":"a"},{"text":". Moreover, Lemma ","element":"span"},{"href":"#id-43","text":"5.3 ","element":"a"},{"text":"implies that the outcome curve is concave under the assumption that ","element":"span"},{"style":{"height":17.6},"width":83.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-16.png","element":"img","alt":" ∆(x","inline":true},{"text":") is monotone:","element":"span"}],[{"id":"id-46","style":{"height":16},"width":481.15,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-17.png","element":"img","alt":"Proposition 5.3. Let π","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be a distribution over ","element":"span"},{"style":{"fontStyle":"italic"},"text":"C ","element":"span"},{"style":{"fontStyle":"italic"},"text":"states. Then ","element":"span"},{"style":{"height":19.13},"width":331.51,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-18.png","element":"img","alt":" β �→ ∆µ(r−1π (β))","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is concave. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"fact, if ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is any non-decreasing map from ","element":"span"},{"style":{"height":19.13},"width":478.72,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-19.png","element":"img","alt":" X → R, β �→ ⟨w, r−1π (β)⟩","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is concave.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Recall that a univariate function ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"is concave (and finite) on [","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b","element":"span"},{"text":"] if and only (a) ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"is left- and right-differentiable, (b) for all ","element":"span"},{"style":{"height":17.6},"width":511.97,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-20.png","element":"img","alt":" x ∈ (a, b), ∂−f(x) ≥ ∂+f(x","inline":true},{"text":") and (c) for any ","element":"span"},{"style":{"height":17.6},"width":471.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-21.png","element":"img","alt":" x > y, ∂−f(x) ≤ ∂+f(y).","inline":true}],[{"text":"Observe that ∆","element":"span"},{"href":"#id-43","style":{"height":19.13},"width":1117.42,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-22.png","element":"img","alt":"µ(r−1π (β)) = ⟨∆, π ◦ r−1π (β)⟩. By Lemma 5.3, π ◦ r−1π (β","inline":true},{"text":") has right and left ","element":"span"},{"text":"derivatives ","element":"span"},{"style":{"height":19.33},"width":320.56,"height":48.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-23.png","element":"img","alt":" eQ(β) and eQ+(β)","inline":true},{"text":". Hence, we have that","element":"span"}],[{"style":{"width":"81%"},"width":1527,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-24.png","element":"img"}],[{"text":"Using the fact that ","element":"span"},{"style":{"height":17.6},"width":83.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-25.png","element":"img","alt":" ∆(x","inline":true},{"text":") is monotone, and that Q ","element":"span"},{"style":{"height":17.93},"width":106,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-26.png","element":"img","alt":" ≤ Q+","inline":true},{"text":", we see that ","element":"span"},{"style":{"height":19.13},"width":684.65,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-27.png","element":"img","alt":" ∂+∆µ(f−1π (βB)) ≤ ∂−∆µ(f−1π (βB)),","inline":true,"padRight":true},{"text":"and that ","element":"span"},{"style":{"height":19.13},"width":642.73,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-28.png","element":"img","alt":" ∂∆µ(f−1π (βB)) and ∂+∆µ(f−1π (βB","inline":true},{"text":")) are non-increasing, from which it follows that ∆","element":"span"},{"style":{"height":19.13},"width":219.39,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-29.png","element":"img","alt":"µ(f−1π (βB))","inline":true,"padRight":true},{"text":"is concave. The general concavity result holds by replacing ","element":"span"},{"style":{"height":17.6},"width":323.01,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/14-30.png","element":"img","alt":" ∆(x) with w(x).","inline":true}],[{"id":"id-44","style":{"width":"35%"},"width":665,"height":701,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-0.png","element":"img"}],[{"text":"Figure 3: ","element":"figcaption","subtype":"caption"},{"text":"Considering the utility as a function of selection rates, fairness constraints correspond to restricting the optimization to one-dimensional curves. The ","element":"figcaption","subtype":"caption"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"figcaption","subtype":"caption"},{"text":"(DP) constraint is a straight line with slope 1, while the ","element":"figcaption","subtype":"caption"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"figcaption","subtype":"caption"},{"text":"(EO) constraint is a curve given by the graph of ","element":"figcaption","subtype":"caption"},{"style":{"height":16.34},"width":153.69,"height":40.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-1.png","element":"img","alt":" G(A→B).","inline":true,"padRight":true},{"text":"The derivatives considered throughout Section ","element":"figcaption","subtype":"caption"},{"text":"6 ","element":"span","subtype":"caption"},{"text":"are taken with respect to the selection rate ","element":"figcaption","subtype":"caption"},{"style":{"height":16.4},"width":47.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-2.png","element":"img","alt":" βA","inline":true,"padRight":true},{"text":"(horizontal axis); projecting the EO and DP constraint curves to the horizontal axis recovers concave utility curves such as those shown in the lower panel of Figure ","element":"figcaption","subtype":"caption"},{"href":"#id-26","text":"2 ","element":"a","subtype":"caption"},{"text":"(where ","element":"figcaption","subtype":"caption"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"figcaption","subtype":"caption"},{"text":"in is represented by a horizontal line through the MU optimal solution).","element":"figcaption","subtype":"caption"}]]},{"heading":"6 Proofs of Main Theorems","paragraphs":[[{"text":"We are now ready to present and prove theorems that characterize the selection rates under fairness constraints, namely ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":". ","element":"span"},{"text":"These characterizations are crucial for proving the results in Section ","element":"span"},{"text":"3","element":"span"},{"text":". Our computations also generalize readily to other linear constraints, in a way that will become clear in Section ","element":"span"},{"href":"#id-41","text":"6.2","element":"a"},{"text":".","element":"span"}],[{"id":"id-40","style":{"fontWeight":"bold"},"text":"6.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A Characterization Theorem for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"}],[{"text":"In this section, we provide a theorem that gives an explicit characterization for the range of selection rates ","element":"span"},{"style":{"height":16.4},"width":154.78,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-3.png","element":"img","alt":" βA for A","inline":true,"padRight":true},{"text":"when the bank loans according to ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":". Observe that the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"objective corresponds to solving the following linear program:","element":"span"}],[{"style":{"width":"50%"},"width":939,"height":74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-4.png","element":"img"}],[{"text":"Let us introduce the auxiliary variable ","element":"span"},{"style":{"height":17.6},"width":502.98,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-5.png","element":"img","alt":" β := ⟨πA, τ A⟩ = ⟨πB, τ B⟩","inline":true,"padRight":true},{"text":"corresponding to the selection rate which is held constant across groups, so that all feasible solutions lie on the green DP line in Figure ","element":"span"},{"href":"#id-44","text":"3","element":"a"},{"text":". We can then express the following equivalent linear program:","element":"span"}],[{"style":{"width":"59%"},"width":1111,"height":74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-6.png","element":"img"}],[{"text":"This is equivalent because, for a given ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-7.png","element":"img","alt":" β","inline":true},{"text":", Proposition ","element":"span"},{"href":"#id-39","text":"5.2 ","element":"a"},{"text":"says that the utility maximizing policies are of the form ","element":"span"},{"style":{"height":23.21},"width":206.95,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/15-8.png","element":"img","alt":" τ j = r−1πj (β","inline":true},{"text":"). We now prove this:","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-39","style":{"fontStyle":"italic"},"text":"5.2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"Noting that ","element":"span"},{"style":{"height":19.98},"width":322.02,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-0.png","element":"img","alt":" rπj(τ j) = ⟨πj, τ j⟩","inline":true},{"text":", we see that, by Lemma ","element":"span"},{"href":"#id-37","text":"5.2","element":"a"},{"text":", under the special case where ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") = ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") and ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") = 1, the optimal solution (","element":"span"},{"style":{"height":18.51},"width":332.86,"height":46.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-1.png","element":"img","alt":"τ ∗A(β), τ ∗B(β)) for","inline":true,"padRight":true},{"text":"fixed ","element":"span"},{"style":{"height":18.07},"width":464.94,"height":45.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-2.png","element":"img","alt":" rπA(τ A) = rπB(τ B) = β","inline":true,"padRight":true},{"text":"can be chosen to coincide with the threshold policies. Optimizing over ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-3.png","element":"img","alt":" β","inline":true},{"text":", the global optimal must coincide with thresholds.","element":"span"}],[{"style":{"width":"1%"},"width":30,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-4.png","element":"img"}],[{"text":"Hence, any optimal policy is equivalent to the threshold policy ","element":"span"},{"style":{"height":21.3},"width":579.3,"height":53.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-5.png","element":"img","alt":" τ = (r−1πA(β), r−1πB(β)), where β","inline":true,"padRight":true},{"text":"solves the following optimization:","element":"span"}],[{"id":"id-45","style":{"width":"66%"},"width":1238,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-6.png","element":"img"}],[{"text":"We shall show that the above expression is in fact a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"concave ","element":"span"},{"text":"function in ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-7.png","element":"img","alt":" β","inline":true},{"text":", and hence the set of optimal selection rates can be characterized by first order conditions. This is presented formally in the following theorem:","element":"span"}],[{"id":"id-47","style":{"fontWeight":"bold"},"text":"Theorem 6.1 ","element":"span"},{"text":"(Selection rates for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":")","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The set of optimal selection rates ","element":"span"},{"href":"#id-45","style":{"height":17.6},"width":331.5,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-8.png","element":"img","alt":" β∗ satisfying (17)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"forms a continuous interval ","element":"span"},{"text":"[","element":"span"},{"style":{"height":22.71},"width":389.6,"height":56.78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-9.png","element":"img","alt":"β−DemParity, β+DemParity]","inline":true},{"style":{"fontStyle":"italic"},"text":", such that for any ","element":"span"},{"style":{"height":17.6},"width":346.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-10.png","element":"img","alt":" β ∈ [0, 1], we have","inline":true}],[{"style":{"width":"54%"},"width":1013,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-11.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Note that we can write","element":"span"}],[{"style":{"width":"67%"},"width":1257,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-12.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") is non-decreasing in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":", Proposition ","element":"span"},{"href":"#id-46","text":"5.3 ","element":"a"},{"text":"implies that ","element":"span"},{"style":{"height":21.5},"width":550.52,"height":53.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-13.png","element":"img","alt":" β �→ U��r−1πA(β), r−1πB(β)��is","inline":true,"padRight":true},{"text":"concave in ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-14.png","element":"img","alt":" β","inline":true},{"text":". Hence, all optimal selection rates ","element":"span"},{"style":{"height":16.4},"width":43.98,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-15.png","element":"img","alt":" β∗ ","inline":true,"padRight":true},{"text":"lie in an interval [","element":"span"},{"style":{"height":17.93},"width":127.7,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-16.png","element":"img","alt":"β−, β+","inline":true},{"text":"]. To further characterize this interval, let us us compute left- and right-derivatives.","element":"span"}],[{"style":{"width":"87%"},"width":1642,"height":271,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-17.png","element":"img"}],[{"text":"The same argument shows that","element":"span"}],[{"style":{"width":"53%"},"width":1008,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-18.png","element":"img"}],[{"text":"By concavity of ","element":"span"},{"style":{"height":21.5},"width":392.49,"height":53.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-19.png","element":"img","alt":" U��r−1πA(β), r−1πB(β)��","inline":true},{"text":", a positive right derivative at ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-20.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"implies that ","element":"span"},{"style":{"height":16.4},"width":313.1,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-21.png","element":"img","alt":" β < β∗ for all β∗","inline":true,"padRight":true},{"text":"satisfying (","element":"span"},{"href":"#id-45","text":"17","element":"a"},{"text":"), and similarly, a negative left derivative at ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-22.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"implies that ","element":"span"},{"style":{"height":16.8},"width":504.59,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-23.png","element":"img","alt":" β > β∗ for all β∗ satisfying","inline":true,"padRight":true},{"text":"(","element":"span"},{"href":"#id-45","text":"17","element":"a"},{"text":").","element":"span"}],[{"style":{"width":"1%"},"width":30,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-24.png","element":"img"}],[{"text":"With a result of the above form, we can now easily prove statements such as that in Corollary ","element":"span"},{"href":"#id-30","text":"3.3 ","element":"a"},{"text":"(see appendix ","element":"span"},{"text":"C ","element":"span"},{"text":"for proofs), by fixing a selection rate of interest (e.g. ","element":"span"},{"style":{"height":16.4},"width":41.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/16-25.png","element":"img","alt":" β0","inline":true},{"text":") and inverting the inequalities in Theorem ","element":"span"},{"href":"#id-47","text":"6.1 ","element":"a"},{"text":"to find the exact population proportions under which, for example, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"results in a higher selection rate than ","element":"span"},{"style":{"height":16.4},"width":55.61,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-0.png","element":"img","alt":" β0.","inline":true}],[{"id":"id-41","style":{"fontWeight":"bold"},"text":"6.2 ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"style":{"fontWeight":"bold"},"text":"and General Constraints","element":"span"}],[{"text":"Next, we will provide a theorem that gives an explicit characterization for the range of selection rates ","element":"span"},{"style":{"height":16.4},"width":161.51,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-1.png","element":"img","alt":"βA for A","inline":true,"padRight":true},{"text":"when the bank loans according to ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":". Observe that the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"objective corresponds to solving the following linear program:","element":"span"}],[{"id":"id-48","style":{"width":"82%"},"width":1548,"height":75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-2.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":24.3},"width":201.76,"height":60.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-3.png","element":"img","alt":" wj = ρ⟨ρ,πj⟩","inline":true},{"text":". This problem is similar to the demographic parity optimization in (","element":"span"},{"href":"#id-45","text":"17","element":"a"},{"text":"), except ","element":"span"},{"text":"for the fact that the constraint includes the weights. Whereas we parameterized demographic parity solutions in terms of the acceptance rate ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-4.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"in equation (","element":"span"},{"href":"#id-45","text":"17","element":"a"},{"text":"), we will parameterize equation (","element":"span"},{"href":"#id-48","text":"18","element":"a"},{"text":") in terms of the true positive rate (TPR), ","element":"span"},{"href":"#id-48","style":{"height":17.6},"width":756.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-5.png","element":"img","alt":" t := ⟨wA ◦ πA, τ A⟩. Thus, (18) becomes","inline":true}],[{"id":"id-51","style":{"width":"88%"},"width":1659,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-6.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":19.95},"width":521.82,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-7.png","element":"img","alt":" tmax = minj∈{A,B}{⟨πj, wj⟩}","inline":true,"padRight":true},{"text":"is the largest possible TPR. The magenta EO curve in Figure ","element":"span"},{"href":"#id-44","text":"3 ","element":"a"},{"text":"illustrates that feasible solutions to this optimization problem lie on a curve parametrized by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". Note that the objective function decouples for ","element":"span"},{"style":{"height":17.6},"width":186.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-8.png","element":"img","alt":" j ∈ {A, B}","inline":true,"padRight":true},{"text":"for the inner optimization problem,","element":"span"}],[{"id":"id-49","style":{"width":"75%"},"width":1417,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-9.png","element":"img"}],[{"text":"We will now show that all optimal solutions for this inner optimization problem are ","element":"span"},{"style":{"height":13.13},"width":43.4,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-10.png","element":"img","alt":" πj","inline":true},{"text":"-a.e. equal to a policy in ","element":"span"},{"style":{"height":18.33},"width":178.55,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-11.png","element":"img","alt":" Tthresh(πj","inline":true},{"text":"), and thus can be written as ","element":"span"},{"style":{"height":23.21},"width":119.83,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-12.png","element":"img","alt":" r−1πj (βj","inline":true},{"text":"), depending only on the resulting selection ","element":"span"},{"text":"rate.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-39","style":{"fontStyle":"italic"},"text":"5.2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"We apply Lemma ","element":"span"},{"href":"#id-37","text":"5.2 ","element":"a"},{"text":"to the inner optimization in (","element":"span"},{"href":"#id-49","text":"20","element":"a"},{"text":") with ","element":"span"},{"style":{"height":28.61},"width":572.72,"height":71.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-13.png","element":"img","alt":"v(x) = u(x) and w(x) = ρ(x)⟨ρ,πj⟩","inline":true},{"text":". The claim follows from the assumption that ","element":"span"},{"style":{"height":17.6},"width":179.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-14.png","element":"img","alt":" u(x)/ρ(x","inline":true},{"text":") is increasing ","element":"span"},{"text":"by optimizing over ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":".","element":"span"}],[{"text":"This selection rate ","element":"span"},{"style":{"height":17.93},"width":36.69,"height":44.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-15.png","element":"img","alt":" βj","inline":true,"padRight":true},{"text":"is uniquely determined by the TPR ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"(proof appears in Appendix ","element":"span"},{"href":"#id-50","text":"B.1","element":"a"},{"text":"):","element":"span"}],[{"id":"id-62","style":{"fontWeight":"bold"},"text":"Lemma 6.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose that ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"> ","element":"span"},{"text":"0 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"style":{"fontStyle":"italic"},"text":". Then the function","element":"span"}],[{"style":{"width":"27%"},"width":523,"height":61,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-16.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"is a bijection from ","element":"span"},{"text":"[0","element":"span"},{"style":{"height":18.33},"width":333.05,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-17.png","element":"img","alt":", 1] to [0, ⟨πj, w⟩].","inline":true}],[{"text":"Hence, for any ","element":"span"},{"style":{"height":17.6},"width":201.62,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-18.png","element":"img","alt":" t ∈ [0, tmax","inline":true},{"text":"], the mapping from TPR to acceptance rate, ","element":"span"},{"style":{"height":23.21},"width":116.59,"height":58.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-19.png","element":"img","alt":" T −1j,wj(t","inline":true},{"text":"), is well defined ","element":"span"},{"text":"and any solution to (","element":"span"},{"href":"#id-49","text":"20","element":"a"},{"text":") is ","element":"span"},{"style":{"height":13.13},"width":43.4,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-20.png","element":"img","alt":" πj","inline":true},{"text":"-a.e. equal to the policy ","element":"span"},{"href":"#id-51","style":{"height":23.21},"width":436.32,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-21.png","element":"img","alt":" r−1πj (T −1j,wj(t)). Thus (19","inline":true},{"text":") reduces to","element":"span"}],[{"id":"id-52","style":{"width":"70%"},"width":1325,"height":114,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/17-22.png","element":"img"}],[{"text":"The above expression parametrizes the optimization problem in terms of a single variable. We shall show that the above expression is in fact a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"concave ","element":"span"},{"text":"function in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", and hence the set of optimal selection rates can be characterized by first order conditions. This is presented formally in the following theorem:","element":"span"}],[{"id":"id-63","style":{"fontWeight":"bold"},"text":"Theorem 6.2 ","element":"span"},{"text":"(Selection rates for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":")","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The set of optimal selection rates ","element":"span"},{"style":{"height":16.8},"width":395.47,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-0.png","element":"img","alt":" β∗ for group A satsi-","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"fying ","element":"span"},{"text":"(","element":"span"},{"href":"#id-51","text":"19","element":"a"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"forms a continuous interval ","element":"span"},{"text":"[","element":"span"},{"style":{"height":22.72},"width":254.11,"height":56.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-1.png","element":"img","alt":"β−EqOpt, β+EqOpt]","inline":true},{"style":{"fontStyle":"italic"},"text":", such that for any ","element":"span"},{"style":{"height":17.6},"width":346.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-2.png","element":"img","alt":" β ∈ [0, 1], we have","inline":true}],[{"style":{"width":"63%"},"width":1184,"height":280,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-3.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Here, ","element":"span"},{"style":{"height":26.84},"width":584.82,"height":67.09,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-4.png","element":"img","alt":" G(A→B)w (β) := T −1B,wB(T −1A,wA(β))","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denotes the (well-defined) map from selection rates ","element":"span"},{"style":{"height":16.8},"width":164.55,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-5.png","element":"img","alt":" βA for A","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"to the selection rate ","element":"span"},{"style":{"height":16.8},"width":166.48,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-6.png","element":"img","alt":" βB for B","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that the policies ","element":"span"},{"style":{"height":21.3},"width":659.58,"height":53.24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-7.png","element":"img","alt":" τ ∗A := r−1πA(βA) and τ ∗B := r−1πB(βB)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfy the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"constraint in ","element":"span"},{"text":"(","element":"span"},{"href":"#id-48","text":"18","element":"a"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Starting with the equivalent problem in (","element":"span"},{"href":"#id-52","text":"21","element":"a"},{"text":"), we use the concavity result of Lemma ","element":"span"},{"href":"#id-53","text":"B.1","element":"a"},{"text":". Because the objective function is the positive weighted sum of two concave functions, it is also concave. Hence, all optimal true positive rates ","element":"span"},{"style":{"height":12.74},"width":32.76,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-8.png","element":"img","alt":" t∗ ","inline":true,"padRight":true},{"text":"lie in an interval [","element":"span"},{"style":{"height":17.54},"width":105.25,"height":43.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-9.png","element":"img","alt":"t−, t+","inline":true},{"text":"]. To further characterize [","element":"span"},{"style":{"height":17.53},"width":105.25,"height":43.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-10.png","element":"img","alt":"t−, t+","inline":true},{"text":"], we can compute left- and right-derivatives, again using the result of Lemma ","element":"span"},{"href":"#id-53","text":"B.1","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"86%"},"width":1614,"height":252,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-11.png","element":"img"}],[{"text":"The same argument shows that","element":"span"}],[{"style":{"width":"78%"},"width":1476,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-12.png","element":"img"}],[{"text":"By concavity, a positive right derivative at ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"implies that ","element":"span"},{"style":{"height":13.2},"width":315.86,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-13.png","element":"img","alt":" t < t∗ for all t∗ ","inline":true,"padRight":true},{"text":"satisfying (","element":"span"},{"href":"#id-52","text":"21","element":"a"},{"text":"), and similarly, a negative left derivative at ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"implies that ","element":"span"},{"style":{"height":13.2},"width":283.32,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-14.png","element":"img","alt":" t > t∗ for all t∗ ","inline":true,"padRight":true},{"text":"satisfying (","element":"span"},{"href":"#id-52","text":"21","element":"a"},{"text":").","element":"span"}],[{"style":{"width":"96%"},"width":1801,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-15.png","element":"img"}],[{"text":"Thus we translate directly into a statement about the selection rates ","element":"span"},{"style":{"height":16.8},"width":260.96,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-16.png","element":"img","alt":" β for group A","inline":true,"padRight":true},{"text":"by seeing that ","element":"span"},{"style":{"height":25.52},"width":764.66,"height":63.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-17.png","element":"img","alt":"T −1A,wA(t) = β and T −1B,wB(t) = G(A→B)w (β).","inline":true}],[{"text":"Lastly, we remark that the results derived in this section go through verbatim for any linear constraint of the form ","element":"span"},{"style":{"height":17.6},"width":549.16,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/18-18.png","element":"img","alt":" ⟨w, πA ◦ τ A⟩ = ⟨w, πB ◦ τ B⟩","inline":true},{"text":", as long as ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") is increasing in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":", and ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"> ","element":"span"},{"text":"0.","element":"span"}],[{"id":"id-55","style":{"width":"62%"},"width":1177,"height":669,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-0.png","element":"img"}],[{"text":"Figure 4: The empirical payback rates as a function of credit score and CDF for both groups from the TransUnion TransRisk dataset.","element":"figcaption","subtype":"caption"}]]},{"heading":"7 Simulations","paragraphs":[[{"text":"We examine the outcomes induced by fairness constraints in the context of FICO scores for two race groups. FICO scores are a proprietary classifier widely used in the United States to predict credit worthiness. Our FICO data is based on a sample of 301,536 TransUnion TransRisk scores from 2003 [","element":"span"},{"href":"#id-54","referenceIndex":19,"text":"US Federal Reserve","element":"a"},{"text":", ","element":"span"},{"href":"#id-54","referenceIndex":19,"text":"2007","element":"a"},{"text":"], preprocessed by ","element":"span"},{"href":"#id-13","referenceIndex":8,"text":"Hardt et al. ","element":"a"},{"text":"[","element":"span"},{"href":"#id-13","referenceIndex":8,"text":"2016","element":"a"},{"text":"]. These scores, corresponding to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"in our model, range from 300 to 850 and are meant to predict credit risk. Empirical data labeled by race allows us to estimate the distributions ","element":"span"},{"style":{"height":17.53},"width":200,"height":43.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-1.png","element":"img","alt":" πj, where j","inline":true,"padRight":true},{"text":"represents race, which is restricted to two values: white non-Hispanic (labeled “white” in figures), and black. Using national demographic data, we set the population proportions to be 18% and 82%.","element":"span"}],[{"text":"Individuals were labeled as defaulted if they failed to pay a debt for at least 90 days on at least one account in the ensuing 18-24 month period; we use this data to estimate the success probability given score, ","element":"span"},{"style":{"height":19.79},"width":79.68,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-2.png","element":"img","alt":" ρj(x","inline":true},{"text":"), which we allow to vary by group to match the empirical data (see Figure ","element":"span"},{"href":"#id-55","text":"4","element":"a"},{"text":"). Our outcome curve framework allows for this relaxation; however, this discrepancy can also be attributed to group-dependent mismeasurement of score, and adjusting the scores accordingly would allow for a single ","element":"span"},{"style":{"height":17.6},"width":68.66,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-3.png","element":"img","alt":" ρ(x","inline":true},{"text":"). We use the success probabilities to define the affine utility and score change functions defined in Example ","element":"span"},{"href":"#id-27","text":"2.1","element":"a"},{"text":". We model individual penalties as a score drop of ","element":"span"},{"style":{"height":10.62},"width":139.4,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-4.png","element":"img","alt":" c− = −","inline":true},{"text":"150 in the case of a default, and in increase of ","element":"span"},{"style":{"height":11.82},"width":44.88,"height":29.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-5.png","element":"img","alt":" c+","inline":true,"padRight":true},{"text":"= 75 in the case of successful repayment.","element":"span"}],[{"text":"In Figure ","element":"span"},{"href":"#id-56","text":"5","element":"a"},{"text":", we display the empirical CDFs along with selection rates resulting from different loaning strategies for two different settings of bank utilities. In the case that the bank experiences a loss/profit ratio of ","element":"span"},{"style":{"height":23.39},"width":149,"height":58.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-6.png","element":"img","alt":"u−u+ = −","inline":true},{"text":"10, no fairness criteria surpass the active harm rate ","element":"span"},{"style":{"height":16.4},"width":41.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-7.png","element":"img","alt":" β0","inline":true},{"text":"; however, in ","element":"span"},{"text":"the case of ","element":"span"},{"style":{"height":23.4},"width":396.29,"height":58.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-8.png","element":"img","alt":"u−u+ = −4, DemParity","inline":true,"padRight":true},{"text":"overloans, in line with the statement in Corollary ","element":"span"},{"href":"#id-30","text":"3.3","element":"a"},{"text":".","element":"span"}],[{"text":"These results are further examined in Figure ","element":"span"},{"href":"#id-57","text":"6","element":"a"},{"text":", which displays the normalized ","element":"span"},{"href":"#id-30","text":"outc","element":"a"},{"text":"ome curves and the utility curves for both the white and the black group. To plot the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"utility curves, the group that is not on display has selection rate fixed at ","element":"span"},{"style":{"height":17.94},"width":145.55,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-9.png","element":"img","alt":" βMaxUtil","inline":true},{"text":". In this figure, the top panel corresponds to the average change in credit scores for each group under different loaning rates ","element":"span"},{"style":{"height":16.4},"width":38.98,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/19-10.png","element":"img","alt":" β;","inline":true,"padRight":true},{"text":"the bottom panels shows the corresponding ","element":"span"},{"style":{"fontStyle":"italic"},"text":"total ","element":"span"},{"text":"utility ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U ","element":"span"},{"text":"(summed over both groups and weighted by group population sizes) for the bank.","element":"span"}],[{"id":"id-56","style":{"width":"67%"},"width":1256,"height":635,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/20-0.png","element":"img"}],[{"text":"Figure 5: The empirical CDFs of both groups are plotted along with the decision thresholds resulting from ","element":"figcaption","subtype":"caption"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"figcaption","subtype":"caption"},{"text":", ","element":"figcaption","subtype":"caption"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"figcaption","subtype":"caption"},{"text":", and ","element":"figcaption","subtype":"caption"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"figcaption","subtype":"caption"},{"text":"for a model with bank utilities set to (a) ","element":"figcaption","subtype":"caption"},{"style":{"height":23.39},"width":326.3,"height":58.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/20-1.png","element":"img","alt":"u−u+ = −4 and (b)","inline":true}],[{"style":{"height":15.74},"width":141.3,"height":39.34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/20-2.png","element":"img","alt":"u+ = −","inline":true},{"text":"10. The threshold for active harm is displayed; in (a) ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"causes active harm while ","element":"span"},{"text":"in (b) it does not. ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"never cause active harm.","element":"span"}],[{"text":"Figure ","element":"span"},{"href":"#id-57","text":"6 ","element":"a"},{"text":"highlights that the position of the utility optima in the lower panel determines the loan (selection) rates. In this specific instance, the utility and change ratios are fairly close, ","element":"span"},{"style":{"height":23.4},"width":179.2,"height":58.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/20-3.png","element":"img","alt":"u−u+ = −4,","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":23.4},"width":138.57,"height":58.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/20-4.png","element":"img","alt":"c−c+ = −","inline":true},{"text":"2, meaning that the bank’s profit motivations align with individual outcomes to some ","element":"span"},{"text":"extent. Here, we can see that ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"loans much closer to optimal than ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":", similar to the setting suggested by Corollary ","element":"span"},{"href":"#id-33","text":"3.2","element":"a"},{"text":".","element":"span"}],[{"text":"Although one might hope for decisions made under fairness constraints to positively affect the black group, we observe the opposite behavior. The ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"policy (solid orange line) and the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"policy result in similar expected credit score change for the black group. ","element":"span"},{"text":"However, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"(dashed green line) causes a negative expected credit score change in the black group, corresponding to active harm. For the white group, the bank utility curve has almost the same shape under the fairness criteria as it does under ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":", the main difference being that fairness criteria lowers the total expected profit from this group.","element":"span"}],[{"text":"This behavior stems from a discrepancy in the outcome and profit curves for each population. While incentives for the bank and positive results for individuals are somewhat aligned for the majority group, under fairness constraints, they are more heavily misaligned in the minority group, as seen in graphs (left) in Figure ","element":"span"},{"href":"#id-57","text":"6","element":"a"},{"text":". We remark that in other settings where the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"unconstrained ","element":"span"},{"text":"profit maximization is misaligned with individual outcomes (e.g., when ","element":"span"},{"style":{"height":23.39},"width":141.6,"height":58.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/20-5.png","element":"img","alt":"u−u+ = −","inline":true},{"text":"10), fairness criteria may ","element":"span"},{"text":"perform more favorably for the minority group by pulling the utility curve into a shape consistent with the outcome curve.","element":"span"}],[{"text":"By analyzing the resulting affects of ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":", ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":", and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"on actual credit score lending data, we show the applicability of our model to real-world applications. In particular, some results shown in Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"hold empirically for the FICO TransUnion TransRisk scores.","element":"span"}],[{"id":"id-57","style":{"width":"86%"},"width":1625,"height":1616,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/21-0.png","element":"img"}],[{"text":"Figure 6: The outcome and utility curves are plotted for both groups against the group selection rates. ","element":"figcaption","subtype":"caption"},{"text":"The relative positions of the utility maxima determine the position of the decision rule thresholds. We hold ","element":"figcaption","subtype":"caption"},{"style":{"height":23.4},"width":334.41,"height":58.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/21-1.png","element":"img","alt":"u−u+ = −4 as fixed.","inline":true}]]},{"heading":"8 Conclusion and Future Work","paragraphs":[[{"text":"We argue that without a careful model of delayed outcomes, we cannot foresee the impact a fairness criterion would have if enforced as a constraint on a classification system. However, if such an accurate outcome model is available, we show that there are more direct ways to optimize for positive outcomes than via existing fairness criteria.","element":"span"}],[{"text":"Our formal framework exposes a concise, yet expressive way to model outcomes via the expected change in a variable of interest caused by an institutional decision. This leads to the natural concept of an outcome curve that allows us to interpret and compare solutions effectively. In essence, the formalism we propose requires us to understand the two-variable causal mechanism that translates decisions to outcomes. Depending on the application, such an understanding might necessitate greater domain knowledge and additional research into the specifics of the application. This is consistent with much scholarship that points to the context-sensitive nature of fairness in machine learning.","element":"span"}],[{"text":"An interesting direction for future work is to consider other characteristics of impact beyond the change in population ","element":"span"},{"style":{"fontStyle":"italic"},"text":"mean","element":"span"},{"text":". Variance and individual-level outcomes are natural and important considerations. Moreover, it would be interesting to understand the robustness of outcome optimization to modeling and measurement errors.","element":"span"}]]},{"heading":"Acknowledgements","paragraphs":[[{"text":"We thank Lily Hu, Aaron Roth, and Cathy O’Neil for discussions and feedback on an earlier version of the manuscript. We thank the students of CS294: Fairness in Machine Learning (Fall 2017, University of California, Berkeley) for inspiring class discussions and comments on a presentation that was a precursor of this work. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship under Grant No. DGE 1752814.","element":"span"}]]},{"heading":"References","paragraphs":[[{"id":"id-1","text":"Solon Barocas and Andrew D. Selbst. Big data’s disparate impact. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"California Law Review","element":"span"},{"text":", 104, 2016.","element":"span"}],[{"id":"id-11","text":"Toon Calders, Faisal Kamiran, and Mykola Pechenizkiy. Building classifiers with independency ","element":"span"},{"text":"constraints. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. IEEE ICDMW","element":"span"},{"text":", ICDMW ’09, pages 13–18, 2009.","element":"span"}],[{"id":"id-15","text":"Alexandra Chouldechova. Fair prediction with disparate impact: A study of bias in recidivism ","element":"span"},{"text":"prediction instruments. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FATML","element":"span"},{"text":", 2016.","element":"span"}],[{"id":"id-10","text":"Danielle Ensign, Sorelle A Friedler, Scott Neville, Carlos Scheidegger, and Suresh Venkatasubra- ","element":"span"},{"text":"manian. Runaway feedback loops in predictive policing. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1706.09847","element":"span"},{"text":", 2017.","element":"span"}],[{"id":"id-0","text":"Executive Office of the President. Big data: A report on algorithmic systems, opportunity, and ","element":"span"},{"text":"civil rights. Technical report, White House, May 2016.","element":"span"}],[{"id":"id-8","text":"Dean P Foster and Rakesh V Vohra. An economic argument for affirmative action. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Rationality and Society","element":"span"},{"text":", 4(2):176–188, 1992.","element":"span"}],[{"id":"id-9","text":"Andreas Fuster, Paul Goldsmith-Pinkham, Tarun Ramadorai, and Ansgar Walther. Predictably ","element":"span"},{"text":"unequal? the effects of machine learning on credit markets. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"SSRN","element":"span"},{"text":", 2017.","element":"span"}],[{"id":"id-13","text":"Moritz Hardt, Eric Price, and Nati Srebro. Equality of opportunity in supervised learning. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. ","element":"span"},{"text":"30","element":"span"},{"style":{"fontStyle":"italic"},"text":"th NIPS","element":"span"},{"text":", 2016.","element":"span"}],[{"id":"id-7","text":"Lily Hu and Yiling Chen. A short-term intervention for long-term fairness in the labor market. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. ","element":"span"},{"text":"27","element":"span"},{"style":{"fontStyle":"italic"},"text":"th WWW","element":"span"},{"text":", 2018.","element":"span"}],[{"id":"id-17","text":"Matthew Joseph, Michael Kearns, Jamie H Morgenstern, and Aaron Roth. Fairness in learning: ","element":"span"},{"text":"Classic and contextual bandits. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. ","element":"span"},{"text":"30","element":"span"},{"style":{"fontStyle":"italic"},"text":"th NIPS","element":"span"},{"text":", pages 325–333, 2016.","element":"span"}],[{"id":"id-19","text":"Alexandra Kalev, Frank Dobbin, and Erin Kelly. Best Practices or Best Guesses? Assessing the ","element":"span"},{"text":"Efficacy of Corporate Affirmative Action and Diversity Policies. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"American Sociological Review","element":"span"},{"text":", 71(4):589–617, 2006.","element":"span"}],[{"id":"id-18","text":"Stephen N. Keith, Robert M. Bell, August G. Swanson, and Albert P. Williams. Effects of affir- ","element":"span"},{"text":"mative action in medical schools. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"New England Journal of Medicine","element":"span"},{"text":", 313(24):1519–1525, 1985.","element":"span"}],[{"id":"id-23","text":"Niki Kilbertus, Mateo Rojas-Carulla, Giambattista Parascandolo, Moritz Hardt, Dominik Janzing, ","element":"span"},{"text":"and Bernhard Sch¨olkopf. Avoiding discrimination through causal reasoning. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"In Proc. ","element":"span"},{"text":"30","element":"span"},{"style":{"fontStyle":"italic"},"text":"th NIPS","element":"span"},{"text":", pages 656–666, 2017.","element":"span"}],[{"id":"id-14","text":"Jon M. Kleinberg, Sendhil Mullainathan, and Manish Raghavan. Inherent trade-offs in the fair ","element":"span"},{"text":"determination of risk scores. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. ","element":"span"},{"text":"8","element":"span"},{"style":{"fontStyle":"italic"},"text":"th ITCS","element":"span"},{"text":", 2017.","element":"span"}],[{"id":"id-21","text":"Matt J. Kusner, Joshua R. Loftus, Chris Russell, and Ricardo Silva. Counterfactual fairness. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"In Proc. ","element":"span"},{"text":"30","element":"span"},{"style":{"fontStyle":"italic"},"text":"th NIPS","element":"span"},{"text":", pages 4069–4079, 2017.","element":"span"}],[{"id":"id-22","text":"Razieh Nabi and Ilya Shpitser. Fair inference on outcomes. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv:1705.10378v1","element":"span"},{"text":", 2017.","element":"span"}],[{"id":"id-16","text":"Geoff Pleiss, Manish Raghavan, Felix Wu, Jon Kleinberg, and Kilian Q Weinberger. On fairness ","element":"span"},{"text":"and calibration. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Advances in Neural Information Processing Systems 30","element":"span"},{"text":", pages 5684–5693, 2017.","element":"span"}],[{"id":"id-2","text":"Stephen Ross and John Yinger. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The Color of Credit: Mortgage Discrimination, Research Methodology, and Fair-Lending Enforcement","element":"span"},{"text":". MIT Press, Cambridge, 2006.","element":"span"}],[{"id":"id-54","text":"US Federal Reserve. Report to the congress on credit scoring and its effects on the availability and ","element":"span"},{"text":"affordability of credit, 2007.","element":"span"}],[{"id":"id-12","text":"Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rogriguez, and Krishna P. Gummadi. Fair- ","element":"span"},{"text":"ness Constraints: Mechanisms for Fair Classification. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. ","element":"span"},{"text":"20","element":"span"},{"style":{"fontStyle":"italic"},"text":"th AISTATS","element":"span"},{"text":", pages 962–970. PMLR, 2017.","element":"span"}]]},{"heading":"A Optimality of Threshold Policies","paragraphs":[[{"id":"id-34","style":{"fontWeight":"bold"},"text":"A.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Lemma ","element":"span"},{"href":"#id-58","style":{"fontWeight":"bold"},"text":"5.1","element":"a"}],[{"text":"We begin with the first statement of the lemma. Suppose ","element":"span"},{"style":{"height":18.81},"width":177.61,"height":47.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-0.png","element":"img","alt":" τ j ∼=πj τ ′j","inline":true},{"text":". Then there exists a set ","element":"span"},{"style":{"height":13.2},"width":123.88,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-1.png","element":"img","alt":" S ⊂ X","inline":true,"padRight":true},{"text":"such that ","element":"span"},{"style":{"height":18.33},"width":84.38,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-2.png","element":"img","alt":" πj(x","inline":true},{"text":") = 0 for all ","element":"span"},{"style":{"height":13.2},"width":107.3,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-3.png","element":"img","alt":" x ∈ S","inline":true},{"text":", and for all ","element":"span"},{"style":{"height":20.8},"width":529.54,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-4.png","element":"img","alt":" x /∈ S, τ j(x) = τ ′j(x). Thus,","inline":true}],[{"style":{"width":"51%"},"width":971,"height":219,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-5.png","element":"img"}],[{"text":"Conversely, suppose that ","element":"span"},{"style":{"height":20.8},"width":903.7,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-6.png","element":"img","alt":" rπj(τ j) = rπj(τ ′j). Let τ j = τ c,γ and τ ′j = τ c′,γ′","inline":true,"padRight":true},{"text":"as in Definition ","element":"span"},{"href":"#id-59","text":"5.1","element":"a"},{"text":". We ","element":"span"},{"text":"now have the following cases:","element":"span"}],[{"text":"1. Case 1: ","element":"span"},{"style":{"height":20.8},"width":1040.13,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-7.png","element":"img","alt":" c = c′. Then τ j(x) = τ ′j(x) for all x ∈ X − {c}. Hence,","inline":true}],[{"style":{"width":"43%"},"width":819,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-8.png","element":"img"}],[{"text":"This implies that either ","element":"span"},{"style":{"height":20.8},"width":237.22,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-9.png","element":"img","alt":" τ j(c) = τ ′j(c","inline":true},{"text":"), and thus ","element":"span"},{"style":{"height":20.8},"width":543,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-10.png","element":"img","alt":" τ j(x) = τ ′j(x) for all x ∈ X","inline":true},{"text":", or otherwise ","element":"span"},{"style":{"height":17.6},"width":67.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-11.png","element":"img","alt":"π(c","inline":true},{"text":") = 0, in which case we still have ","element":"span"},{"style":{"height":18.81},"width":177.61,"height":47.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-12.png","element":"img","alt":" τ j ∼=πj τ ′j ","inline":true,"padRight":true},{"text":"(since the two policies agree every outside the ","element":"span"},{"text":"set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"{","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"style":{"fontStyle":"italic"},"text":"}","element":"span"},{"text":").","element":"span"}],[{"text":"2. Case 2: ","element":"span"},{"style":{"height":16.8},"width":123.04,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-13.png","element":"img","alt":" c ̸= c′","inline":true},{"text":". We assume assume without loss of generality that ","element":"span"},{"style":{"height":14.8},"width":221.48,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-14.png","element":"img","alt":" c′ < c ≤ C","inline":true},{"text":". Since the policies ","element":"span"},{"style":{"height":17.73},"width":299.76,"height":44.33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-15.png","element":"img","alt":" τ c′,1 and τ c′+1,0","inline":true,"padRight":true},{"text":"are identity for ","element":"span"},{"style":{"height":13.2},"width":122.27,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-16.png","element":"img","alt":" c′ < C","inline":true},{"text":", we may also assume without loss of generality that ","element":"span"},{"style":{"height":17.6},"width":135.47,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-17.png","element":"img","alt":" γ′ ∈ [0,","inline":true,"padRight":true},{"text":"1). Thus for all ","element":"span"},{"style":{"height":20.8},"width":936.62,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-18.png","element":"img","alt":" x ∈ S := {c′, c′ + 1, . . . , C}, we have τ ′j(x) < τ j(x","inline":true},{"text":"). This implies ","element":"span"},{"text":"that","element":"span"}],[{"style":{"width":"37%"},"width":697,"height":292,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/24-19.png","element":"img"}],[{"style":{"width":"84%"},"width":1580,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-0.png","element":"img"}],[{"text":"Next, we show that ","element":"span"},{"style":{"height":10.62},"width":42.69,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-1.png","element":"img","alt":" rπ","inline":true,"padRight":true},{"text":"is a bijection from ","element":"span"},{"style":{"height":17.6},"width":563.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-2.png","element":"img","alt":" Tthresh(π) → [0, 1]. That rπ","inline":true,"padRight":true},{"text":"is injective follows immediately from the fact if ","element":"span"},{"style":{"height":20.8},"width":615.87,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-3.png","element":"img","alt":" rπj(τ) = rπj(τ ′j), then τ j ∼=πj τ ′j","inline":true},{"text":". To show it is surjective, we exhibit ","element":"span"},{"text":"for every ","element":"span"},{"style":{"height":17.6},"width":131.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-4.png","element":"img","alt":" β ∈ [0,","inline":true,"padRight":true},{"text":"1] a threshold policy ","element":"span"},{"style":{"height":19.98},"width":538.04,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-5.png","element":"img","alt":" τ c,γ for which rπj(τ c,γ) = β","inline":true},{"text":". We may assume ","element":"span"},{"style":{"height":16.4},"width":234.44,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-6.png","element":"img","alt":" β < 1, since","inline":true,"padRight":true},{"text":"the all-ones policy has a selection rate of 1.","element":"span"}],[{"text":"Recall the definition of the inverse CDF","element":"span"}],[{"style":{"width":"36%"},"width":679,"height":128,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-7.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":26.16},"width":1452.3,"height":65.41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-8.png","element":"img","alt":" β < 1, Qj(β) ≤ C. Let β+ = �Cx=Qj(β) π(x), and let β− = �Cx=Qj(β)+1 π(x","inline":true},{"text":"). Note that by ","element":"span"},{"text":"definition, we have ","element":"span"},{"style":{"height":18.33},"width":745.51,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-9.png","element":"img","alt":" β− ≤ β < β+, and β+ − β− = π(Qj(β","inline":true},{"text":")). Hence, if we define ","element":"span"},{"style":{"height":26.99},"width":293.82,"height":67.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-10.png","element":"img","alt":" γ = β−β−β+−β− , we","inline":true,"padRight":true},{"text":"have","element":"span"}],[{"style":{"width":"84%"},"width":1590,"height":143,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-11.png","element":"img"}],[{"id":"id-36","style":{"fontWeight":"bold"},"text":"A.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Lemma ","element":"span"},{"href":"#id-37","style":{"fontWeight":"bold"},"text":"5.2","element":"a"}],[{"text":"Given ","element":"span"},{"style":{"height":19.53},"width":200.15,"height":48.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-12.png","element":"img","alt":" τ ∈ [0, 1]C","inline":true},{"text":", we define the ","element":"span"},{"style":{"height":17.6},"width":518.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-13.png","element":"img","alt":" normal cone at τ as NC(τ","inline":true},{"text":") := ConicalHull","element":"span"},{"style":{"height":19.53},"width":406.9,"height":48.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-14.png","element":"img","alt":"{z : τ + z ∈ [0, 1]C}.","inline":true,"padRight":true},{"text":"We can describe NC(","element":"span"},{"style":{"height":8},"width":27,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-15.png","element":"img","alt":"τ","inline":true},{"text":") explicitly as:","element":"span"}],[{"style":{"width":"55%"},"width":1046,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-16.png","element":"img"}],[{"text":"Immediately from the above definition, we have the following useful identity, which is that for any vector ","element":"span"},{"style":{"height":18.73},"width":150.63,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-17.png","element":"img","alt":" g ∈ RC,","inline":true}],[{"id":"id-60","style":{"width":"87%"},"width":1646,"height":184,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-18.png","element":"img"}],[{"text":"Now consider the optimization problem (","element":"span"},{"href":"#id-38","text":"12","element":"a"},{"text":"). By the first order KKT conditions, we know that for any optimizer ","element":"span"},{"style":{"height":10.62},"width":45.6,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-19.png","element":"img","alt":" τ ∗","inline":true,"padRight":true},{"text":"of the above objective, there exists some ","element":"span"},{"style":{"height":13.2},"width":110.79,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-20.png","element":"img","alt":"�λ ∈ R","inline":true,"padRight":true},{"text":"such that, for all ","element":"span"},{"style":{"height":17.6},"width":225.14,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-21.png","element":"img","alt":" z ∈ NC(τ ∗)","inline":true}],[{"style":{"width":"24%"},"width":461,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-22.png","element":"img"}],[{"text":"By (","element":"span"},{"href":"#id-60","text":"22","element":"a"},{"text":"), we must have that","element":"span"}],[{"style":{"width":"46%"},"width":870,"height":189,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-23.png","element":"img"}],[{"text":"Now ","element":"span"},{"style":{"height":17.6},"width":89.5,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-24.png","element":"img","alt":" τ ∗(x","inline":true},{"text":") is not necessarily a threshold policy. To conclude the theorem, it suffices to exhibit a threshold policy ","element":"span"},{"style":{"height":17.6},"width":483.39,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-25.png","element":"img","alt":" �τ ∗ such that τ ∗(x) ∼=π �τ ∗","inline":true},{"text":". (Note that ","element":"span"},{"style":{"height":17.6},"width":89.5,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-26.png","element":"img","alt":" �τ ∗(x","inline":true},{"text":") will also be feasible for the constraint, and have the same objective value; hence ","element":"span"},{"style":{"height":10.62},"width":45.6,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/25-27.png","element":"img","alt":" �τ ∗","inline":true,"padRight":true},{"text":"will be optimal as well.)","element":"span"}],[{"text":"Given ","element":"span"},{"style":{"height":17.6},"width":968.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-0.png","element":"img","alt":" τ ∗ and �λ, let c∗ = min{c ∈ X : v(x) + �λw(x) ≥ 0}","inline":true},{"text":". If either (a) ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") = 0 for all ","element":"span"},{"style":{"height":12.8},"width":115.54,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-1.png","element":"img","alt":" x ∈ X","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") is strictly increasing or (b) ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") is strictly increasing, then the modified policy","element":"span"}],[{"style":{"width":"27%"},"width":510,"height":184,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-2.png","element":"img"}],[{"text":"is a threshold policy, and ","element":"span"},{"style":{"height":17.6},"width":238.13,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-3.png","element":"img","alt":" τ ∗(x) ∼=π �τ ∗","inline":true},{"text":". Moreover, ","element":"span"},{"style":{"height":17.6},"width":898.14,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-4.png","element":"img","alt":" ⟨w, �τ ∗⟩ = ⟨w, �τ ∗⟩ and ⟨π, �τ ∗⟩ = ⟨π, �τ ∗⟩, which","inline":true,"padRight":true},{"text":"implies that ","element":"span"},{"style":{"height":10.62},"width":45.6,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-5.png","element":"img","alt":" �τ ∗","inline":true,"padRight":true},{"text":"is an optimal policy for the objective in Lemma ","element":"span"},{"href":"#id-37","text":"5.2","element":"a"},{"text":".","element":"span"}],[{"id":"id-42","style":{"fontWeight":"bold"},"text":"A.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Lemma ","element":"span"},{"href":"#id-43","style":{"fontWeight":"bold"},"text":"5.3","element":"a"}],[{"text":"We shall prove","element":"span"}],[{"style":{"width":"65%"},"width":1228,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-6.png","element":"img"}],[{"text":"where the derivative is with respect to ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-7.png","element":"img","alt":" β","inline":true},{"text":". The computation of the left-derivative is analogous. Since we are concerned with right-derivatives, we shall take ","element":"span"},{"style":{"height":23.21},"width":713.91,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-8.png","element":"img","alt":" β ∈ [0, 1). Since πj ◦ r−1πj (β) does not","inline":true,"padRight":true},{"text":"depend on the choice of representative for ","element":"span"},{"style":{"height":23.21},"width":64.25,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-9.png","element":"img","alt":" r−1πj ","inline":true,"padRight":true},{"text":", we can choose a cannonical representation for ","element":"span"},{"style":{"height":23.21},"width":78.18,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-10.png","element":"img","alt":" r−1πj .","inline":true,"padRight":true},{"text":"In Section ","element":"span"},{"href":"#id-34","text":"A.1","element":"a"},{"text":", we saw that the threshold policy ","element":"span"},{"style":{"height":15.71},"width":186.26,"height":39.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-11.png","element":"img","alt":" τ Qj(β),γ(β)","inline":true,"padRight":true},{"text":"had acceptance rate ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-12.png","element":"img","alt":" β","inline":true},{"text":", where we had defined","element":"span"}],[{"style":{"width":"70%"},"width":1329,"height":263,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-13.png","element":"img"}],[{"text":"Note then that for each ","element":"span"},{"style":{"height":20.92},"width":281.6,"height":52.29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-14.png","element":"img","alt":" x, τ Qj(β),γ(β)(x","inline":true},{"text":") is piece-wise linear, and thus admits left and right derivatives. We first claim that","element":"span"}],[{"id":"id-61","style":{"width":"72%"},"width":1362,"height":53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-15.png","element":"img"}],[{"text":"To see this, note that Q","element":"span"},{"style":{"height":18.32},"width":53.98,"height":45.81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-16.png","element":"img","alt":"j(β","inline":true},{"text":") is right continuous, so for all ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-17.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"sufficiently small, Q","element":"span"},{"style":{"height":18.32},"width":327.5,"height":45.81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-18.png","element":"img","alt":"j(β + ϵ) = Qj(β).","inline":true,"padRight":true},{"text":"Hence, for all ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-19.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"sufficiently small and all ","element":"span"},{"style":{"height":20.91},"width":1093.89,"height":52.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-20.png","element":"img","alt":" x ̸= Q(β), we have τ Qj(β+ϵ),γ(β+ϵ)(x) = τ Qj(β+ϵ),γ(β+ϵ)(x),","inline":true,"padRight":true},{"text":"as needed. Thus, Equation (","element":"span"},{"href":"#id-61","text":"26","element":"a"},{"text":") implies that ","element":"span"},{"style":{"height":23.21},"width":244.27,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-21.png","element":"img","alt":" ∂+πj ◦ r−1πj (β","inline":true},{"text":") is supported on ","element":"span"},{"style":{"height":18.33},"width":171.04,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-22.png","element":"img","alt":" x = Qj(β","inline":true},{"text":"), and hence","element":"span"}],[{"style":{"width":"54%"},"width":1022,"height":68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-23.png","element":"img"}],[{"text":"To conclude, we must show that ","element":"span"},{"style":{"height":26.7},"width":541.98,"height":66.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-24.png","element":"img","alt":" ∂+πj(x)τ Qj(β),γ(β)(x)��x=Qj(β) ","inline":true,"padRight":true},{"text":"= 1. To show this, we have","element":"span"}],[{"style":{"width":"68%"},"width":1288,"height":266,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/26-25.png","element":"img"}],[{"style":{"width":"46%"},"width":878,"height":67,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-0.png","element":"img"}]]},{"heading":"B Characterization of Fairness Solutions","paragraphs":[[{"id":"id-50","style":{"fontWeight":"bold"},"text":"B.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Derivative Computation for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"}],[{"text":"In this section, we prove Lemma ","element":"span"},{"href":"#id-62","text":"6.1","element":"a"},{"text":", which we recall below.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 6.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose that ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"> ","element":"span"},{"text":"0 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"style":{"fontStyle":"italic"},"text":". Then the function","element":"span"}],[{"style":{"width":"27%"},"width":523,"height":61,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-1.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"is a bijection from ","element":"span"},{"text":"[0","element":"span"},{"style":{"height":18.32},"width":333.05,"height":45.81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-2.png","element":"img","alt":", 1] to [0, ⟨πj, w⟩].","inline":true}],[{"text":"We will prove Lemma ","element":"span"},{"href":"#id-62","text":"6.1 ","element":"a"},{"text":"in tandem with the following derivative computation which we applied in the proof of Theorem ","element":"span"},{"href":"#id-63","text":"6.2","element":"a"},{"text":".","element":"span"}],[{"id":"id-53","style":{"fontWeight":"bold"},"text":"Lemma B.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The function","element":"span"}],[{"style":{"width":"30%"},"width":577,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-3.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"is concave in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"style":{"fontStyle":"italic"},"text":"and has left and right derivatives","element":"span"}],[{"style":{"width":"70%"},"width":1323,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-4.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof of Lemmas ","element":"span"},{"href":"#id-62","style":{"fontStyle":"italic"},"text":"6.1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"href":"#id-53","style":{"fontStyle":"italic"},"text":"B.1","element":"a"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"Consider a ","element":"span"},{"style":{"height":23.21},"width":522.68,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-5.png","element":"img","alt":" β ∈ [0, 1]. Then, πj ◦ r−1πj (β","inline":true},{"text":") is continuous and left and ","element":"span"},{"text":"right differentiable by Lemma ","element":"span"},{"href":"#id-43","text":"5.3","element":"a"},{"text":", and its left and right derivatives are indicator vectors ","element":"span"},{"style":{"height":20.11},"width":191.34,"height":50.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-6.png","element":"img","alt":" eQj(β) and","inline":true},{"style":{"height":20.28},"width":121.68,"height":50.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-7.png","element":"img","alt":"eQ+j (β)","inline":true},{"text":", respectively. Consequently, ","element":"span"},{"style":{"height":23.21},"width":416.11,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-8.png","element":"img","alt":" β �→ ⟨wj, πj ◦ r−1πj (β)⟩","inline":true,"padRight":true},{"text":"has left and right derivatives ","element":"span"},{"style":{"height":18.33},"width":177.35,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-9.png","element":"img","alt":" wj(Q(β))","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":19.46},"width":170.72,"height":48.66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-10.png","element":"img","alt":" wj(Q+(β","inline":true},{"text":")), respectively; both of which are both strictly positive by the assumption ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"> ","element":"span"},{"text":"0. Hence, ","element":"span"},{"style":{"height":23.21},"width":513,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-11.png","element":"img","alt":" Tj,wj(β) = ⟨wj, πj ◦ r−1πj (β)⟩","inline":true,"padRight":true},{"text":"is strictly increasing in ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-12.png","element":"img","alt":" β","inline":true},{"text":", and so the map is injective. It is also ","element":"span"},{"text":"surjective because ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-13.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"= 0 induces the policy ","element":"span"},{"style":{"height":17.93},"width":263,"height":44.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-14.png","element":"img","alt":" τ j = 0 and β","inline":true,"padRight":true},{"text":"= 1 induces the policy ","element":"span"},{"style":{"height":18.33},"width":271.14,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-15.png","element":"img","alt":" τ j = 1 (up to","inline":true},{"style":{"height":13.13},"width":43.39,"height":32.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-16.png","element":"img","alt":"πj","inline":true},{"text":"-measure zero). Hence, ","element":"span"},{"style":{"height":19.98},"width":126.59,"height":49.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-17.png","element":"img","alt":" Tj,wj(β","inline":true},{"text":") is an order preserving bijection with left- and right-derivatives, and we can compute the left and right derivatives of its inverse as follows:","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":664,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/27-18.png","element":"img"}],[{"style":{"height":23.21},"width":735.02,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-0.png","element":"img","alt":"∂+Uj(rπj(T −1j,wj(t1))) ≥ ∂−Uj(rπj(T −1j,wj(t2","inline":true},{"text":"))), and that for all ","element":"span"},{"style":{"height":23.21},"width":800.57,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-1.png","element":"img","alt":" t, ∂+Uj(rπj(T −1j,wj(t))) ≤ ∂−Uj(rπj(T −1j,wj(t))).","inline":true,"padRight":true},{"text":"These facts establish that the mapping ","element":"span"},{"style":{"height":23.21},"width":327.38,"height":58.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-2.png","element":"img","alt":" t �→ Uj(rπj(T −1j,wj(t","inline":true},{"text":"))) is concave.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"B.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Characterizations Under Soft Constraints","element":"span"}],[{"text":"Given a convex penalty Φ : ","element":"span"},{"style":{"height":17.42},"width":448.48,"height":43.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-3.png","element":"img","alt":" R → R≥0, and λ ∈ R≥0","inline":true},{"text":", one can write down the general form for soft constrained utility optimization","element":"span"}],[{"id":"id-64","style":{"width":"79%"},"width":1483,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-4.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.25},"width":234.43,"height":38.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-5.png","element":"img","alt":" wA and wB","inline":true,"padRight":true},{"text":"represent generic constraints. ","element":"span"},{"text":"Again, we shall assume that for ","element":"span"},{"style":{"height":17.6},"width":218.96,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-6.png","element":"img","alt":" j ∈ {A, B},","inline":true},{"style":{"height":18.33},"width":200.89,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-7.png","element":"img","alt":"u(x)/wj(x","inline":true},{"text":") is non-decreasing. Recall that for ","element":"span"},{"style":{"height":18.33},"width":286.13,"height":45.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-8.png","element":"img","alt":" wj = (1, 1, . . . ,","inline":true,"padRight":true},{"text":"1), one recovers the soft version of ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity","element":"span"},{"text":", whereas for ","element":"span"},{"style":{"height":24.3},"width":200.76,"height":60.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-9.png","element":"img","alt":" wj = ρ⟨ρ,πj⟩","inline":true},{"text":", one recovers the soft constrained version of ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":".","element":"span"}],[{"text":"The same argument presented in Section ","element":"span"},{"href":"#id-41","text":"6.2 ","element":"a"},{"text":"shows that the optimal policies are of the form","element":"span"}],[{"style":{"width":"19%"},"width":366,"height":62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-10.png","element":"img"}],[{"text":"where (","element":"span"},{"style":{"height":14.4},"width":98.48,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-11.png","element":"img","alt":"tA, tB","inline":true},{"text":") are solutions to the following optimization problem:","element":"span"}],[{"style":{"width":"88%"},"width":1653,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-12.png","element":"img"}],[{"text":"The following lemma gives us a first order characterization of these optimal TPRs, (","element":"span"},{"style":{"height":17.6},"width":129.03,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-13.png","element":"img","alt":"tA, tB).","inline":true}],[{"id":"id-67","style":{"fontWeight":"bold"},"text":"Lemma B.2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"All optimal policies are equivalent to threshold policies with selection rate ","element":"span"},{"text":"(","element":"span"},{"style":{"height":17.6},"width":134.91,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-14.png","element":"img","alt":"βA, βB)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"which satisfy","element":"span"}],[{"style":{"width":"81%"},"width":1528,"height":175,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-15.png","element":"img"}],[{"style":{"height":18.22},"width":850.59,"height":45.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-16.png","element":"img","alt":"where ∆ = tA − tB = TA,wA(βA) − TB,wB(βB).","inline":true}],[{"style":{"height":17.6},"width":267.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-17.png","element":"img","alt":"Proof. Let ∂(·","inline":true},{"text":") denote the super-gradient set of a concave function. Note that if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"F ","element":"span"},{"text":"is left-and-right differentiable and concave, then ","element":"span"},{"style":{"height":17.6},"width":486.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-18.png","element":"img","alt":" ∂F(x) = [∂+F(x), ∂−F(x","inline":true},{"text":")]. By concavity of ","element":"span"},{"style":{"height":17.13},"width":39.31,"height":42.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-19.png","element":"img","alt":" Uj","inline":true,"padRight":true},{"text":"and convexity of Φ, we must have that","element":"span"}],[{"style":{"width":"97%"},"width":1823,"height":551,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/28-20.png","element":"img"}],[{"style":{"width":"80%"},"width":1511,"height":401,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-0.png","element":"img"}],[{"text":"Substituting ∆ = ","element":"span"},{"style":{"height":18.22},"width":601.28,"height":45.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-1.png","element":"img","alt":" tA − tB = TA,wA(βA) − TB,wB(βB","inline":true},{"text":") concludes the proof.","element":"span"}],[{"text":"In general, a closed form solution for the soft constrained problem may be difficult to state. However, for the case of Φ(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":") = ","element":"span"},{"style":{"fontStyle":"italic"},"text":"|","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"style":{"fontStyle":"italic"},"text":"|","element":"span"},{"text":", we can state an explicit closed form solution:","element":"span"}],[{"id":"id-66","style":{"fontWeight":"bold"},"text":"Proposition B.1 ","element":"span"},{"text":"(Special case of Φ(","element":"span"},{"style":{"height":22.93},"width":940.01,"height":57.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-2.png","element":"img","alt":"t) = |t|). Let Φ(t) = |t|, fix λ, and let [βλ,−A , βλ,+A ]","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denote the interval of optimal selection rates for Equation ","element":"span"},{"text":"(","element":"span"},{"href":"#id-64","text":"27","element":"a"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"with regularization ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-3.png","element":"img","alt":" λ","inline":true},{"style":{"fontStyle":"italic"},"text":". Finally, suppose that for any optimal ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"style":{"fontStyle":"italic"},"text":"selection rates ","element":"span"},{"text":"(","element":"span"},{"style":{"height":19.65},"width":1159.81,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-4.png","element":"img","alt":"βMaxUtilA , βMaxUtilB ), one has TA,wA(βMaxUtilA ) < TB,wB(βMaxUtilB ).","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"text":"[","element":"span"},{"style":{"height":20.98},"width":142.04,"height":52.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-5.png","element":"img","alt":"β−A , β+A ]","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denote the optimal loan rates in ","element":"span"},{"text":"(","element":"span"},{"href":"#id-64","text":"27","element":"a"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":". Then there exists a ","element":"span"},{"style":{"height":15.02},"width":42.45,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-6.png","element":"img","alt":" λ∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that, for ","element":"span"},{"style":{"height":15.6},"width":146.85,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-7.png","element":"img","alt":" λ ≥ λ∗,","inline":true,"padRight":true},{"text":"[","element":"span"},{"style":{"height":20.98},"width":142.04,"height":52.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-8.png","element":"img","alt":"β−A , β+A ]","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"coincides with the hard constrained solution. Moreover, for ","element":"span"},{"style":{"height":17.6},"width":555.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-9.png","element":"img","alt":" λ < λ∗, any β ∈ [0, 1] satifies","inline":true}],[{"style":{"width":"41%"},"width":777,"height":235,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-10.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Given a set of optimal constraint values (","element":"span"},{"style":{"height":18.22},"width":567.34,"height":45.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-11.png","element":"img","alt":"tA, tB) = (TA,wA(βA), TB,wB(βB","inline":true},{"text":")) for optimal selection rates (","element":"span"},{"style":{"height":16.4},"width":116.33,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-12.png","element":"img","alt":"βA, βB","inline":true},{"text":") for a given parameter ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-13.png","element":"img","alt":" λ","inline":true},{"text":". By Proposition ","element":"span"},{"href":"#id-65","text":"B.2 ","element":"a"},{"text":"below, it follows that if ","element":"span"},{"style":{"height":15.24},"width":204.35,"height":38.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-14.png","element":"img","alt":" tA = tB for","inline":true,"padRight":true},{"text":"all optimal solutions, then for all ","element":"span"},{"style":{"height":14.8},"width":120.82,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-15.png","element":"img","alt":" λ′ ≥ λ","inline":true},{"text":", all optimal solutions must also have ","element":"span"},{"style":{"height":14.04},"width":150.85,"height":35.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-16.png","element":"img","alt":" tA = tB.","inline":true}],[{"text":"Hence, it suffices to show that (a) there exists a finite ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-17.png","element":"img","alt":" λ","inline":true,"padRight":true},{"text":"such that all solutions must have ","element":"span"},{"style":{"height":14.04},"width":137.27,"height":35.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-18.png","element":"img","alt":"tA = tB","inline":true},{"text":", and (b) if ","element":"span"},{"style":{"height":16.8},"width":137.27,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-19.png","element":"img","alt":" tA ̸= tB","inline":true},{"text":", then the display in (","element":"span"},{"href":"#id-66","text":"B.1","element":"a"},{"text":") holds.","element":"span"}],[{"text":"To prove (a) and (b), suppose ","element":"span"},{"style":{"height":16.8},"width":137.27,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-20.png","element":"img","alt":" tA ̸= tB","inline":true},{"text":". By Proposition ","element":"span"},{"href":"#id-65","text":"B.2 ","element":"a"},{"text":"below and the fact that ","element":"span"},{"style":{"height":19.35},"width":331.55,"height":48.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-21.png","element":"img","alt":" TA,wA(βMaxUtil) <","inline":true},{"style":{"height":19.65},"width":614.22,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-22.png","element":"img","alt":"TB,wB(βMaxUtilB ), we have tA < tB","inline":true},{"text":". Moreover we can compute that","element":"span"}],[{"style":{"width":"24%"},"width":462,"height":184,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-23.png","element":"img"}],[{"text":"it follows from the first order condition in Lemma ","element":"span"},{"href":"#id-67","text":"B.2 ","element":"a"},{"text":"that, if ","element":"span"},{"style":{"height":16.8},"width":137.27,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-24.png","element":"img","alt":" tA ̸= tB","inline":true}],[{"id":"id-68","style":{"width":"73%"},"width":1371,"height":116,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-25.png","element":"img"}],[{"text":"which immediately implies point (b). Point (a) follows from the above display by noting that, since ","element":"span"},{"style":{"height":18.32},"width":665.76,"height":45.81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-26.png","element":"img","alt":"wj(x) > 0 and u(x) < ∞ for all x","inline":true},{"text":", where exists a ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-27.png","element":"img","alt":" λ","inline":true,"padRight":true},{"text":"sufficiently large such that (","element":"span"},{"href":"#id-68","text":"29","element":"a"},{"text":") cannot hold for any ","element":"span"},{"style":{"height":16.4},"width":61.25,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/29-28.png","element":"img","alt":" βA.","inline":true}],[{"id":"id-70","style":{"fontWeight":"bold"},"text":"B.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Qualitative Behavior of Soft Constraints","element":"span"}],[{"text":"We now present a proposition which formalizes the intuition that soft constraints interpolate between ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"and the general hard constraint (","element":"span"},{"href":"#id-48","text":"18","element":"a"},{"text":") in Section ","element":"span"},{"href":"#id-41","text":"6.2 ","element":"a"},{"text":"(for arbitrary ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":", not just for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt","element":"span"},{"text":"). Because optimal policies may not be unique, we define the solution sets","element":"span"}],[{"style":{"width":"64%"},"width":1206,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-0.png","element":"img"}],[{"text":"with the set ","element":"span"},{"style":{"height":17.6},"width":95.27,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-1.png","element":"img","alt":" P(∞","inline":true},{"text":") denoting the set of solutions to (","element":"span"},{"href":"#id-48","text":"18","element":"a"},{"text":").","element":"span"}],[{"text":"At a high level, we parameterize the soft constrained solution in terms of the value of the constraint ","element":"span"},{"style":{"height":17.6},"width":485.37,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-2.png","element":"img","alt":" tA = ⟨τ A, wA ◦ πA⟩ for A","inline":true,"padRight":true},{"text":"and the difference in constraint values ∆ = ","element":"span"},{"style":{"height":17.6},"width":312.7,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-3.png","element":"img","alt":" ⟨τ A, wA ◦ πA⟩ −","inline":true},{"style":{"height":17.6},"width":737.38,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-4.png","element":"img","alt":"⟨τ B, wB ◦ πB⟩, where (τ A, τ B) ∈ P(λ","inline":true},{"text":"). We show that ","element":"span"},{"style":{"height":14.04},"width":38.76,"height":35.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-5.png","element":"img","alt":" tA","inline":true,"padRight":true},{"text":"interpolates between the value of the constraint on ","element":"span"},{"style":{"height":12.8},"width":125.34,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-6.png","element":"img","alt":" A at λ","inline":true,"padRight":true},{"text":"= 0 and at ","element":"span"},{"style":{"height":12.8},"width":131.59,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-7.png","element":"img","alt":" λ = ∞","inline":true},{"text":", and that ∆ interpolates between the difference at ","element":"span"},{"style":{"height":12.8},"width":109.6,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-8.png","element":"img","alt":" λ = 0","inline":true,"padRight":true},{"text":"(","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":") and at ∆ = 0 at ","element":"span"},{"style":{"height":12.8},"width":135.54,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-9.png","element":"img","alt":" λ = ∞","inline":true},{"text":". To be rigorous, we note that the possible values for ","element":"span"},{"style":{"height":15.24},"width":127.33,"height":38.1,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-10.png","element":"img","alt":" tA and","inline":true,"padRight":true},{"text":"∆ for each ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-11.png","element":"img","alt":" λ","inline":true,"padRight":true},{"text":"are actually contiguous intervals. Hence, to make the interpolation precise, we define the following partial order on such intervals:","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition B.1 ","element":"span"},{"text":"(Interval order)","element":"span"},{"style":{"height":15.6},"width":221.02,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-12.png","element":"img","alt":". Let S1, S2","inline":true,"padRight":true},{"text":"be two intervals. We say that ","element":"span"},{"style":{"height":17.6},"width":428.59,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-13.png","element":"img","alt":" S1 ≺ S2 if max {x ∈","inline":true},{"style":{"height":17.6},"width":738.3,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-14.png","element":"img","alt":"S1 } < min {x ∈ S2} and S1 ⪯ S2","inline":true,"padRight":true},{"text":"if both max ","element":"span"},{"style":{"height":17.6},"width":855.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-15.png","element":"img","alt":" {x ∈ S1} ≤ max {x ∈ S2} and min{x ∈","inline":true},{"style":{"height":17.6},"width":424.74,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-16.png","element":"img","alt":"S1} ≤ min {x ∈ S2}","inline":true},{"text":". We say that an interval-valued function ","element":"span"},{"style":{"height":17.6},"width":645.15,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-17.png","element":"img","alt":" S(λ) is non-decreasing (resp. non","inline":true},{"style":{"height":17.6},"width":1304.54,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-18.png","element":"img","alt":"increasing) in λ if S(λ) ⪯ S(λ′) (resp S(λ′) ⪯ S(λ′) for λ ≤ λ′).","inline":true}],[{"text":"In these terms, the interpolation of the soft constraints can be stated as follows:","element":"span"}],[{"id":"id-65","style":{"fontWeight":"bold"},"text":"Proposition B.2 ","element":"span"},{"text":"(Soft constraints interpolate between ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"and hard constrained solution)","element":"span"},{"style":{"fontWeight":"bold"},"text":".","element":"span"}],[{"style":{"width":"89%"},"width":1668,"height":208,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-19.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"are closed intervals. Moreover,","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"1. In all cases, ","element":"span"},{"text":"lim","element":"span"},{"style":{"height":17.6},"width":522.83,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-20.png","element":"img","alt":"λ→∞ max{|∆| ∈ D(λ)} = 0.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"2. If ","element":"span"},{"text":"0 ","element":"span"},{"style":{"height":17.6},"width":145.93,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-21.png","element":"img","alt":" ∈ D(λ)","inline":true},{"style":{"fontStyle":"italic"},"text":", then there exists a ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"style":{"fontStyle":"italic"},"text":"solution satisfying ","element":"span"},{"text":"(","element":"span"},{"href":"#id-48","text":"18","element":"a"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Thus, for all ","element":"span"},{"style":{"height":15.6},"width":139.29,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-22.png","element":"img","alt":" λ > 0,","inline":true},{"style":{"height":17.6},"width":276.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-23.png","element":"img","alt":"P(λ) = P(∞).","inline":true}],[{"style":{"fontStyle":"italic"},"text":"3. If ","element":"span"},{"style":{"height":17.6},"width":680.13,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-24.png","element":"img","alt":" D(λ) ≺ {0}, then D(λ) and TA(λ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are non-decreasing on ","element":"span"},{"style":{"height":17.6},"width":208.17,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-25.png","element":"img","alt":" λ ∈ (0, ∞]","inline":true},{"style":{"fontStyle":"italic"},"text":", and vice versa if ","element":"span"},{"style":{"height":17.6},"width":230.9,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-26.png","element":"img","alt":"D(λ) ≻ {0}.","inline":true}],[{"style":{"height":17.6},"width":1820.29,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-27.png","element":"img","alt":"4. If D(λ) ≺ {0}, then {0} = D(∞) ⪰ D(λ) ⪰ {min : ∆ ∈ D(0)}, and TA(∞) ⪰ TA(λ) ⪰ {min :","inline":true,"padRight":true},{"text":"∆ ","element":"span"},{"style":{"fontStyle":"italic"},"text":"∈ T","element":"span"},{"style":{"height":17.6},"width":694.09,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-28.png","element":"img","alt":"A(λ)}, and vice versa if D(λ) ≻ {0}.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"B.3.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-65","style":{"fontWeight":"bold"},"text":"B.2","element":"a"}],[{"text":"Again, we parameterize all solutions to the soft-constrained problem as in correspondence with solutions (","element":"span"},{"style":{"height":17.6},"width":170.54,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-29.png","element":"img","alt":"tA, tB) to","inline":true}],[{"style":{"width":"51%"},"width":958,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/30-30.png","element":"img"}],[{"text":"Letting ∆ := ","element":"span"},{"style":{"height":14.04},"width":132.42,"height":35.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-0.png","element":"img","alt":" tB − tA","inline":true},{"text":", we can reparameterize the above as","element":"span"}],[{"style":{"width":"50%"},"width":948,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-1.png","element":"img"}],[{"text":"Note then that ","element":"span"},{"style":{"height":17.6},"width":77.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-2.png","element":"img","alt":" D(λ","inline":true},{"text":") denotes the set of ∆ which are partial maximimizers of the above display. If 0 ","element":"span"},{"style":{"height":17.6},"width":183.43,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-3.png","element":"img","alt":" ∈ {D(λ)}","inline":true},{"text":", this implies that there exists a ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"solution for which ∆ = 0, therefore, for all ","element":"span"},{"style":{"height":13.2},"width":72.53,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-4.png","element":"img","alt":"λ >","inline":true,"padRight":true},{"text":"0, all solutions will be ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"solutions for which ","element":"span"},{"style":{"height":17.6},"width":77.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-5.png","element":"img","alt":" D(λ","inline":true},{"text":") = 0. Otherwise assume without loss of generality that ","element":"span"},{"style":{"height":17.6},"width":229.9,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-6.png","element":"img","alt":" D(λ) < {0}.","inline":true}],[{"text":"First, the statement ","element":"span"},{"style":{"height":17.6},"width":1402.7,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-7.png","element":"img","alt":" {0} = D(∞) ⪰ D(λ) ⪰ {min : ∆ ∈ D(0)}, and TA(∞) ⪰ TA(λ) ⪰ {min :","inline":true,"padRight":true},{"text":"∆ ","element":"span"},{"style":{"height":17.6},"width":172.82,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-8.png","element":"img","alt":" ∈ TA(λ)}","inline":true},{"text":", and vice versa if ","element":"span"},{"style":{"height":17.6},"width":221.83,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-9.png","element":"img","alt":" D(λ) ≻ {0}","inline":true,"padRight":true},{"text":"can be solved by on a case-by-case basis. The strategy is to show that if any of these inequalities are violated, then the associated values of ∆ and ","element":"span"},{"style":{"height":14.04},"width":113.64,"height":35.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-10.png","element":"img","alt":" tA are","inline":true,"padRight":true},{"text":"not partial maximizers of the soft constraint objective. In particular, ","element":"span"},{"style":{"height":17.6},"width":508.06,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-11.png","element":"img","alt":" TA(λ) ⊂ [T−, T+] for some","inline":true,"padRight":true},{"text":"appropriate ","element":"span"},{"style":{"height":15.82},"width":139.07,"height":39.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-12.png","element":"img","alt":" T−, T+.","inline":true}],[{"text":"We now show that ","element":"span"},{"style":{"height":17.6},"width":292.39,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-13.png","element":"img","alt":" D(λ) and TA(λ","inline":true},{"text":") are non-increasing and non-decreasing, respectively. We shall do so invoking the following technical lemma.","element":"span"}],[{"id":"id-69","style":{"height":17.6},"width":479.26,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-14.png","element":"img","alt":"Lemma B.3. Let G1(t)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be concave and let ","element":"span"},{"style":{"height":17.6},"width":147.81,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-15.png","element":"img","alt":" G2(t; λ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be concave in ","element":"span"},{"style":{"height":17.6},"width":318.25,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-16.png","element":"img","alt":" t. Let ∂G2(t; λ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denote the super-gradient of ","element":"span"},{"style":{"height":15.6},"width":203.84,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-17.png","element":"img","alt":" G2, that is","inline":true}],[{"style":{"width":"99%"},"width":1870,"height":377,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-18.png","element":"img"}],[{"text":"For ","element":"span"},{"style":{"height":17.6},"width":77.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-19.png","element":"img","alt":" D(λ","inline":true},{"text":"), one can write any partial maximizer ∆ as","element":"span"}],[{"style":{"width":"22%"},"width":429,"height":69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-20.png","element":"img"}],[{"text":"with ","element":"span"},{"style":{"height":18.07},"width":1282.06,"height":45.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-21.png","element":"img","alt":" G1(∆) = maxtA gAUA(tA; wA)+gBUB(tA +∆; wB) and G2(∆; λ) = λ","inline":true},{"text":"Φ(∆). Note that ","element":"span"},{"style":{"height":17.6},"width":164.97,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-22.png","element":"img","alt":" G1(∆) is","inline":true,"padRight":true},{"text":"concave, being the partial maximization of a concave function, and ","element":"span"},{"style":{"height":17.6},"width":330.25,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-23.png","element":"img","alt":" ∂G2(∆; λ) = −t∂","inline":true},{"text":"Φ(∆). Since ","element":"span"},{"style":{"height":17.6},"width":412.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-24.png","element":"img","alt":"∂Φ(∆) ⪰ {0} for ∆ ≥","inline":true,"padRight":true},{"text":"0 (by convexity of ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-25.png","element":"img","alt":" φ","inline":true},{"text":") , we have that ","element":"span"},{"style":{"height":17.6},"width":326.85,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-26.png","element":"img","alt":" ∂G2(∆; λ) = −t∂","inline":true},{"text":"Φ(∆) is non-increasing in ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-27.png","element":"img","alt":" λ","inline":true},{"text":". Hence Lemma ","element":"span"},{"href":"#id-69","text":"B.3 ","element":"a"},{"text":"implies that interval valued function ","element":"span"},{"style":{"height":17.6},"width":77.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-28.png","element":"img","alt":" D(λ","inline":true},{"text":") is non-increasing.","element":"span"}],[{"text":"To show that ","element":"span"},{"style":{"height":17.6},"width":91.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-29.png","element":"img","alt":" TA(λ","inline":true},{"text":") is non-decreasing, we have that any maximizer ","element":"span"},{"style":{"height":14.04},"width":38.76,"height":35.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-30.png","element":"img","alt":" tA","inline":true,"padRight":true},{"text":"can be written as","element":"span"}],[{"style":{"width":"28%"},"width":527,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-31.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":18.3},"width":1357.2,"height":45.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-32.png","element":"img","alt":" G1(tA) = gAUA(tA; wA) and G2(tA; λ) = max∆≥0 gBUB(tA + ∆; wB) + λ","inline":true},{"text":"Φ(∆). By Danskin’s theorem,","element":"span"}],[{"style":{"width":"58%"},"width":1096,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-33.png","element":"img"}],[{"text":"Note that ","element":"span"},{"style":{"height":17.6},"width":468.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-34.png","element":"img","alt":" {∆ ∈ arg max G2(tA; λ)}","inline":true,"padRight":true},{"text":"is non-increasing in ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-35.png","element":"img","alt":" λ","inline":true,"padRight":true},{"text":"for a fixed ","element":"span"},{"style":{"height":14.04},"width":38.76,"height":35.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-36.png","element":"img","alt":" tA","inline":true},{"text":", since the contribution of the regularizer increases. Since the sets ","element":"span"},{"style":{"height":17.6},"width":307.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/31-37.png","element":"img","alt":" ∂UB(tA + ∆; wB","inline":true},{"text":") are themselves non-increasing in ∆ by concavity, we conclude that ","element":"span"},{"style":{"height":17.6},"width":181.52,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-0.png","element":"img","alt":" ∂G2(tA; λ","inline":true},{"text":") is non-decreasing in ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-1.png","element":"img","alt":" λ","inline":true},{"text":". Hence, Lemma ","element":"span"},{"href":"#id-69","text":"B.3 ","element":"a"},{"text":"implies that ","element":"span"},{"style":{"height":17.6},"width":91.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-2.png","element":"img","alt":"TA(λ","inline":true},{"text":") is non-decreasing in ","element":"span"},{"style":{"height":12.8},"width":37.46,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-3.png","element":"img","alt":" λ.","inline":true}],[{"text":"Finally, to show that max","element":"span"},{"style":{"height":17.6},"width":416,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-4.png","element":"img","alt":"{|∆| : ∆ ∈ D(λ)|} →","inline":true,"padRight":true},{"text":"0, Note that the left and right derivatives of ","element":"span"},{"style":{"height":17.6},"width":532.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-5.png","element":"img","alt":"gAUA(t; wA) and gBUB(t; wB","inline":true},{"text":") are upper bounded by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"whereas, since Φ is strictly convex, we know that for every ","element":"span"},{"style":{"height":17.6},"width":759.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-6.png","element":"img","alt":" ϵ > 0, min{|∂+Φ(∆)|, |∂−Φ(∆)|} > m(ϵ","inline":true},{"text":") for all ∆ : ","element":"span"},{"style":{"height":17.6},"width":142.07,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-7.png","element":"img","alt":" |∆| > ϵ","inline":true},{"text":". Hence, the first order optimality conditions cannot be satisfied for ","element":"span"},{"style":{"height":25.12},"width":870.97,"height":62.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-8.png","element":"img","alt":" |∆| > ϵ, and λ > Mm(ϵ), so as λ → ∞, |∆| → 0.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof of Lemma ","element":"span"},{"href":"#id-69","style":{"fontStyle":"italic"},"text":"B.3","element":"a"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"We prove the case where ","element":"span"},{"style":{"height":17.6},"width":156.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-9.png","element":"img","alt":" ∂G2(t; λ","inline":true},{"text":") is non-increasing. The first order conditions requires that at an optimal ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", one has","element":"span"}],[{"style":{"width":"50%"},"width":952,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-10.png","element":"img"}],[{"text":"where the super-gradients are amended to take into account boundary conditions. Suppose that for the sake of contradiction that for ","element":"span"},{"style":{"height":17.6},"width":543.14,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-11.png","element":"img","alt":" λ′ > λ, MAX(λ′) ⪯ MAX(λ","inline":true},{"text":") fails. Then, there (a) exists a ","element":"span"},{"style":{"height":17.6},"width":220.13,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-12.png","element":"img","alt":"t ∈ MAX(λ","inline":true},{"text":") such that ","element":"span"},{"style":{"height":17.6},"width":687.22,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-13.png","element":"img","alt":" {t} ≺ MAX(λ′), or (b) t ∈ MAX(λ′","inline":true},{"text":") such that ","element":"span"},{"style":{"height":17.6},"width":284.07,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-14.png","element":"img","alt":" {t} ≻ MAX(λ′","inline":true},{"text":"). Note that if ","element":"span"},{"style":{"height":17.6},"width":281.46,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-15.png","element":"img","alt":" {t} ≺ MAX(λ′","inline":true},{"text":"), it must be the case that","element":"span"}],[{"style":{"width":"27%"},"width":518,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-16.png","element":"img"}],[{"text":"By assumption, ","element":"span"},{"style":{"height":17.6},"width":512.87,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-17.png","element":"img","alt":" ∂−G2(t; λ′)+ ≤ ∂0G2(t; λ)+","inline":true,"padRight":true},{"text":", which implies","element":"span"}],[{"style":{"width":"55%"},"width":1034,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-18.png","element":"img"}],[{"text":"a contradiction.","element":"span"}]]},{"heading":"C Proofs of Main Results","paragraphs":[[{"text":"We remark that the proofs in this section rely crucially on the characterizations of the optimal fairness-constrained policies developed in Section ","element":"span"},{"text":"6","element":"span"},{"text":". We first define the notion of CDF domination, which is referred to in a few of the proofs. Intuitively, it means that for any score, the fraction of group ","element":"span"},{"text":"B ","element":"span"},{"text":"above this is higher than that for group ","element":"span"},{"text":"A","element":"span"},{"text":". It is realistic to assume this if we keep with our convention that group ","element":"span"},{"text":"A ","element":"span"},{"text":"is the disadvantaged group relative to group ","element":"span"},{"text":"B","element":"span"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition C.1 ","element":"span"},{"text":"(CDF domination)","element":"span"},{"style":{"height":10.84},"width":91,"height":27.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-19.png","element":"img","alt":". πA","inline":true,"padRight":true},{"text":"is said to be ","element":"span"},{"style":{"height":19.05},"width":793.03,"height":47.63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-20.png","element":"img","alt":" dominated by πB if ∀a ≥ 1, �x>a πA <","inline":true},{"style":{"height":19.05},"width":173.12,"height":47.63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-21.png","element":"img","alt":"�x>a πB","inline":true},{"text":". We denote this as ","element":"span"},{"style":{"height":12.44},"width":182.12,"height":31.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-22.png","element":"img","alt":" πA ≺ πB.","inline":true}],[{"text":"We remark that the ","element":"span"},{"style":{"height":10.4},"width":34,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-23.png","element":"img","alt":" ≺","inline":true,"padRight":true},{"text":"notation in this section is entirely unrelated to the the partial order on intervals from Section ","element":"span"},{"href":"#id-70","text":"B.3","element":"a"},{"text":". Frequently, we shall use the following lemma:","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Lemma C.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose that ","element":"span"},{"style":{"height":12.44},"width":183.02,"height":31.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-24.png","element":"img","alt":" πA ≺ πB","inline":true},{"style":{"fontStyle":"italic"},"text":". Then, for all ","element":"span"},{"style":{"height":16.4},"width":121.64,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-25.png","element":"img","alt":" β > 0","inline":true},{"style":{"fontStyle":"italic"},"text":", it holds that ","element":"span"},{"text":"Q","element":"span"},{"style":{"height":17.6},"width":369.02,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-26.png","element":"img","alt":"A(β) ≤ QB(β) and","inline":true}],[{"style":{"width":"22%"},"width":420,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-27.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"The fact that Q","element":"span"},{"style":{"height":17.6},"width":245.16,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-28.png","element":"img","alt":"A(β) ≤ QB(β","inline":true},{"text":") follows directly from the definition of monotonicty of ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u ","element":"span"},{"text":"implies that ","element":"span"},{"style":{"height":17.6},"width":436.38,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/32-29.png","element":"img","alt":" u(QA(β)) ≤ u(QB(β)).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"C.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-5","style":{"fontWeight":"bold"},"text":"3.1","element":"a"}],[{"text":"The ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"policy for group ","element":"span"},{"text":"j ","element":"span"},{"text":"solves the optimization","element":"span"}],[{"style":{"width":"35%"},"width":664,"height":84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-0.png","element":"img"}],[{"text":"Computing left and right derivatives of this objective yields","element":"span"}],[{"style":{"width":"59%"},"width":1116,"height":61,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-1.png","element":"img"}],[{"text":"By concavity, solutions ","element":"span"},{"style":{"height":16.8},"width":182.12,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-2.png","element":"img","alt":" β∗ satisfy","inline":true}],[{"id":"id-71","style":{"width":"63%"},"width":1197,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-3.png","element":"img"}],[{"text":"Therefore, we conclude that the ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"policy loans only to scores ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"s.t. ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"> ","element":"span"},{"text":"0, which implies ","element":"span"},{"style":{"height":17.6},"width":153.98,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-4.png","element":"img","alt":"∆(x) >","inline":true,"padRight":true},{"text":"0 for all scores loaned to. Therefore we must have that 0 ","element":"span"},{"style":{"height":17.93},"width":239.04,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-5.png","element":"img","alt":" ≤ ∆µMaxUtil","inline":true},{"text":". By definition ∆","element":"span"},{"style":{"height":17.93},"width":307.8,"height":44.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-6.png","element":"img","alt":"µMaxUtil ≤ ∆µ∗.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"C.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Corollary ","element":"span"},{"href":"#id-33","style":{"fontWeight":"bold"},"text":"3.2","element":"a"}],[{"text":"We begin with proving part (a), which gives conditions under which ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"cases relative improvement. Recall that ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-7.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"is the largest selection rate for which ","element":"span"},{"style":{"height":19.65},"width":351.91,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-8.png","element":"img","alt":" U(β) = U(βMaxUtilA","inline":true,"padRight":true},{"text":"). First, we derive a condition which bounds the selection rate ","element":"span"},{"style":{"height":22.48},"width":179.41,"height":56.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-9.png","element":"img","alt":" βDemParityA","inline":true,"padRight":true},{"text":"from below. Fix an acceptance rate ","element":"span"},{"style":{"height":19.65},"width":842.49,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-10.png","element":"img","alt":"β such that βMaxUtilA < β < min{βMaxUtilB , β}","inline":true},{"text":". By Theorem ","element":"span"},{"href":"#id-47","text":"6.1","element":"a"},{"text":", we have that ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"selects to group ","element":"span"},{"text":"A ","element":"span"},{"text":"with rate higher than ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-11.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"as long as","element":"span"}],[{"style":{"width":"24%"},"width":468,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-12.png","element":"img"}],[{"text":"By (","element":"span"},{"href":"#id-71","text":"30","element":"a"},{"text":") and the monotonicity of ","element":"span"},{"style":{"height":17.6},"width":973.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-13.png","element":"img","alt":" u, u(QA(β)) < 0 and u(QB(β)) > 0, so 0 < g1 < 1.","inline":true}],[{"text":"Next, we derive a condition which bounds the selection rate ","element":"span"},{"style":{"height":22.48},"width":179.41,"height":56.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-14.png","element":"img","alt":" βDemParityA","inline":true,"padRight":true},{"text":"from above. ","element":"span"},{"text":"First, consider the case that ","element":"span"},{"style":{"height":19.65},"width":1449.08,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-15.png","element":"img","alt":" βMaxUtilB < β, and fix β′ such that βMaxUtilB < β′ < β. Then DemParity selects","inline":true,"padRight":true},{"text":"group ","element":"span"},{"style":{"height":16.4},"width":370.17,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-16.png","element":"img","alt":" A at a rate βA < β′ ","inline":true,"padRight":true},{"text":"for any proportion ","element":"span"},{"style":{"height":12},"width":43.81,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-17.png","element":"img","alt":" gA","inline":true},{"text":". This follows from applying Theorem ","element":"span"},{"href":"#id-47","text":"6.1 ","element":"a"},{"text":"since we have that ","element":"span"},{"href":"#id-71","style":{"height":20.98},"width":792.06,"height":52.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-18.png","element":"img","alt":" u(Q+A(β′)) < 0 and u(Q+B(β′)) < 0 by (30","inline":true},{"text":") and the monotonicity of ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":".","element":"span"}],[{"text":"Instead, in the case that ","element":"span"},{"style":{"height":19.65},"width":1308.48,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-19.png","element":"img","alt":" βMaxUtilB > β, fix β′ such that β < β′ < βMaxUtilB . Then DemParity","inline":true,"padRight":true},{"text":"selects group ","element":"span"},{"text":"A ","element":"span"},{"text":"at a rate less than ","element":"span"},{"style":{"height":16.8},"width":239.67,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-20.png","element":"img","alt":" β′ as long as","inline":true}],[{"style":{"width":"25%"},"width":485,"height":142,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-21.png","element":"img"}],[{"text":"By (","element":"span"},{"href":"#id-71","text":"30","element":"a"},{"text":") and the monotonicity of ","element":"span"},{"style":{"height":17.6},"width":1225.52,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-22.png","element":"img","alt":" u, 0 < g0 < g1. Thus for gA ∈ [g0, g1], the DemParity selection","inline":true,"padRight":true},{"text":"rate for group ","element":"span"},{"text":"A ","element":"span"},{"text":"is bounded between ","element":"span"},{"style":{"height":16.8},"width":566.24,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-23.png","element":"img","alt":" β and β′, and thus DemParity","inline":true,"padRight":true},{"text":"results in relative improvement.","element":"span"}],[{"text":"Next, we prove part (b), which gives conditions under which ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"cases relative improvement. First, we derive a condition which bounds the selection rate ","element":"span"},{"style":{"height":22.48},"width":111.7,"height":56.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-24.png","element":"img","alt":" βEqOptA","inline":true,"padRight":true},{"text":"from below. Fix an acceptance rate ","element":"span"},{"style":{"height":21.25},"width":968.83,"height":53.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/33-25.png","element":"img","alt":" β such that βMaxUtilA < β and βMaxUtilB > G(A→B)(β","inline":true},{"text":"). By Theorem ","element":"span"},{"href":"#id-63","text":"6.2","element":"a"},{"text":", ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"selects group ","element":"span"},{"text":"A","element":"span"}],[{"text":"at a rate higher than ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-0.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"as long as","element":"span"}],[{"style":{"width":"43%"},"width":821,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-1.png","element":"img"}],[{"text":"By (","element":"span"},{"href":"#id-71","text":"30","element":"a"},{"text":") and the monotonicity of ","element":"span"},{"style":{"height":20.33},"width":1069.58,"height":50.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-2.png","element":"img","alt":" u, u(QA(β)) < 0 and u(QB(G(A→B)(β))) > 0, so g3 > 0.","inline":true}],[{"text":"Next, we derive a condition which bounds the selection rate ","element":"span"},{"style":{"height":22.48},"width":111.7,"height":56.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-3.png","element":"img","alt":" βEqOptA","inline":true,"padRight":true},{"text":"from above. First, consider the case that there exists ","element":"span"},{"style":{"height":21.25},"width":1360.09,"height":53.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-4.png","element":"img","alt":" β′ such that β′ < β and βMaxUtilB < G(A→B)(β′) . Then EqOpt selects","inline":true,"padRight":true},{"text":"group ","element":"span"},{"text":"A ","element":"span"},{"text":"at a rate less than this ","element":"span"},{"style":{"height":16.8},"width":249.33,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-5.png","element":"img","alt":" β′ for any gA","inline":true},{"text":". This follows from Theorem ","element":"span"},{"href":"#id-63","text":"6.2 ","element":"a"},{"text":"since we have that ","element":"span"},{"href":"#id-71","style":{"height":21.57},"width":967.69,"height":53.93,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-6.png","element":"img","alt":"u(Q+A(β′)) < 0 and u(Q+B(G(A→B)(β′))) < 0 by (30","inline":true},{"text":") and the monotonicity of ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":".","element":"span"}],[{"text":"In the other case, fix ","element":"span"},{"href":"#id-71","style":{"height":21.25},"width":1001.84,"height":53.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-7.png","element":"img","alt":" β′ such that β < β′ < β and βMaxUtilB > G(A→B)(β′","inline":true},{"text":"). By Theorem ","element":"span"},{"href":"#id-63","text":"6.2","element":"a"},{"text":", ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"selects group ","element":"span"},{"text":"A ","element":"span"},{"text":"at a rate lower than ","element":"span"},{"style":{"height":16.8},"width":239.68,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-8.png","element":"img","alt":" β′ as long as","inline":true}],[{"style":{"width":"45%"},"width":853,"height":142,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-9.png","element":"img"}],[{"text":"By (","element":"span"},{"href":"#id-71","text":"30","element":"a"},{"text":") and the monotonicity of ","element":"span"},{"style":{"height":17.6},"width":913.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-10.png","element":"img","alt":" u, 0 < g2 < g3. Thus for gA ∈ [g2, g3], the EqOpt","inline":true,"padRight":true},{"text":"selection rate for group ","element":"span"},{"text":"A ","element":"span"},{"text":"is bounded between ","element":"span"},{"style":{"height":16.8},"width":486.7,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-11.png","element":"img","alt":" β and β′, and thus EqOpt","inline":true,"padRight":true},{"text":"results in relative improvement.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"C.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Corollary ","element":"span"},{"href":"#id-30","style":{"fontWeight":"bold"},"text":"3.3","element":"a"}],[{"text":"Recall our assumption that ","element":"span"},{"style":{"height":19.65},"width":630.63,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-12.png","element":"img","alt":" β > βMaxUtilA and βMaxUtilB > β.","inline":true,"padRight":true},{"text":"As argued in the above proof of Corollary ","element":"span"},{"href":"#id-33","text":"3.2","element":"a"},{"text":", by (","element":"span"},{"href":"#id-71","text":"30","element":"a"},{"text":") and the monotonicity of ","element":"span"},{"style":{"height":17.6},"width":679.78,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-13.png","element":"img","alt":" u, u(QA(β)) < 0 and u(QB(β)) >","inline":true,"padRight":true},{"text":"0. Applying Theorem ","element":"span"},{"href":"#id-47","text":"6.1","element":"a"},{"text":", ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"selects at a higher rate than ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-14.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"for any population proportion ","element":"span"},{"style":{"height":15.2},"width":164.92,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-15.png","element":"img","alt":" gA ≤ g0,","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":27.65},"width":606.59,"height":69.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-16.png","element":"img","alt":" g0 = 1/(1 − u(QA(β))u(QB(β))) ∈ (0, 1).","inline":true,"padRight":true},{"text":"In particular, if ","element":"span"},{"style":{"height":16.4},"width":144.38,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-17.png","element":"img","alt":" β = β0","inline":true},{"text":", which we defined as the harm threshold (i.e. ∆","element":"span"},{"style":{"height":21.3},"width":197.26,"height":53.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-18.png","element":"img","alt":"µA(r−1πA(β0","inline":true},{"text":")) = 0 and ∆","element":"span"},{"style":{"height":12.19},"width":53.88,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-19.png","element":"img","alt":"µA ","inline":true,"padRight":true},{"text":"is decreasing at ","element":"span"},{"style":{"height":16.4},"width":41.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-20.png","element":"img","alt":" β0","inline":true},{"text":"), then by the concavity of ∆","element":"span"},{"style":{"height":12.19},"width":131.84,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-21.png","element":"img","alt":"µA, we","inline":true,"padRight":true},{"text":"have that ∆","element":"span"},{"style":{"height":23.41},"width":417.05,"height":58.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-22.png","element":"img","alt":"µA(r−1πA(βDemParityA )) <","inline":true,"padRight":true},{"text":"0, that is, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"causes active harm.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"C.4 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Corollary ","element":"span"},{"href":"#id-31","style":{"fontWeight":"bold"},"text":"3.4","element":"a"}],[{"text":"By Theorem ","element":"span"},{"href":"#id-63","text":"6.2","element":"a"},{"text":", ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"selects at a higher rate than ","element":"span"},{"style":{"height":16.4},"width":26,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-23.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"for any population proportion ","element":"span"},{"style":{"height":15.2},"width":168.55,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-24.png","element":"img","alt":" gA ≤ g0,","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":31.12},"width":739.38,"height":77.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-25.png","element":"img","alt":" g0 = 1/(1 − 1κ · ρ(QB(G(A→B)(β)))u(QB(G(A→B)(β)))u(QA(β))ρ(QA(β))","inline":true},{"text":"). Using our assumptions ","element":"span"},{"style":{"height":21.25},"width":497.32,"height":53.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-26.png","element":"img","alt":" βMaxUtilB > G(A→B)(β) and","inline":true},{"style":{"height":19.65},"width":230.72,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-27.png","element":"img","alt":"β > βMaxUtilA","inline":true,"padRight":true},{"text":", we have that ","element":"span"},{"href":"#id-71","style":{"height":20.33},"width":936.41,"height":50.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-28.png","element":"img","alt":" u(QB(G(A→B)(β))) > 0 and u(QA(β)) < 0, by (30","inline":true},{"text":") and the monotonicity of ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":". This verifies that ","element":"span"},{"style":{"height":17.6},"width":143.86,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-29.png","element":"img","alt":" g0 ∈ (0,","inline":true,"padRight":true},{"text":"1). In particular, if ","element":"span"},{"style":{"height":16.4},"width":126.85,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-30.png","element":"img","alt":" β = β0","inline":true},{"text":", then by the concavity of ∆","element":"span"},{"style":{"height":16.59},"width":231.64,"height":41.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-31.png","element":"img","alt":"µA, we have","inline":true,"padRight":true},{"text":"that ∆","element":"span"},{"style":{"height":23.41},"width":349.3,"height":58.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-32.png","element":"img","alt":"µA(r−1πA(βEqOptA )) <","inline":true,"padRight":true},{"text":"0, that is, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"causes active harm.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"C.5 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Corollary ","element":"span"},{"href":"#id-4","style":{"fontWeight":"bold"},"text":"3.5","element":"a"}],[{"text":"Applying Theorem ","element":"span"},{"href":"#id-47","text":"6.1","element":"a"},{"text":", we have","element":"span"}],[{"style":{"width":"52%"},"width":986,"height":99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/34-33.png","element":"img"}],[{"text":"Applying Theorem ","element":"span"},{"href":"#id-63","text":"6.2","element":"a"},{"text":", we have:","element":"span"}],[{"style":{"width":"89%"},"width":1679,"height":116,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-0.png","element":"img"}],[{"text":"By Corollaries ","element":"span"},{"href":"#id-30","text":"3.3 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-31","text":"3.4","element":"a"},{"text":", choosing ","element":"span"},{"style":{"height":27.65},"width":1131.69,"height":69.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-1.png","element":"img","alt":" gA < g2 := 1/(1 − u(QA(β))u(QB(β))) and gA > g1 := 1/(1 − 1κ ·","inline":true}],[{"style":{"width":"99%"},"width":1866,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-2.png","element":"img"}],[{"text":"to verify this.","element":"span"}],[{"style":{"width":"96%"},"width":1801,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-3.png","element":"img"}],[{"style":{"height":16.4},"width":41.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-4.png","element":"img","alt":"β0","inline":true},{"text":", then by the concavity of ∆","element":"span"},{"style":{"height":12.19},"width":53.89,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-5.png","element":"img","alt":"µA","inline":true},{"text":", we have that ∆","element":"span"},{"style":{"height":23.41},"width":358.38,"height":58.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-6.png","element":"img","alt":"µA(r−1πA(βEqOptA )) >","inline":true,"padRight":true},{"text":"0, that is, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"causes improvement, and ∆","element":"span"},{"style":{"height":23.41},"width":417.05,"height":58.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-7.png","element":"img","alt":"µA(r−1πA(βDemParityA )) <","inline":true,"padRight":true},{"text":"0, that is, ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"causes active harm.","element":"span"}],[{"style":{"width":"96%"},"width":1802,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-8.png","element":"img"}],[{"text":"∆","element":"span"},{"style":{"height":23.41},"width":349.3,"height":58.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-9.png","element":"img","alt":"µA(r−1πA(βEqOptA )) >","inline":true,"padRight":true},{"text":"0, using the concavity of the outcome curve.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Lemma C.2 ","element":"span"},{"text":"(Comparison of ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"selection rates)","element":"span"},{"style":{"height":17.6},"width":598.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-10.png","element":"img","alt":". Fix β ∈ [0, 1]. Suppose πA, πB","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are identical up to a translation with ","element":"span"},{"style":{"height":13.79},"width":167.54,"height":34.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-11.png","element":"img","alt":" µA < µB","inline":true},{"style":{"fontStyle":"italic"},"text":". Also assume ","element":"span"},{"style":{"height":17.6},"width":85.61,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-12.png","element":"img","alt":" ρ(x)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is affine in ","element":"span"},{"style":{"height":27.64},"width":410.92,"height":69.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-13.png","element":"img","alt":" x. Denote κ = ⟨ρ,πB⟩⟨ρ,πA⟩.","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Then,","element":"span"}],[{"id":"id-73","style":{"width":"99%"},"width":1869,"height":542,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-14.png","element":"img"}],[{"text":"Further, using ","element":"span"},{"style":{"height":20.33},"width":297.39,"height":50.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-15.png","element":"img","alt":" G(A→B)(β) > β","inline":true,"padRight":true},{"text":"from lemma ","element":"span"},{"href":"#id-72","text":"C.3 ","element":"a"},{"text":"and the fact that ","element":"span"},{"style":{"height":27.65},"width":68.63,"height":69.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-16.png","element":"img","alt":"u(x)ρ(x) ","inline":true,"padRight":true},{"text":"is increasing in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":", we","element":"span"}],[{"text":"have ","element":"span"},{"style":{"height":31.12},"width":492.44,"height":77.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-17.png","element":"img","alt":"u(QB(G(A→B)(β)))ρ(QB(G(A→B)(β))) < u(QB(β))ρ(QB(β))","inline":true},{"text":". Therefore, ","element":"span"},{"style":{"height":29.02},"width":1029.21,"height":72.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-18.png","element":"img","alt":" u(QB(G(A→B)(β))) · κ · ρ(QA(β0))ρ(QB(G(A→B)(β0))) < κ · u(QB(β))ρ(QB(β)) ·","inline":true},{"style":{"height":17.6},"width":386.43,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-19.png","element":"img","alt":"ρ(QA(β)) < u(QB(β","inline":true},{"text":")) where the last inequality follows from (","element":"span"},{"href":"#id-73","text":"31","element":"a"},{"text":").","element":"span"}],[{"id":"id-74","style":{"width":"1%"},"width":30,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-20.png","element":"img"}],[{"text":"We use the following technical lemma in the proof of the above lemma.","element":"span"}],[{"id":"id-72","style":{"height":16.4},"width":462.65,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-21.png","element":"img","alt":"Lemma C.3. If πA, πB","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"that are identical up to a translation with ","element":"span"},{"style":{"height":16.59},"width":280.99,"height":41.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-22.png","element":"img","alt":" µA < µB, then","inline":true}],[{"id":"id-75","style":{"width":"67%"},"width":1270,"height":185,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-23.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"For (","element":"span"},{"href":"#id-74","text":"32","element":"a"},{"text":"), observe that TPR","element":"span"},{"style":{"height":17.6},"width":765.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-24.png","element":"img","alt":"A = ρ(µA) < TPRB = ρ(µB). For any β","inline":true},{"text":", we can write Q","element":"span"},{"style":{"height":17.6},"width":131.63,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-25.png","element":"img","alt":"B(β) =","inline":true},{"style":{"height":17.6},"width":1086.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/35-26.png","element":"img","alt":"µB + c and QA(β) = µA + c for some c, since πA, πB","inline":true,"padRight":true},{"text":"that are identical up to translation by ","element":"span"},{"style":{"height":12.19},"width":185.28,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-0.png","element":"img","alt":"µA − µB.","inline":true,"padRight":true},{"text":"Thus, by computation, we can see that for Q(","element":"span"},{"style":{"height":20.33},"width":712.43,"height":50.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-1.png","element":"img","alt":"β) < µ, ∂+G(A→B)(β) > 1 and for","inline":true,"padRight":true},{"text":"Q(","element":"span"},{"style":{"height":20.33},"width":815.61,"height":50.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-2.png","element":"img","alt":"β) < µ, ∂+G(A→B)(β) < 1. Since G(A→B) ","inline":true,"padRight":true},{"text":"is monotonically increasing on [0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1], we must have ","element":"span"},{"style":{"height":20.33},"width":663.98,"height":50.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-3.png","element":"img","alt":"G(A→B)(β) > β for every β ∈ [0, 1].","inline":true}],[{"text":"For (","element":"span"},{"href":"#id-75","text":"33","element":"a"},{"text":"), we have ","element":"span"},{"style":{"height":21.05},"width":264.67,"height":52.63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-4.png","element":"img","alt":" β > �x>µ πA","inline":true},{"text":", we can again write Q","element":"span"},{"style":{"height":17.6},"width":756.5,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-5.png","element":"img","alt":"B(β) = µB − c and QA(β) = µA − c, for","inline":true,"padRight":true},{"text":"some ","element":"span"},{"style":{"fontStyle":"italic"},"text":"c > ","element":"span"},{"text":"0. Then it is clear than we have ","element":"span"},{"style":{"height":27.65},"width":219.67,"height":69.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-6.png","element":"img","alt":"µBµA < QB(β)QA(β).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"C.6 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Corollary ","element":"span"},{"href":"#id-32","style":{"fontWeight":"bold"},"text":"3.6","element":"a"}],[{"style":{"width":"99%"},"width":1869,"height":221,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-7.png","element":"img"}],[{"text":"We now give a very simple example of ","element":"span"},{"style":{"height":12.44},"width":175.06,"height":31.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-8.png","element":"img","alt":" πA ≺ πB","inline":true,"padRight":true},{"text":"where Theorem 3.5 holds. The construction of the example exemplifies the more general idea of using large in-group inequality in group ","element":"span"},{"text":"A ","element":"span"},{"text":"to skew the true positive rate at ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil","element":"span"},{"text":", making TPR","element":"span"},{"style":{"height":18.73},"width":578.86,"height":46.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-9.png","element":"img","alt":"A(τ MaxUtil) > TPRB(τ MaxUtil).","inline":true}],[{"style":{"width":"100%"},"width":1878,"height":226,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"C.7 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-76","style":{"fontWeight":"bold"},"text":"4.1","element":"a"}],[{"text":"Denote the upper quantile function under ","element":"span"},{"style":{"height":17.6},"width":1033.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-11.png","element":"img","alt":" �π as �Q. Since �π ≺ π, we have �Q(β) ≤ Q(β). The","inline":true,"padRight":true},{"text":"conclusion follows for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"MaxUtil ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"DemParity ","element":"span"},{"text":"from Theorem ","element":"span"},{"href":"#id-47","text":"6.1 ","element":"a"},{"text":"by the monotonicity of ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u","element":"span"},{"text":".","element":"span"}],[{"text":"If we have that TPR","element":"span"},{"style":{"height":17.6},"width":413.78,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-12.png","element":"img","alt":"A(τ) > �TPRA(τ) ∀ τ","inline":true},{"text":", that is, the true TPR dominates estimated TPR, the conclusion for ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"EqOpt ","element":"span"},{"text":"follows from Theorem ","element":"span"},{"href":"#id-63","text":"6.2","element":"a"},{"text":", by the same argument as in the proof of Corollary ","element":"span"},{"href":"#id-32","text":"3.6","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"C.8 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-77","style":{"fontWeight":"bold"},"text":"4.2","element":"a"}],[{"text":"By Proposition ","element":"span"},{"href":"#id-46","text":"5.3","element":"a"},{"text":", ","element":"span"},{"style":{"height":19.79},"width":427.86,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-13.png","element":"img","alt":" β∗ = argmaxβ ∆µA(β","inline":true},{"text":") exists and is unique. ","element":"span"},{"style":{"height":19.65},"width":565.06,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-14.png","element":"img","alt":" β0 = max{β ∈ [βMaxUtilA , 1] :","inline":true},{"style":{"height":19.65},"width":493.18,"height":49.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-15.png","element":"img","alt":"U(βMaxUtilA ) − UA(β) ≤ δ}","inline":true,"padRight":true},{"text":"which exists and is unique, by the continuity of ∆","element":"span"},{"style":{"height":12.19},"width":53.89,"height":30.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1803.04383/images/36-16.png","element":"img","alt":"µA","inline":true,"padRight":true},{"text":"and Proposition ","element":"span"},{"href":"#id-46","text":"5.3","element":"a"},{"text":".","element":"span"}]]}],"_version":"3.3.2"},"paperNode":"$28:props:children:props:children:0:props:product"}]]