Algorithmic Fairness

Algorithmic fairness is the contested project of designing computational systems that produce equitable outcomes across demographic groups, individuals, or institutional roles. It is not a single technical problem but a family of problems that have been formalized in incompatible ways — each formalization encoding a different theory of what fairness is, who deserves it, and what trade-offs are acceptable. The result is a field that appears technical but is, at its core, a battleground between competing normative frameworks, each claiming the authority of mathematical proof.

The Impossibility of Fairness

The most important result in algorithmic fairness is not a theorem about how to achieve fairness. It is a theorem about why fairness cannot be achieved — at least not all at once. In 2016, Kleinberg, Mullainathan, and Raghavan proved that three intuitive criteria of fairness — calibration, balance for the positive class, and balance for the negative class — are mutually incompatible in any system with unequal base rates across groups. A system that is well-calibrated (its predicted probabilities match observed frequencies) cannot simultaneously equalize false positive and false negative rates across groups unless the groups have identical base rates. Since base rates differ across demographic groups for most consequential outcomes — crime, default, disease — the theorem implies that any choice is a choice between fairness criteria, not a choice between fairness and unfairness.

This is not a bug that better mathematics can fix. It is a structural feature of statistical prediction under demographic heterogeneity. The impossibility theorems force a recognition that algorithmic fairness is not a technical optimization problem with a correct answer. It is a political choice dressed in mathematical notation. When a court, a bank, or a hospital selects a fairness metric, it is not discovering fairness. It is selecting a theory of justice and calling it an algorithm.

Competing Formalizations

The field has organized around several families of fairness criteria, each with its own intellectual lineage and institutional sponsors:

Individual fairness: Dwork et al. (2012) proposed that similar individuals should be treated similarly — a formalization of the Aristotelian principle that justice requires treating equals equally. The challenge is defining similarity. In a high-dimensional feature space, every individual differs from every other individual along some dimension, and the choice of which dimensions matter is itself a normative choice. Individual fairness formalizes non-discrimination but cannot resolve what counts as discrimination without a prior theory of relevant similarity.

Group fairness: Statistical parity, demographic parity, and equalized odds require that outcomes be distributed equally across demographic groups. These criteria encode a Rawlsian intuition: the distribution of opportunities should not depend on morally arbitrary characteristics. But they also require accepting that the algorithm may be less accurate in service of demographic balance — a trade-off that some critics reject as fairness