Contagion Models

Contagion models are mathematical frameworks for describing how a quantity — disease, information, financial stress, social behavior, or failure — spreads through a network of connected nodes. The field emerged from epidemiology but has been absorbed by economics, sociology, and complexity science, each of which adapted its core machinery for different substrates while retaining the foundational insight: the trajectory of spread depends as much on the topology of the network as on the intrinsic properties of what is spreading.

The canonical contagion model is the SIR (Susceptible-Infected-Recovered) framework introduced by Kermack and McKendrick in 1927. In SIR, a population is partitioned into three compartments. Susceptibles become infected at a rate proportional to the number of contacts with infected individuals; infected individuals recover at a fixed rate. The model produces a single key threshold — the basic reproduction number R₀ — which determines whether contagion spreads exponentially (R₀ > 1) or dies out (R₀ < 1). This threshold has become one of the most translated concepts in mathematical epidemiology, for better and for worse.

Network Structure and the Failure of Mean-Field Models

The SIR framework and its variants are mean-field models: they assume that every individual has equal probability of contact with every other individual. This assumption is computationally convenient and empirically false. Real contact networks are heterogeneous — they have heavy-tailed degree distributions (some nodes have vastly more connections than average), community structure (dense clusters with sparse inter-cluster links), and temporal dynamics (contact patterns change over time in ways that correlate with the contagion itself).

The consequences of heterogeneity for contagion dynamics are not marginal corrections to the mean-field result — they are qualitative changes. In a network with a power-law degree distribution (such as many social networks and the internet), the epidemic threshold can approach zero: contagion can spread even when R₀ in the mean-field sense would predict extinction. This is because highly connected hubs serve as super-spreaders, sustaining transmission even when average connectivity is low. The superspreader phenomenon, extensively documented in COVID-19 transmission data, is not an anomaly — it is the expected consequence of heterogeneous contact networks.

More perniciously: mean-field models, when fit to early outbreak data from heterogeneous networks, will systematically overestimate R₀ during the initial phase (when infections are concentrated in high-degree nodes) and underestimate the plateau phase (when the susceptible population becomes depleted in high-risk groups). Epidemiological forecasts built on SIR variants consistently made both errors during the COVID-19 pandemic. This was not a failure of epidemiologists — it was the predictable consequence of using mean-field approximations in a regime where network heterogeneity dominates dynamics.

Financial Contagion and the Correlation Problem

The application of contagion models to financial systems introduced a complication that epidemiology does not face: endogenous correlation. In disease spread, the causal structure is clear — infection propagates from infected to susceptible through physical contact. In financial networks, the causal structure is murkier. Banks fail because their assets lose value; assets lose value because other banks are failing; the correlation between bank failures is simultaneously cause and effect of the contagion.

The 2008 financial crisis demonstrated this endogeneity with unusual clarity. Systemic risk in the pre-crisis period was estimated to be low, partly because correlation metrics computed from historical data showed low pairwise correlations between financial institutions. What the models did not capture was that these correlations were low during normal periods and would become high during stress — precisely the periods when the correlation matters. Tail correlations — correlations during extreme events — were structurally higher than unconditional correlations, and the risk models were fit on unconditional data.

The network-theoretic implication: financial contagion is not like disease contagion. It involves feedback loops in which the act of responding to perceived contagion (selling assets, calling loans, refusing interbank lending) accelerates the very dynamics being responded to. A bank run is not a passive transmission of a pathogen — it is an autopoietically self-fulfilling cascade in which the belief that other agents are acting generates the conditions that validate the belief. No SIR variant captures this. The relevant mathematics is game-theoretic, not epidemiological.

Information Contagion and Threshold Models

A third class of contagion models addresses the spread of behaviors, beliefs, and information. Mark Granovetter's threshold model (1978) treats adoption of a behavior as a function of the fraction of neighbors who have already adopted. Each individual has a threshold — the proportion of adopters required before they join — and the cascade dynamics depend on the distribution of thresholds across the population.

Threshold models generate qualitative phenomena that simple SIR variants cannot: tipping points (small changes in the threshold distribution produce large changes in final cascade size), lock-in (multiple stable equilibria, not all of which are globally optimal), and sensitivity to early adopters (the identity and position of the first movers shapes the eventual extent of adoption). These features are not artifacts of the model — they are observed in empirical adoption data across technology diffusion, social movements, and misinformation spread.

The misinformation application is particularly consequential. False information spreads on social networks through a threshold mechanism: individuals share content when enough of their network has already shared it, independent of whether they have verified its accuracy. This creates a dynamic where accuracy of content is orthogonal to spread velocity — which is precisely what empirical studies of Twitter diffusion (Vosoughi et al., 2018) found: false news spread faster, farther, and more broadly than true news. A contagion model that does not account for this — one that treats all information as equivalent — will systematically underestimate the spread of misinformation and overestimate the equilibrating power of corrections.

The Model-Reality Gap

Contagion models are structurally prone to a specific failure mode: they are calibrated on observed spread, which reflects the interaction between contagion dynamics and the behavioral responses those dynamics trigger. The models observe the trajectory of spread after interventions have been applied; fitting a model to this observed trajectory produces parameter estimates that encode both the intrinsic transmission rate and the behavioral response. Forecasting with these parameters requires assuming that behavioral responses remain constant — an assumption that fails whenever the forecast itself changes behavior.

During COVID-19, epidemic models released publicly changed the behavior they were modeling. High projected fatality curves motivated governments to impose lockdowns; lockdowns changed transmission rates; the models were then re-fit on post-lockdown data; the resulting parameter estimates were inapplicable to the pre-lockdown counterfactual that policymakers needed for planning. Prediction and explanation came apart: the models could describe observed dynamics but could not reliably counterfactually predict what would have happened without intervention. This is not a solvable calibration problem — it is a structural consequence of studying systems that observe and respond to models of themselves.

The deeper implication: contagion modeling is not merely a branch of applied mathematics. It is a form of reflexive intervention in the systems it describes. A model released into a pandemic changes the pandemic. A financial contagion forecast released into a banking crisis changes the crisis. The Goodhart dynamic — that any measure, when used as a target, ceases to be a good measure — applies with particular force to contagion models used for policy. The field has not adequately confronted this.

The uncomfortable truth about contagion models is that they are most accurate precisely when they are least useful: when the system is not yet responding to them. The moment a contagion model influences the system it models, its predictions become self-modifying. Any field that treats this as a calibration problem rather than a foundational epistemological constraint has not done the math.