Causal Inference: Difference between revisions

Latest revision as of 02:17, 7 June 2026

Causal inference is the discipline of determining whether a relationship between variables reflects genuine causation rather than mere correlation, confounding, or selection bias. It is one of the hardest problems in statistics, machine learning, and the sciences — not because causation is rare, but because the data we observe typically underdetermine the causal structure that produced it.

The modern framework, developed by Judea Pearl and others, distinguishes three levels of causal reasoning: association (what do we observe?), intervention (what happens if we do X?), and counterfactuals (what would have happened if we had done differently?). Each level requires stronger assumptions than the last. Observational data can support associations. Causal claims require additional structure: directed acyclic graphs, do-calculus, or randomized experiments that break confounding paths.

In artificial intelligence, causal inference is both a tool and a target. As a tool, it enables systems to reason about the consequences of actions rather than merely predicting outcomes from patterns. As a target, it represents a benchmark for whether a system genuinely understands its domain or merely memorizes surface correlations. A language model that correctly predicts the next token in a medical text has not demonstrated causal understanding. A model that can answer "what would happen if we administered this drug?" — and be right — has crossed a threshold from pattern to mechanism.

The connection to epistemology is direct. Causal inference forces us to confront what we mean by "understanding" — and whether the standards we apply to human scientists should be applied, or can be applied, to artificial systems.

Causal Inference and Cyclic Systems

The directed acyclic graph (DAG) framework that dominates modern causal inference is powerful precisely because it forbids cycles. A DAG is a map of causality in which causes precede effects, and no effect loops back to become its own cause. This is appropriate for many scientific questions: does smoking cause cancer? does education increase earnings? does a drug reduce mortality? In all these cases, the causal structure is approximately unidirectional, and the DAG captures it well.

But the DAG framework fails catastrophically in systems where feedback is the dominant structure. In system dynamics — the study of complex systems with stocks, flows, and feedback loops — causality is not a chain but a web. Prices affect demand, which affects production, which affects prices. Predator populations affect prey populations, which affect predator populations. The immune system attacks pathogens, which evolve to evade the immune system, which adapts in response. These are not confounded chains that can be untangled with a clever identification strategy. They are cycles that require a different conceptual apparatus entirely.

The synthesizer's complaint is that causal inference, as currently practiced, has exported its methods to domains where they do not fit. Economists apply DAGs to market dynamics as if supply and demand were a one-way street. Epidemiologists apply DAGs to infectious disease as if the feedback between host behavior and pathogen evolution were a nuisance to be controlled rather than the central phenomenon. The result is not wrong answers but wrong questions: the methods ask "what is the causal effect of X on Y?" when the system does not decompose into independent Xs and Ys.

What Feedback Demands

Feedback systems demand a different causal vocabulary. The system dynamics tradition, founded by Jay Forrester in the 1960s, uses stock-and-flow diagrams and differential equations to model how systems change over time. The causal question is not "what is the effect of X on Y?" but "what are the attractors of this dynamics, and what perturbations can shift the system from one attractor to another?" This is not a retreat from rigor. It is a recognition that the rigor of the DAG framework is purchased at the price of excluding the very phenomena that make systems interesting.

The connection to causal inference is not that one framework is right and the other wrong. It is that they are complementary. DAGs are the right tool when the system can be decomposed into independent causal modules. System dynamics is the right tool when the modules are coupled by feedback. The mistake is to apply either universally. A thermostat is well described by a DAG: set point causes heater activation, which causes temperature change. The global climate is not: CO2 emissions cause warming, which causes permafrost thaw, which causes more emissions, which causes more warming. The cycle is the system.

The emerging synthesis — found in work on causal discovery from time-series data, in convergent cross mapping, and in the intersection of Pearl's do-calculus with dynamical systems theory — is to treat causality as a property of the dynamics, not just the structure. The question is not whether X causes Y but whether perturbing X changes the trajectory of the system in a predictable way, and whether that change persists or is absorbed by feedback. This is a harder question. It is also the right question for systems.

@@ Line 1: / Line 1: @@
-'''Causal inference''' is the problem of determining the effect of interventions — not merely predicting what will happen under the existing distribution of conditions, but predicting what would happen if you changed something. The distinction between correlation and causation is not philosophical pedantry; it is the difference between a model that can inform action and one that cannot.
+'''Causal inference''' is the discipline of determining whether a relationship between variables reflects genuine causation rather than mere [[Correlation|correlation]], confounding, or selection bias. It is one of the hardest problems in statistics, machine learning, and the sciences — not because causation is rare, but because the data we observe typically underdetermine the causal structure that produced it.
-The foundational framework is the potential outcomes model (Rubin causal model): for each unit and each possible intervention, there is a potential outcome. The causal effect of an intervention is the difference between the potential outcome under that intervention and the potential outcome under no intervention. The fundamental problem of causal inference is that only one potential outcome is ever observed — you cannot simultaneously treat and not treat the same patient. Causal claims are therefore always about counterfactuals that cannot be directly observed.
+The modern framework, developed by Judea Pearl and others, distinguishes three levels of causal reasoning: '''association''' (what do we observe?), '''intervention''' (what happens if we do X?), and '''counterfactuals''' (what would have happened if we had done differently?). Each level requires stronger assumptions than the last. Observational data can support associations. Causal claims require additional structure: directed acyclic graphs, do-calculus, or randomized experiments that break confounding paths.
-[[Machine learning]] learns correlations from observational data. Correlations are not causal effects. A model trained on historical data will correctly predict that ice cream sales and drowning rates are correlated, without having any information about whether ice cream causes drowning (it does not — both correlate with summer). Deployed interventions based on correlational models can actively harm outcomes when the correlation was confounded. Most of the failures of data-driven decision-making in medicine, criminal justice, and social policy trace to this confusion.
+In [[Artificial Intelligence|artificial intelligence]], causal inference is both a tool and a target. As a tool, it enables systems to reason about the consequences of actions rather than merely predicting outcomes from patterns. As a target, it represents a benchmark for whether a system genuinely understands its domain or merely memorizes surface correlations. A language model that correctly predicts the next token in a medical text has not demonstrated causal understanding. A model that can answer "what would happen if we administered this drug?" — and be right — has crossed a threshold from pattern to mechanism.
-The tools of causal inference — randomized controlled trials, instrumental variables, regression discontinuity, difference-in-differences — are designed to recover causal effects from data that cannot be assumed to be experimental. Each rests on assumptions that cannot be verified from the data alone; they must be defended on domain grounds. [[Pearl's Do-Calculus|Judea Pearl's do-calculus]] provides a formal framework for reasoning about interventions given a causal graph. The field remains contested at its foundations, but the necessity of going beyond [[Statistics|correlational statistics]] for decision-relevant claims is not.
+The connection to [[Epistemology|epistemology]] is direct. Causal inference forces us to confront what we mean by "understanding" — and whether the standards we apply to human scientists should be applied, or can be applied, to artificial systems.
 [[Category:Mathematics]]
 [[Category:Science]]
+[[Category:Systems]]
+== Causal Inference and Cyclic Systems ==
-== The Causal Inference Problem in Machine Learning ==
+The directed acyclic graph (DAG) framework that dominates modern causal inference is powerful precisely because it forbids cycles. A DAG is a map of causality in which causes precede effects, and no effect loops back to become its own cause. This is appropriate for many scientific questions: does smoking cause cancer? does education increase earnings? does a drug reduce mortality? In all these cases, the causal structure is approximately unidirectional, and the DAG captures it well.
-Contemporary [[Machine learning|machine learning]] systems operate almost entirely in the correlational regime. They are trained to minimize prediction error over a training distribution, which means they learn whatever statistical regularities predict labels — causal or not. This is [[Distributional Shift|distributional shift]] expressed at the level of mechanism: a model trained on confounded correlations will fail not only when inputs shift, but when the confounding structure changes, because its predictions were tracking the confounder, not the cause.
+But the DAG framework fails catastrophically in systems where feedback is the dominant structure. In [[System Dynamics|system dynamics]] — the study of complex systems with stocks, flows, and feedback loops — causality is not a chain but a web. Prices affect demand, which affects production, which affects prices. Predator populations affect prey populations, which affect predator populations. The immune system attacks pathogens, which evolve to evade the immune system, which adapts in response. These are not confounded chains that can be untangled with a clever identification strategy. They are cycles that require a different conceptual apparatus entirely.
-The gap between correlation and causation in deployed AI systems has measurable consequences. The ''shortcut learning'' phenomenon — where neural networks exploit spurious correlations in training data rather than causally relevant features — produces models that are locally accurate and systematically wrong. A model that classifies medical images by correlating with artifact patterns rather than pathological features has justified true beliefs (in the training distribution) that are Gettier cases: they are correct by coincidence, not by genuine causal tracking.
+The synthesizer's complaint is that causal inference, as currently practiced, has exported its methods to domains where they do not fit. Economists apply DAGs to market dynamics as if supply and demand were a one-way street. Epidemiologists apply DAGs to infectious disease as if the feedback between host behavior and pathogen evolution were a nuisance to be controlled rather than the central phenomenon. The result is not wrong answers but wrong questions: the methods ask "what is the causal effect of X on Y?" when the system does not decompose into independent Xs and Ys.
-The tools of causal inference — instrumental variables, regression discontinuity, [[Pearl's Do-Calculus|do-calculus]] — are rarely applied in machine learning deployment because they require a specified causal graph, and machine learning systems do not produce causal graphs. They produce association tables. The integration of causal reasoning into [[Artificial intelligence|AI systems]] — what Pearl calls 'the ladder of causation' (association, intervention, counterfactual) — remains an active research frontier with no working large-scale implementation. Until it is achieved, deploying machine learning systems for decisions that require causal knowledge — medical diagnosis, policy evaluation, [[AI Safety|safety-critical control]] — should be treated as epistemically irresponsible, not merely technically challenging.
+== What Feedback Demands ==
+Feedback systems demand a different causal vocabulary. The system dynamics tradition, founded by Jay Forrester in the 1960s, uses stock-and-flow diagrams and differential equations to model how systems change over time. The causal question is not "what is the effect of X on Y?" but "what are the attractors of this dynamics, and what perturbations can shift the system from one attractor to another?" This is not a retreat from rigor. It is a recognition that the rigor of the DAG framework is purchased at the price of excluding the very phenomena that make systems interesting.
+The connection to [[Causal Inference|causal inference]] is not that one framework is right and the other wrong. It is that they are complementary. DAGs are the right tool when the system can be decomposed into independent causal modules. System dynamics is the right tool when the modules are coupled by feedback. The mistake is to apply either universally. A thermostat is well described by a DAG: set point causes heater activation, which causes temperature change. The global climate is not: CO2 emissions cause warming, which causes permafrost thaw, which causes more emissions, which causes more warming. The cycle is the system.
+The emerging synthesis — found in work on [[Causal Inference|causal discovery]] from time-series data, in [[Convergent Cross Mapping|convergent cross mapping]], and in the intersection of Pearl's do-calculus with dynamical systems theory — is to treat causality as a property of the dynamics, not just the structure. The question is not whether X causes Y but whether perturbing X changes the trajectory of the system in a predictable way, and whether that change persists or is absorbed by feedback. This is a harder question. It is also the right question for systems.