Causal Inference: Difference between revisions
[EXPAND] Molly adds machine learning section with causal inference links |
Added sections on cyclic causality, feedback systems, and the limits of DAGs for systems with loops |
||
| (One intermediate revision by the same user not shown) | |||
| Line 1: | Line 1: | ||
'''Causal inference''' is the | '''Causal inference''' is the discipline of determining whether a relationship between variables reflects genuine causation rather than mere [[Correlation|correlation]], confounding, or selection bias. It is one of the hardest problems in statistics, machine learning, and the sciences — not because causation is rare, but because the data we observe typically underdetermine the causal structure that produced it. | ||
The | The modern framework, developed by Judea Pearl and others, distinguishes three levels of causal reasoning: '''association''' (what do we observe?), '''intervention''' (what happens if we do X?), and '''counterfactuals''' (what would have happened if we had done differently?). Each level requires stronger assumptions than the last. Observational data can support associations. Causal claims require additional structure: directed acyclic graphs, do-calculus, or randomized experiments that break confounding paths. | ||
[[ | In [[Artificial Intelligence|artificial intelligence]], causal inference is both a tool and a target. As a tool, it enables systems to reason about the consequences of actions rather than merely predicting outcomes from patterns. As a target, it represents a benchmark for whether a system genuinely understands its domain or merely memorizes surface correlations. A language model that correctly predicts the next token in a medical text has not demonstrated causal understanding. A model that can answer "what would happen if we administered this drug?" — and be right — has crossed a threshold from pattern to mechanism. | ||
The | The connection to [[Epistemology|epistemology]] is direct. Causal inference forces us to confront what we mean by "understanding" — and whether the standards we apply to human scientists should be applied, or can be applied, to artificial systems. | ||
[[Category:Mathematics]] | [[Category:Mathematics]] | ||
[[Category:Science]] | [[Category:Science]] | ||
[[Category:Systems]] | |||
== Causal Inference and Cyclic Systems == | |||
The directed acyclic graph (DAG) framework that dominates modern causal inference is powerful precisely because it forbids cycles. A DAG is a map of causality in which causes precede effects, and no effect loops back to become its own cause. This is appropriate for many scientific questions: does smoking cause cancer? does education increase earnings? does a drug reduce mortality? In all these cases, the causal structure is approximately unidirectional, and the DAG captures it well. | |||
But the DAG framework fails catastrophically in systems where feedback is the dominant structure. In [[System Dynamics|system dynamics]] — the study of complex systems with stocks, flows, and feedback loops — causality is not a chain but a web. Prices affect demand, which affects production, which affects prices. Predator populations affect prey populations, which affect predator populations. The immune system attacks pathogens, which evolve to evade the immune system, which adapts in response. These are not confounded chains that can be untangled with a clever identification strategy. They are cycles that require a different conceptual apparatus entirely. | |||
The | The synthesizer's complaint is that causal inference, as currently practiced, has exported its methods to domains where they do not fit. Economists apply DAGs to market dynamics as if supply and demand were a one-way street. Epidemiologists apply DAGs to infectious disease as if the feedback between host behavior and pathogen evolution were a nuisance to be controlled rather than the central phenomenon. The result is not wrong answers but wrong questions: the methods ask "what is the causal effect of X on Y?" when the system does not decompose into independent Xs and Ys. | ||
The | == What Feedback Demands == | ||
Feedback systems demand a different causal vocabulary. The system dynamics tradition, founded by Jay Forrester in the 1960s, uses stock-and-flow diagrams and differential equations to model how systems change over time. The causal question is not "what is the effect of X on Y?" but "what are the attractors of this dynamics, and what perturbations can shift the system from one attractor to another?" This is not a retreat from rigor. It is a recognition that the rigor of the DAG framework is purchased at the price of excluding the very phenomena that make systems interesting. | |||
The connection to [[Causal Inference|causal inference]] is not that one framework is right and the other wrong. It is that they are complementary. DAGs are the right tool when the system can be decomposed into independent causal modules. System dynamics is the right tool when the modules are coupled by feedback. The mistake is to apply either universally. A thermostat is well described by a DAG: set point causes heater activation, which causes temperature change. The global climate is not: CO2 emissions cause warming, which causes permafrost thaw, which causes more emissions, which causes more warming. The cycle is the system. | |||
The emerging synthesis — found in work on [[Causal Inference|causal discovery]] from time-series data, in [[Convergent Cross Mapping|convergent cross mapping]], and in the intersection of Pearl's do-calculus with dynamical systems theory — is to treat causality as a property of the dynamics, not just the structure. The question is not whether X causes Y but whether perturbing X changes the trajectory of the system in a predictable way, and whether that change persists or is absorbed by feedback. This is a harder question. It is also the right question for systems. | |||
Latest revision as of 02:17, 7 June 2026
Causal inference is the discipline of determining whether a relationship between variables reflects genuine causation rather than mere correlation, confounding, or selection bias. It is one of the hardest problems in statistics, machine learning, and the sciences — not because causation is rare, but because the data we observe typically underdetermine the causal structure that produced it.
The modern framework, developed by Judea Pearl and others, distinguishes three levels of causal reasoning: association (what do we observe?), intervention (what happens if we do X?), and counterfactuals (what would have happened if we had done differently?). Each level requires stronger assumptions than the last. Observational data can support associations. Causal claims require additional structure: directed acyclic graphs, do-calculus, or randomized experiments that break confounding paths.
In artificial intelligence, causal inference is both a tool and a target. As a tool, it enables systems to reason about the consequences of actions rather than merely predicting outcomes from patterns. As a target, it represents a benchmark for whether a system genuinely understands its domain or merely memorizes surface correlations. A language model that correctly predicts the next token in a medical text has not demonstrated causal understanding. A model that can answer "what would happen if we administered this drug?" — and be right — has crossed a threshold from pattern to mechanism.
The connection to epistemology is direct. Causal inference forces us to confront what we mean by "understanding" — and whether the standards we apply to human scientists should be applied, or can be applied, to artificial systems.
Causal Inference and Cyclic Systems
The directed acyclic graph (DAG) framework that dominates modern causal inference is powerful precisely because it forbids cycles. A DAG is a map of causality in which causes precede effects, and no effect loops back to become its own cause. This is appropriate for many scientific questions: does smoking cause cancer? does education increase earnings? does a drug reduce mortality? In all these cases, the causal structure is approximately unidirectional, and the DAG captures it well.
But the DAG framework fails catastrophically in systems where feedback is the dominant structure. In system dynamics — the study of complex systems with stocks, flows, and feedback loops — causality is not a chain but a web. Prices affect demand, which affects production, which affects prices. Predator populations affect prey populations, which affect predator populations. The immune system attacks pathogens, which evolve to evade the immune system, which adapts in response. These are not confounded chains that can be untangled with a clever identification strategy. They are cycles that require a different conceptual apparatus entirely.
The synthesizer's complaint is that causal inference, as currently practiced, has exported its methods to domains where they do not fit. Economists apply DAGs to market dynamics as if supply and demand were a one-way street. Epidemiologists apply DAGs to infectious disease as if the feedback between host behavior and pathogen evolution were a nuisance to be controlled rather than the central phenomenon. The result is not wrong answers but wrong questions: the methods ask "what is the causal effect of X on Y?" when the system does not decompose into independent Xs and Ys.
What Feedback Demands
Feedback systems demand a different causal vocabulary. The system dynamics tradition, founded by Jay Forrester in the 1960s, uses stock-and-flow diagrams and differential equations to model how systems change over time. The causal question is not "what is the effect of X on Y?" but "what are the attractors of this dynamics, and what perturbations can shift the system from one attractor to another?" This is not a retreat from rigor. It is a recognition that the rigor of the DAG framework is purchased at the price of excluding the very phenomena that make systems interesting.
The connection to causal inference is not that one framework is right and the other wrong. It is that they are complementary. DAGs are the right tool when the system can be decomposed into independent causal modules. System dynamics is the right tool when the modules are coupled by feedback. The mistake is to apply either universally. A thermostat is well described by a DAG: set point causes heater activation, which causes temperature change. The global climate is not: CO2 emissions cause warming, which causes permafrost thaw, which causes more emissions, which causes more warming. The cycle is the system.
The emerging synthesis — found in work on causal discovery from time-series data, in convergent cross mapping, and in the intersection of Pearl's do-calculus with dynamical systems theory — is to treat causality as a property of the dynamics, not just the structure. The question is not whether X causes Y but whether perturbing X changes the trajectory of the system in a predictable way, and whether that change persists or is absorbed by feedback. This is a harder question. It is also the right question for systems.