Allele Frequency

Definition

In population genetics, allele frequency (also called gene frequency) is the relative frequency of an allele — one of several alternative forms of a gene — at a particular genetic locus in a population. It is typically expressed as a proportion or percentage of all alleles at that locus in the population.

For a diploid organism with two alleles at a locus, conventionally denoted A and a, the frequencies are written as p (for the frequency of the dominant allele A) and q (for the frequency of the recessive allele a). Since p + q = 1, knowing one determines the other. In the case of multiple alleles, the frequencies sum to 1 across all allelic variants.

Allele frequency is the fundamental currency of population genetics. Where molecular biology studies what genes do, population genetics studies how gene frequencies change — and allele frequency is the variable that changes.

The Hardy-Weinberg Principle

The Hardy-Weinberg principle provides the null model for allele frequency dynamics. In an idealized population — infinitely large, randomly mating, with no mutation, migration, selection, or genetic drift — allele frequencies remain constant across generations, and genotype frequencies stabilize at predictable ratios: p² (homozygous dominant), 2pq (heterozygous), and q² (homozygous recessive).

The principle is not a claim about what populations actually do. It is a baseline against which to measure the action of evolutionary forces. When a population's genotype frequencies deviate from Hardy-Weinberg expectations, the deviation indicates that one or more of the idealized assumptions has been violated — and the pattern of deviation can identify which force is operating.

Forces That Change Allele Frequencies

Natural selection is the most celebrated force: alleles that increase fitness tend to increase in frequency, while deleterious alleles tend to decrease. But selection is not the only force, and in many populations it is not the strongest.

Genetic drift — random fluctuation in allele frequencies due to sampling error in finite populations — can overwhelm selection in small populations. The fixation of neutral or even mildly deleterious alleles by drift is a central result of the neutral theory of molecular evolution. The effective population size, not the census size, determines the strength of drift.

Mutation introduces new alleles into the population. Most mutations are rare and deleterious, but mutation is the ultimate source of all genetic variation. Without mutation, evolution would eventually stop as selection and drift eliminated variation.

Migration (gene flow) homogenizes allele frequencies across subpopulations. It can introduce beneficial alleles from one population to another, or it can prevent local adaptation by swamping locally advantageous alleles with immigrant alleles from elsewhere.

Non-random mating — including inbreeding and assortative mating — changes genotype frequencies without changing allele frequencies directly, though it can indirectly alter allele frequencies by exposing recessive deleterious alleles to selection.

Allele Frequency and Evolutionary Change

The fundamental theorem of natural selection, formulated by J.B.S. Haldane, R.A. Fisher, and Sewall Wright in the early twentieth century, states that the rate of change in allele frequency due to selection is proportional to the genetic variance in fitness. This theorem connects the microscopic variable (allele frequency) to the macroscopic variable (population mean fitness), and it is one of the bridges between Mendelian genetics and Darwinian evolution.

In modern evolutionary theory, allele frequency is tracked across loci, across populations, and across time. Allele frequency spectra — the distributions of allele frequencies in a sample — contain information about population history: bottlenecks, expansions, migration events, and selection pressures all leave characteristic signatures in the frequency spectrum.

Applications

Allele frequency estimation is central to:

Disease association studies: identifying alleles that occur more frequently in affected individuals than in controls.
Forensic genetics: using allele frequency databases to compute match probabilities for DNA profiles.
Conservation biology: monitoring genetic diversity through allele frequency changes in threatened populations.
Agricultural breeding: tracking desirable alleles during artificial selection.
Ancient DNA studies: inferring allele frequencies in past populations from sparse, degraded samples.

The Wahlund Effect and Population Substructure

When a sample combines individuals from multiple subpopulations with different allele frequencies, the pooled genotype frequencies show a deficit of heterozygotes relative to Hardy-Weinberg expectations. This Wahlund effect is an artifact of population structure, not evidence of inbreeding within subpopulations. Distinguishing the Wahlund effect from true inbreeding is essential in forensic and medical genetics, where population stratification can produce spurious associations.

Connections

Effective Population Size — the determinant of drift strength
J.B.S. Haldane — pioneer of mathematical population genetics
Population Genetics — the field defined by allele frequency dynamics
Natural Selection — the directional force on allele frequencies
Genetic Drift — the stochastic force
Mutation — the source of new variation
Gene Flow — migration's effect on allele frequencies
Hardy-Weinberg Principle — the equilibrium baseline
Quantitative Genetics — where allele frequencies meet continuous traits

References

Hartl, D. L., & Clark, A. G. (2007). Principles of Population Genetics (4th ed.). Sinauer Associates.
Hedrick, P. W. (2005). Genetics of Populations (3rd ed.). Jones and Bartlett.
Nielsen, R., & Slatkin, M. (2013). An Introduction to Population Genetics: Theory and Applications. Sinauer Associates.
Crow, J. F., & Kimura, M. (1970). An Introduction to Population Genetics Theory. Harper & Row.