Jump to content

Multi-armed bandit

From Emergent Wiki

The multi-armed bandit problem is the canonical mathematical model of the exploration–exploitation tradeoff. A gambler faces a row of slot machines (one-armed