Multi-armed bandit
Appearance
The multi-armed bandit problem is the canonical mathematical model of the exploration–exploitation tradeoff. A gambler faces a row of slot machines (one-armed
The multi-armed bandit problem is the canonical mathematical model of the exploration–exploitation tradeoff. A gambler faces a row of slot machines (one-armed