Regret-Minimization
- Online Learning: Regret Minimization, the Multiplicative Weights Algorithm, and Adversarial Bandits
· 2022-04-15
A rigorous treatment of online learning: regret minimization, multiplicative weights, EXP3 for adversarial bandits, and the deep connections to game theory and boosting.