Rob

12%
Flag icon
If the probabilities of a payoff on the different arms change over time—what has been termed a “restless bandit”—the problem becomes much harder. (So much harder, in fact, that there’s no tractable algorithm for completely solving it, and it’s believed there never will be.)
Algorithms to Live By: The Computer Science of Human Decisions
Rate this book
Clear rating
Open Preview