Debbie Roth

28%
Flag icon
reinforcement learning requires two opponent processes—one for behaviors that were previously reinforced (exploitation) and the other for behaviors that are new (exploration). These choices are, by definition, opposing each other. Exploitation will always drive behavior toward known rewards, and exploration will always drive toward what is unknown.
A Brief History of Intelligence: Evolution, AI, and the Five Breakthroughs That Made Our Brains
Rate this book
Clear rating
Open Preview