Aki Ranin

48%
Flag icon
Insofar as a reinforcement-learning agent can be described as having a final goal, that goal remains constant: to maximize future reward. And reward consists of specially designated percepts received from the environment. Therefore, the wireheading syndrome remains a likely outcome in any reinforcement agent that develops a world model sophisticated enough to suggest this alternative way of maximizing reward.9
Superintelligence: Paths, Dangers, Strategies
Rate this book
Clear rating
Open Preview