Colin’s Kindle Notes & Highlights

Superintelligence: Paths, Dangers, Strategies, by Nick Bostrom

We can also control their environment so that they receive rewards only when they act in ways that are agreeable to us. But a reinforcement learner has a strong incentive to eliminate this artificial dependence of its rewards on our whims and wishes. Our relationship with a reinforcement learner is therefore fundamentally antagonistic. If the agent is strong, this spells danger.