The larger the information gain, the lower the impurity of the resulting child nodes, so at each node the decision tree algorithm selects the split with the highest information gain. Remember, the algorithm's goal is to predict either "yes - watch a movie" or "no - do not watch a movie", so at every step it greedily chooses the split that separates those two outcomes most cleanly.
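To make the selection rule concrete, here is a minimal sketch in Python. It assumes entropy as the impurity measure (Gini impurity is another common choice), and the labels and the two candidate splits are hypothetical toy data invented for illustration, not taken from the movie example itself.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(parent, children):
    """Parent entropy minus the size-weighted entropy of the child nodes."""
    total = len(parent)
    weighted = sum(len(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted


# Hypothetical toy labels: three "yes" and three "no" outcomes.
parent = ["yes", "yes", "yes", "no", "no", "no"]

# Candidate split A (hypothetical): separates the classes perfectly.
split_a = [["yes", "yes", "yes"], ["no", "no", "no"]]
# Candidate split B (hypothetical): leaves both children mixed.
split_b = [["yes", "yes", "no"], ["yes", "no", "no"]]

gain_a = information_gain(parent, split_a)
gain_b = information_gain(parent, split_b)

# The tree keeps whichever candidate has the larger gain.
best = "A" if gain_a > gain_b else "B"
print(f"gain A = {gain_a:.3f}, gain B = {gain_b:.3f}, chosen split: {best}")
```

Split A leaves two pure children, so its weighted child entropy is zero and its gain equals the parent's full entropy. Split B leaves both children mixed, so its gain is much smaller and the algorithm would prefer A.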