The model (set of weights) that gives the highest sum is the model that gives the highest “likelihood” to the data — the “maximum likelihood” model. The maximum likelihood model “on average” gives the highest probabilities to the positive examples and the lowest probabilities to the negative examples.

