Jump to ratings and reviews
Rate this book

Evaluating Machine Learning Models

Rate this book
Data science today is a lot like the Wild West: there’s endless opportunity and excitement, but also a lot of chaos and confusion. If you’re new to data science and applied machine learning, evaluating a machine-learning model can seem pretty overwhelming. Now you have help. With this O’Reilly report, machine-learning expert Alice Zheng takes you through the model evaluation basics.

In this overview, Zheng first introduces the machine-learning workflow, and then dives into evaluation metrics and model selection. The latter half of the report focuses on hyperparameter tuning and A/B testing, which may benefit more seasoned machine-learning practitioners.

With this report, you will:

Learn the stages involved when developing a machine-learning model for use in a software application
Understand the metrics used for supervised learning models, including classification, regression, and ranking
Walk through evaluation mechanisms, such as hold?out validation, cross-validation, and bootstrapping
Explore hyperparameter tuning in detail, and discover why it’s so difficult
Learn the pitfalls of A/B testing, and examine a promising alternative: multi-armed bandits
Get suggestions for further reading, as well as useful software packages

59 pages, ebook

First published September 1, 2015

83 people want to read

About the author

Alice Zheng

6 books8 followers
Alice is a technical leader in the field of machine learning. Her experience spans algorithm and platform development and applications. Currently, she is a Senior Manager in Amazon's Ad Platform. Previous roles include Director of Data Science at GraphLab/Dato/Turi, machine learning researcher at Microsoft Research, Redmond, and postdoctoral fellow at Carnegie Mellon University. She received a Ph.D. in Electrical Engineering and Computer science, and B.A. degrees in Computer Science in Mathematics, all from U.C. Berkeley.

Ratings & Reviews

What do you think?
Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars
10 (22%)
4 stars
26 (59%)
3 stars
8 (18%)
2 stars
0 (0%)
1 star
0 (0%)
Displaying 1 - 5 of 5 reviews
Profile Image for Alex Ott.
Author 3 books207 followers
October 1, 2015
short, quite good book (freely available from O'Reilly) on how to evaluate quality of the machine learning models...
Profile Image for Ira Noviani.
7 reviews
January 23, 2016
To be fair to a thin copy of such subject that can require much details study, the author has sufficiently managed to squeeze in the high-level framework into a book. She added some reading materials for those who are keen to learn more in-depth on certain topic.

She started from evaluating the metrics to choose which depends on the type of problem that currently being worked on. For example, classification should better be evaluated against the accuracy that the model churns while the precision/recall is more suitable for ranking.

The only part that I got a bit lost is when the writer said we should not confuse the model validation, cross validation, and hyperparameter tuning. As this concept of hyperparameter tuning is novel to me, I was expecting the next sentence to give me a definition of the term so that I can make a clear distinction out of them. Alas, the discussion of the hyperparameter starts at the next chapter. So probably better for the reader to keep go back and forth between the chapters to properly understand those three concept.

One more note from the machine learning newbie like me is from the specific statement on page 25: "Every new model needs to be evaluated on a separate dataset". The reasoning for this can be found on page 47: "If the tests are not independent (i.e., maybe your 32 models all came from the same training dataset?), ..." which sums up the importance of the tests independece.
83 reviews4 followers
October 5, 2015
This is excellent reference material for evaluating any machine learning model. Alice does a great job simplifying the evaluation process into its pros and cons. My only gripe is that the book does not stay consistent with teaching in a simple manner. It starts off simple but gets more complex as the book goes on.
Displaying 1 - 5 of 5 reviews

Can't find what you're looking for?

Get help and learn more about the design.