Goodreads helps you keep track of books you want to read.
Start by marking “Feature Engineering for Machine Learning” as Want to Read:
Feature Engineering for Machine Learning
Enlarge cover
Rate this book
Clear rating
Open Preview

Feature Engineering for Machine Learning

3.86  ·  Rating details ·  73 ratings  ·  8 reviews
Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, youll learn techniques for extracting and transforming featuresthe numeric representations of raw datainto formats for machine-learning models. Each chapter guides you through a single data problem, such as how to represent text or ...more
Paperback, 218 pages
Published April 10th 2018 by O'Reilly Media
More Details... Edit Details

Friend Reviews

To see what your friends thought of this book, please sign up.

Reader Q&A

To ask other readers questions about Feature Engineering for Machine Learning, please sign up.

Be the first to ask a question about Feature Engineering for Machine Learning

Community Reviews

Showing 1-30
Average rating 3.86  · 
Rating details
 ·  73 ratings  ·  8 reviews

More filters
Sort order
Start your review of Feature Engineering for Machine Learning
Mahmoud Rabie
Apr 08, 2018 rated it really liked it  ·  review of another edition
I liked the book a lot, yes the book contains a lot of math but the author tried to always explain in simple words

The book don't focus on giving a tips and tricks or a how to guide for feature engineering, instead it focus on the effect of feature engineering on the data and the models

i.e. the effect of using log transformation on linear regression

As the book size is small - around 200 pages - the book don't cover a lot of topics, but at the end of the book your understanding for feature
Amir Sarabadani
Apr 17, 2019 rated it really liked it
Most of the graphs are hand-written, that was weird.
Rick Sam
A Quick read, teaches you basics in Feature Engineering, this might give you raw knowledge about Feature Engineering, meanwhile in Production or Industry, one has to practice or apply based on the the raw-knowledge.

If someone could come up with a best way to gain know-how or procedural knowledge faster, do let me know.

I have a summary of this, if you want do PM me, it might save your time.

Here's an Outline of the Book:

0. Introduction and my thoughts
1. Machine Learning Pipeline
2. Fancy
Sep 20, 2019 rated it it was ok
Fine start, but there's not a lot here. There are some good tricks for people who don't have a lot of experience building predictive models.

The book also focuses on some specific types of data that lots of people won't need to work with: images and text (as in trying to get a computer to understand text). More general discussion about categorical and numeric variables would have been better, since those ideas can be translated to other contexts more easily.

This book will probably be obsolete
Auggie Heschmeyer
Aug 28, 2019 rated it it was ok
This book is great if you want to know the exact mathematical expressions for the "feature engineering" part of the title, but a total drag if you're looking for anything practical. This book is clearly after a depth not breadth approach, but they go deep down the wrong paths. For instance, the final chapter is billed as a case study. However, rather than engineering the right features to do the task they lay out, they engineer some bad features then slap on some okay ones and say, "There you ...more
Mike Fowler
Sep 24, 2019 rated it it was amazing
Shelves: technology, data
Good clear explanations of concepts with plenty of examples. Appendix with useful refresher on linear algebra.
White Rose
Dec 23, 2018 rated it it was amazing
it is quite good, maybe not 100% noob-friendly, it is for me to reread again but there is a list of concepts I haven't heard of before
Jun 16, 2019 rated it liked it
Good for a general review, but too basic (e.g., not explaining the data distribution suitable for an engineering technique) and having some errors (the PCA chapter, although its explanation is good and useful).
rated it really liked it
Nov 18, 2017
Kailash Desiti
rated it liked it
Jan 19, 2019
rated it really liked it
Dec 11, 2018
Chris Mann
rated it it was amazing
Aug 05, 2017
rated it it was amazing
Dec 29, 2017
rated it really liked it
Jun 02, 2018
rated it it was ok
May 04, 2018
rated it really liked it
Apr 19, 2017
rated it really liked it
Sep 30, 2019
Chakrit Yau
rated it really liked it
Sep 05, 2018
rated it liked it
Dec 15, 2018
Dan Ryan
rated it liked it
Dec 06, 2017
Ryo Hayama
rated it it was amazing
Jun 04, 2019
Jonathan Hoyt
rated it it was amazing
May 23, 2018
rated it liked it
Sep 20, 2018
rated it really liked it
Aug 10, 2018
Tiago Santos
rated it liked it
Apr 21, 2019
Bernard Schmitz
rated it liked it
Jan 05, 2020
Paco Nathan
rated it it was amazing
Jan 02, 2019
Miha Torkar
rated it liked it
Oct 17, 2019
rated it liked it
Jun 18, 2018
Bartosz Mikulski
rated it really liked it
Feb 16, 2020
« previous 1 3 next »
There are no discussion topics on this book yet. Be the first to start one »

Readers also enjoyed

  • Hands-On Machine Learning with Scikit-Learn and TensorFlow
  • Python for Data Analysis
  • Thoughtful Machine Learning with Python: A Test-Driven Approach
  • Deep Learning with Python
  • Python Data Science Handbook: Tools and Techniques for Developers
  • Designing Data-Intensive Applications
  • Bayesian Statistics the Fun Way: Understanding Statistics and Probability with Star Wars, Lego, and Rubber Ducks
  • Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning
  • The Language of Emotions: What Your Feelings Are Trying to Tell You
  • Night Boat to Tangier
  • Code
  • Inland
  • Spark: The Definitive Guide: Big Data Processing Made Simple
  • Practical Statistics for Data Scientists: 50 Essential Concepts
  • Deep Learning
  • Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale
  • Introduction to Machine Learning with Python: A Guide for Data Scientists
See similar books…

Goodreads is hiring!

If you like books and love to build cool products, we may be looking for you.
Learn more »
Alice is a technical leader in the field of machine learning. Her experience spans algorithm and platform development and applications. Currently, she is a Senior Manager in Amazon's Ad Platform. Previous roles include Director of Data Science at GraphLab/Dato/Turi, machine learning researcher at Microsoft Research, Redmond, and postdoctoral fellow at Carnegie Mellon University. She received a ...more

News & Interviews

As dedicated readers already know, some of the best and most innovative stories on the shelves come from the constantly evolving realm of young ...
51 likes · 11 comments