50 books
—
4 voters
Goodreads helps you keep track of books you want to read.
Start by marking “Python Data Science Handbook: Tools and Techniques for Developers” as Want to Read:
Python Data Science Handbook: Tools and Techniques for Developers
by
For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all—IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools.
Working scientists ...more
Working scientists ...more
Get A Copy
Paperback, 500 pages
Published
December 25th 2016
by O'Reilly Media
(first published March 25th 2016)
Friend Reviews
To see what your friends thought of this book,
please sign up.
Reader Q&A
To ask other readers questions about
Python Data Science Handbook,
please sign up.
Be the first to ask a question about Python Data Science Handbook
Community Reviews
Showing 1-30

Start your review of Python Data Science Handbook: Tools and Techniques for Developers

This book is not as good as R for Data Science: Import, Tidy, Transform, Visualize, and Model Data, but if you are constrained or committed to using Python, it is the best available alternative as of 2018. Chapters 1 through 3 on ipython, Numpy, and Pandas are very well written, although they do suffer from using mostly small, made-up examples. Chapter 4 on Matplotlib is disappointing, but that's because Matplotlib is itself a weak and obsolete tool; the book acknowledges that fact and cannot fi
...more

The book is written as a Jupyter notebook, and is available for free on GitHub:
https://github.com/jakevdp/PythonData...
Books written as Jupyter Notebooks are simply wonderful. They should become the default medium for learning new materials related to computer science and mathematics.
Regarding the book itself, it fits more in the "practical knowledge" category, which is totally fine since it's a handbook. Being exposed to the different methods and tools is great. There is however no real theoret ...more
https://github.com/jakevdp/PythonData...
Books written as Jupyter Notebooks are simply wonderful. They should become the default medium for learning new materials related to computer science and mathematics.
Regarding the book itself, it fits more in the "practical knowledge" category, which is totally fine since it's a handbook. Being exposed to the different methods and tools is great. There is however no real theoret ...more

I read this book after having worked as a data scientist for about a year and a half. Most of my work had focused on machine learning, so I had picked up Numpy, Pandas, and Matplotlib along the way. This approach left some glaring holes in my usage of these modules. After having read this book I can see that there has been a couple of things I have been doing wrong -- or at least very ineffectively. So reading this book was definitely a good idea.
I especially appreciated the chapters on Numpy an ...more
I especially appreciated the chapters on Numpy an ...more

Mandatory read, did not finish around 50%.
So I'm in my final year of Information Studies and I feel like it wasn't until I read this book that I truly understood computer programming. It covers very useful packages for Data Science (Numpy, Pandas, Matplotlib), and not only explains what the code does, but also provides many code examples that help you to understand it and use it on your own.
I would highly recommend this book to anyone who has some basic knowledge of Python but wants/needs to be ...more
So I'm in my final year of Information Studies and I feel like it wasn't until I read this book that I truly understood computer programming. It covers very useful packages for Data Science (Numpy, Pandas, Matplotlib), and not only explains what the code does, but also provides many code examples that help you to understand it and use it on your own.
I would highly recommend this book to anyone who has some basic knowledge of Python but wants/needs to be ...more

Extremely well written. Just the right level of depth. It was useful to work through bit by bit to gain a general understanding and practice, and I'm sure it will also be useful as a desktop reference. I was inspired throughout to look at my data in new ways and apply new, modern methods to the data in order to obtain more robust results and hopefully uncover things about it that I simply would not have otherwise. Most of that happened in the machine learning (final) chapter. I appreciated the a
...more

The python data science handbook is the best python tutorial I have read. It is "an overview of python if you want to be a data scientist" - the breadth and depth on specific tools (matplotlib & beyond, pandas, and sci-kit, as well as ipython & jupyter notebooks) is perfect for a data science application. This is definitely addressing the "computer skills" third of the data science Venn diagram (not much on mathematics or subject matter expertise). Recommended for learning python or having as a
...more

It’s a succinct and well written book in data science using python, one of its greatest weaknesses is its examples, the author didn’t relate subjects with examples well and they are too hard to understand. But in a nutshell, it’s a good book for learning the basics of numpy, pandas, matplotlib and a little bit of machine learning

Liked how it goes in depth into NumPy and then Pandas. Sometimes a "little" too API based but that makes it practical in some respects.
...more

While the first four chapters offer a solid, hands-on overview of IPython, Numpy, Pandas and Matplotlib, you can find equivalent tutorials on how to slice arrays and manipulate DataFrames pretty much anywhere. Unless you're a complete beginner in scientific work with Python, these chapters will likely serve as refreshers at best. The chapter that really stood out to me was the last one on Machine Learning, so much so that I almost considered giving this a higher rating. Unfortunately, the lack o
...more

A rigorous overview of data science tools in Python, combined with an introduction to several machine learning techniques using the sci-kit learn library.
As someone that has approached learning data science and programming on a project-by-project basis, it was wonderfully enlightening to see the author dive deep into the syntax, and reasoning behind libraries such as NumPy, Pandas, and Matplotlib. The chapter on machine learning is surprisingly hefty considering how much has come prior to it.
I r ...more
As someone that has approached learning data science and programming on a project-by-project basis, it was wonderfully enlightening to see the author dive deep into the syntax, and reasoning behind libraries such as NumPy, Pandas, and Matplotlib. The chapter on machine learning is surprisingly hefty considering how much has come prior to it.
I r ...more

This is really an amazing technical resource. Vanderplas manages to keep his content extraordinarily practical and grounded, without being irreverent to the theory like so many lower-quality modern data science texts are. As a contributor to the Python data software libraries such as Scikit-learn, the author is eminently qualified to give a tour of their inner workings. Finally, the book is self-aware of where it lacks depth, and does an excellent job in referring readers to further resources.

As a starter, new to python the first four chapters of the book were very easy to follow, I learned too much from those chapters, except for chapter 5 (Introduction to machine learning) was somehow hard for me to follow because the concept of machine learning was new to me and there was too much code in the chapter that the author assumed you might know so there was no explanation, but someone with a bit knowledge on python would follow it very easily.

I'll just say this:
If I was put into this horrible scenario where I was held at a gunpoint next to a gigantic red button and was told that I must press it and nuke *every* single book publisher in the world bar one and I absolutely must choose which one, I would save O'Reilly. And I would use *this* book as an example to justify why. ...more
If I was put into this horrible scenario where I was held at a gunpoint next to a gigantic red button and was told that I must press it and nuke *every* single book publisher in the world bar one and I absolutely must choose which one, I would save O'Reilly. And I would use *this* book as an example to justify why. ...more

Great resource with excellent examples and useful, well-written Python code. A lot of techniques are introduced here, with the unfortunate exception of neural networks/deep learning, which is beyond the scope of this book. The book is written using Jupyter notebooks and printed in black & white, so for some of the plots you'll have to refer to the online versions to better see what's going on.
...more

Excellent book for any one interested in understand the fundamentals of scientific computing for data science in Python. I can't recommend this book enough, if you are interested in data science, read it from beginning to end.
...more

This review has been hidden because it contains spoilers. To view it,
click here.

Great coverage of the basic tools used in data science by somebody who seems to know the subject well. You don't need to be a python coder for the book to be useful, too.
...more

This review has been hidden because it contains spoilers. To view it,
click here.
There are no discussion topics on this book yet.
Be the first to start one »
Goodreads is hiring!
News & Interviews
Listen up, because our colleagues here at Goodreads have some excellent audiobook recommendations for you! Of course, the books they've...
11 likes · 7 comments
No trivia or quizzes yet. Add some now »