# Python for Data Analysis: Data Wrangling with Pandas, Numpy, and Ipython

by Wes McKinney

"Python for Data Analysis" is concerned with the nuts and bolts of manipulating, processing, cleaning, and crunching data in Python. It is also a practical, modern introduction to scientific computing in Python, tailored for data-intensive applications. This is a book about the parts of the Python language and libraries you'll need to effectively solve a broad set of data
...more

ebook, 470 pages

Published October 8th 2012 by O'Reilly Media (first published December 30th 2011)

## Community Reviews

(showing 1-30 of 1,256)

*Pandas and NumPy in Action*

As the creator of the pandas project, a Python data analysis framework, Wes McKinney is well placed to write this book. His experience and vision for the pandas framework are clear, and he explains the main functions and inner workings of both pandas and another package, NumPy, very well.

Although the title of the book suggests a broad look at the Python language for data analysis, McKinney almost exclusively focuses on an in ...more

Python, like most interpreted languages, is slow compared to ...more

Aug 24, 2012 · Rob rated it 3 of 5 stars · review of another edition

Recommends it for: folks doing data analysis that have already decided to use Python

I did copy editing on this book, so my review is of an unfinished (but close to finished) version. That being said: McKinney is the principal author on pandas, a Python package for doing data transformation and statistical analysis. The book is largely about pandas (and NumPy), but also delves into general methodologies for munging data and performing analytical operations on them (e.g., normalizing messy data and turning it into graphs and tables); he also delves into some (semi) esoteric infor
...more

As well as Pandas you'll cover IPython, NumPy and Matplotlib in enough depth to get you started with data analysis and visualization.

You don't need to be a Python expert, but some Python knowledge and some experience of R will definitely help.

The book is well structured, breaking down the different topics into well defined chapters which deal with topi ...more

The relatively new windowing functions added in SQL Server 2012 let you do even fancier analysis (at the risk of needing to understand some new syntax).

Yet, sometimes, a raw table of SQL results just isn't enough. You mi ...more

Didn't read the last three chapters on time series, financial data analysis and advanced NumPy.

IPython notebooks are available here, forked from the official repository of the book.

As documentation for Pandas alone, this book is useful.

But it wasn't quite what I was expecting. I was expecting less tutorial and more case studies - taking meaningful datasets (instead of makey-upy ones) and using pandas and other tools to pose and answer questions. For me, this would have made the book a much more practical resource.

One can find fault, but it is worth remembering: at the moment pandas de facto has no alternatives for data analysis in Python, and as a tool it ...more

This book is a great introduction to pandas (it's written by the main author of pandas) as well as an introduction to NumPy. Great read.

My only major issue is that the content will become more outdated with each passing edition. Pandas is rapidly developing. This is an unfair reason to remove a star; the second reason is the b ...more

It focuses heavily on pandas and the myriad things you can do with a DataFrame. Very often the examples are extremely specific, yet the example data is contrived, like "here's this rather specific case in which you want to average a subset of a column in a table, but only those cases where the person linked to the index of that column has the astrological sign Pisces", and about 5 minutes later, I had already forgotten how the author did it. ...more
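For readers wondering what the kind of operation described in this review looks like, here is a minimal sketch of averaging a column over only the rows that match a condition. It is not taken from the book: the table, the column names (`sign`, `score`), and the values are all invented for illustration.

```python
import pandas as pd

# Hypothetical data: the people, signs, and scores are made up.
people = pd.DataFrame(
    {
        "sign": ["Pisces", "Aries", "Pisces", "Leo"],
        "score": [10.0, 20.0, 30.0, 40.0],
    },
    index=["ann", "bob", "cy", "dee"],
)

# Boolean mask: True for the rows whose sign is Pisces.
is_pisces = people["sign"] == "Pisces"

# Select the "score" column for those rows only, then average it.
pisces_mean = people.loc[is_pisces, "score"].mean()
print(pisces_mean)  # 20.0
```

The mask-then-aggregate pattern is one common way to do this; `groupby("sign")` would be a reasonable alternative when the average is wanted for every sign at once.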

*Python for Data Analysis*. It's well written and covers a broad range of topics that you'll need when importing, manipulating, aggregating, calculating, or plotting data.

The book covers mostly pandas, doesn't give much information on NumPy and matplotlib, and says nothing at all about SciPy, all of which are more essential for scientific computing as far as I understand the topic.

On the other hand I'm sure that I will use what I've learned here soon, but only after reading more comprehensive information about the whole scipy stack ...more

It's clearly written and well-edited and organized, just like other O'Reilly books. Even if you have no time to learn this yourself, buy it for your lab so your grad students will be more productive. ;-)

Python for Data Analysis would be a good first stepping stone...

Read the full review at: http://udooz.net/blog/2012/12/book-re...

Jan 26, 2015 · Alex rated it 5 of 5 stars · review of another edition

Shelves: technical-reference, influential

Indispensable and quite comprehensive. Slightly out of date at this point as pandas has matured since its publication, but this book will undoubtedly be a reference returned to repeatedly. Easy-to-understand code examples, and, more importantly, explanation of why one would want to apply the various capabilities described by them.

It is certainly not for Python beginners. I also got stuck on some mind-boggling syntax and financial jargon. But the beauty of pandas is still right there to see. It is only with pandas that I dare to take on some projects that were previously considered too cumbersome to tackle.
